[FEAT] [ROCm] Enabling AITER Kernel #14007
Closed
+1,168
−266
39 commits
09133e9 integrate aiter kernels: Linear, Norm, MOE (vllmellm)
ead17c7 maintain a consistent import pattern (vllmellm)
2527956 add aiter fp8 block scaled moe kernel (vllmellm)
814702a bugfix: fix import paths and wrong env variables (vllmellm)
024cfc5 rename importing module names from amd/rocm aiter package to avoid c… (vllmellm)
7cfe429 bugfixe on wrong env variable spelling an add missing statment cond… (vllmellm)
41e7e4f enabled VLLM_ROCM_USE_AITER in unit-tests (vllmellm)
5f668ea include the AMD AITER package in rocm_base docker file (vllmellm)
8c5eb52 integrate AITER paged attention (vllmellm)
77cb436 bugfixes and disable rocm aiter paged attention (vllmellm)
942aa5b Merge remote-tracking branch origin/main into aiter-integration (vllmellm)
4c41781 revert back the custom pa condition (vllmellm)
c09a740 enable AITER tgemm.mm per tensor scaled mm unittest (tjtanaa)
e19b7f5 bugfix: shuffle the weights when using aiter fmoe block scaled kernel (vllmellm)
11ac580 fix environment wrong variable in unit tests (vllmellm)
0865124 add aiter block gemm kernel and refactor aiter envs conditions (tjtanaa)
623dadb add dispatch tests (vllmellm)
459bb02 add dispatch tests (vllmellm)
acc27ff add dispatch tests (vllmellm)
11b6aba bugfixes in layernorm and fix spelling mistakes (vllmellm)
0a6b8a0 enable rocm aiter paged attention (vllmellm)
1474828 bugfix: add the missing argument in dispatch (vllmellm)
b78114a update rocm AITER commit version (vllmellm)
d20d757 bug fix (vllmellm)
7754c2e add more comments for code documentation (vllmellm)
5e31c3e disable some model tests (vllmellm)
d21c912 move rocm-aiter env flag checks to vllm.platforms.current_platform (vllmellm)
59f0208 bugfixes after refactoring the aiter modules enablility in current pl… (vllmellm)
17b4d6a update AMD CI to skip certain test cases (vllmellm)
c32c31f refactor dispatching for w8a8 scaled-mm (vllmellm)
e412824 Merge remote-tracking branch 'origin/main' into aiter-integration (vllmellm)
a5d7339 fix cutlass flag bug (vllmellm)
ce30f63 revert test requirements (vllmellm)
375e9db revert test requirements (vllmellm)
fd3f4e3 revert test requirements (vllmellm)
f64bfe0 addressing PR comment reviews: fix isort ignores, revert back missing… (vllmellm)
c1297e5 add missing comment in fp8_utils (vllmellm)
529714c Merge remote-tracking branch 'origin/main' into aiter-integration (vllmellm)
3859abc Merge remote-tracking branch 'origin/main' into aiter-integration (vllmellm)
Will this cause an import error on non-ROCm platforms?
No. The rocm_aiter_* functions are wrappers that are only called when the ROCm platform is detected and the AITER-specific env vars are set to True.
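To illustrate the pattern described in this reply, here is a minimal sketch, not the code from this PR. It assumes the wrapper defers its import of the aiter package to call time and that a dispatcher checks both the platform and the flag first. The helper names (is_rocm, use_aiter, reference_rms_norm), the SKETCH_IS_ROCM variable, the way VLLM_ROCM_USE_AITER is parsed, and the aiter.rms_norm call signature are all assumptions made for illustration.

```python
import math
import os


def is_rocm() -> bool:
    # Hypothetical platform probe; a stand-in for the project's real
    # platform detection. SKETCH_IS_ROCM is an invented variable.
    return os.environ.get("SKETCH_IS_ROCM", "0") == "1"


def use_aiter() -> bool:
    # VLLM_ROCM_USE_AITER is the flag named in the commit list above;
    # the accepted values here are an assumption.
    flag = os.environ.get("VLLM_ROCM_USE_AITER", "0").lower()
    return is_rocm() and flag in ("1", "true")


def reference_rms_norm(x, weight, eps):
    # Plain-Python fallback: scale x by 1 / sqrt(mean(x^2) + eps).
    mean_sq = sum(v * v for v in x) / len(x)
    scale = 1.0 / math.sqrt(mean_sq + eps)
    return [v * scale * w for v, w in zip(x, weight)]


def rocm_aiter_rms_norm(x, weight, eps):
    # The import lives inside the wrapper, so merely importing this
    # module never touches the aiter package on other platforms.
    import aiter  # only reached when use_aiter() is True

    # Assumed call shape; check the aiter package for the real signature.
    return aiter.rms_norm(x, weight, eps)


def rms_norm(x, weight, eps):
    # Dispatcher: the wrapper is only called behind the combined check.
    if use_aiter():
        return rocm_aiter_rms_norm(x, weight, eps)
    return reference_rms_norm(x, weight, eps)
```

On a machine without ROCm, use_aiter() returns False, rms_norm takes the fallback path, and the import statement inside the wrapper never runs.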
@vllmellm @tjtanaa Please rebase to resolve the conflict, and then we will add the "ready" label to finalize the review. Thank you!