[V1] AsyncLLM data parallel #13923
Merged
Commits
65 commits
9ca44ce  [V1] AsyncLLM data parallel WIP (njhill)
3f51611  Handle pausing loop (njhill)
d8c591e  More single-node updates (njhill)
65e225d  some cleanup (njhill)
5ce57b6  fix up utility methods (njhill)
a3f1102  revert config check (njhill)
a66fb01  fixes (njhill)
67672c2  cleanup (njhill)
cf52fbf  fixes (njhill)
a4ec81b  reconcile with LLMEngine DP in decoupled engine case (njhill)
292aa00  minor simplification (njhill)
4b62ffd  rework (njhill)
407c72e  class refactor (njhill)
31bf7ea  fix (njhill)
fde51ce  adjust core engine init (njhill)
d5a3e68  Merge remote-tracking branch 'refs/remotes/origin/main' into multi-en… (njhill)
6d89a1b  fix new typing (njhill)
448abd9  fix :facepalm: (njhill)
a1e513e  bind socket first (njhill)
50cf64c  do you have to let it linger (njhill)
f365998  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)
b2571f0  add comments (njhill)
32c6f24  aggregate stats (njhill)
9c30cd7  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)
672d07e  Fix test (njhill)
dea382b  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)
d24a626  fix and minor cleanup (njhill)
cd03c80  Add CI test (njhill)
f1004b7  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)
d3298fa  Some simplification and fixes (njhill)
74dde48  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)
5fe1b75  address @markmc's stats suggestion (njhill)
648659f  address @tms's arg comment (njhill)
119d1ec  fix utility method breakage (njhill)
55328ee  rename AsyncMPClient output_processor to output_handler (njhill)
4f5330e  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)
48770ec  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)
d229f4d  Fix (njhill)
2f91cc4  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)
518047a  Remove redundant logic related to removed stats aggregation (njhill)
cb2b099  Fixes (njhill)
ff1137a  Merge remote-tracking branch 'refs/remotes/origin/main' into multi-en… (njhill)
61f4fcb  fix issue from main merge (njhill)
44874c2  remove leftover unused field (njhill)
66fc582  Fix offline DP compatibility (njhill)
7764466  Add timeout to data_parallel.py (njhill)
51e8bf0  Merge remote-tracking branch 'refs/remotes/origin/main' into multi-en… (njhill)
f692c12  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)
47b5e1c  Enable less-frequent all-reduce optimization (njhill)
f226139  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)
af47920  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)
693c521  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)
6e131e3  clean distributed shutdown (njhill)
d9ac856  address misc loose-ends (njhill)
3abbdef  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)
b18417e  further tweaks (njhill)
56b2b78  Merge remote-tracking branch 'refs/remotes/origin/main' into multi-en… (njhill)
05ab310  Additional debug (njhill)
5295c34  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)
4f897b8  Address review comments on tests (njhill)
62f32ed  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)
771ccf1  Fix env var fallback (njhill)
05a0e83  Fix test supports_v1 check (njhill)
bc41b13  Fix yapf :facepalm: (njhill)
ccecb42  Merge remote-tracking branch 'origin/main' into multi-engine (njhill)