-
-
Notifications
You must be signed in to change notification settings - Fork 10.8k
Remove unused kwargs from model definitions #13555
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 37 commits
Commits
Show all changes
40 commits
Select commit
Hold shift + click to select a range
28c7f27
Remove `kv_cache` and `attn_metadata` from `Attention`
hmellor 1fe2b0d
Remove `attn_metadata` from `MambaMixer` 1 & 2
hmellor 153d253
Remove `kv_caches` and `attn_metadata` from `forward` call
hmellor eb30940
Remove `kv_caches` and `attn_metadata` from new model docs
hmellor 7a75753
Remove `kv_caches` and `attn_metadata` from model interface
hmellor 7ddfd1f
Remove args from a batch of models
hmellor f8794e9
Remove args from another batch of models
hmellor f81cad0
Remove `attn_metadata` from a couple more places
hmellor 6beb1b1
Attempt fix HPU model runner
hmellor c784070
Update CPU model runners
hmellor 72450ae
Update V1 GPU model runner
hmellor fdda9c6
Update draft model runner
hmellor f9a1ee8
Update enc dec model runner
hmellor b91538a
Update remaining non-device model runners
hmellor 59f01be
Allow `kv_caches` to be passed to `execute_model`
hmellor 778910f
Update XPU model runner
hmellor c7cd852
Update V1 GPU model runner
hmellor 334d2b3
Update OpenVINO model runner
hmellor 0735ed9
Update Neuron model runner
hmellor 5a8a73d
Add unused `kv_caches` arg to runners to limit scope of PR
hmellor 3b9a35b
Update TPU V0 and V1
hmellor bb094d2
Update HPU model runner
hmellor 46d8fab
Make `kv_caches` optional in `HPUModelRunner.execute_model`
hmellor 39ad6d4
Make linter happy
hmellor 164ee32
Fix whisper test
hmellor f6c8e2a
Add `kv_caches` back to remaining `*ModelRunner.execute_model()`
hmellor c917880
Fix kernel tests
hmellor 6a29698
Kick CI
hmellor cd1e845
Merge branch 'main' into remove-unused-attn-args
hmellor f8b4d36
Fix missing import
hmellor 39742a3
Fix call to `execute_model` in encoder decoder model runner
hmellor cc087b0
Fix call to `execute_model` in XPU model runner
hmellor 6f703ba
Fix call to `execute_model` in multi-step model runner
hmellor d0ee431
Fix V1 TPU model runner
hmellor 29cff77
Fix multi-step model runner
hmellor 7e0c808
Merge branch 'main' into remove-unused-attn-args
hmellor 5d84b99
Deprecate args in `Attention.forward` instead
hmellor 8925e30
Revert "Deprecate args in `Attention.forward` instead"
hmellor b7ec2d9
Merge branch 'main' into remove-unused-attn-args
hmellor a775d1c
Fix `mllama` KV cache access
hmellor File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.