-
Notifications
You must be signed in to change notification settings - Fork 31.3k
Fix MoE for V5 #42456
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Fix MoE for V5 #42456
Changes from all commits
Commits
Show all changes
46 commits
Select commit
Hold shift + click to select a range
c480438
remove zero_like + scatter
3outeille c615c47
Merge branch 'main' into fix-moe-v5
3outeille 073326f
fix mixtral moe
3outeille 8ff6c18
Merge branch 'fix-moe-v5' of https://github.com/huggingface/transform…
3outeille f3457e2
fix other moe models as well
3outeille 16737a4
fix ci
3outeille 01da12d
Merge branch 'main' into fix-moe-v5
3outeille 57541cd
fix modular mixtral
3outeille b7eb918
Merge branch 'fix-moe-v5' of https://github.com/huggingface/transform…
3outeille 3992748
fix qwen2_moe + qwen3_next
3outeille 15f41b9
fix device mismatch for qwen3_vl_moe to pass tests
3outeille 35e8bf8
fix modular mixtral
3outeille e6f026f
fix other models
3outeille 14b7ac0
rm slow tokenizers (#40936)
itazap ec3f555
[loading/saving] Reverse all loading operations when saving (#42396)
Cyrilvallez 326eb75
Fix T5 tests: use generation_config for generation parameters (#42419)
Abdennacer-Badaoui 50cc1e9
Merge branch 'main' into fix-moe-v5
3outeille 8bccd8c
linting
3outeille 74e84d5
Merge branch 'fix-moe-v5' of https://github.com/huggingface/transform…
3outeille 718cc64
more fix to pass the CI tests
3outeille 19db8c9
Merge branch 'main' into fix-moe-v5
3outeille 1100864
fix lfm2 moe
3outeille 7d024b9
Merge branch 'fix-moe-v5' of https://github.com/huggingface/transform…
3outeille e6f82dc
Merge branch 'main' into fix-moe-v5
3outeille e982a15
fix docstring
3outeille 98703cc
Merge branch 'fix-moe-v5' of https://github.com/huggingface/transform…
3outeille 84bb660
Merge branch 'main' into fix-moe-v5
3outeille 3b14e7b
fix docstring
3outeille 0ac90c8
Merge branch 'fix-moe-v5' of https://github.com/huggingface/transform…
3outeille 5e4e7de
Merge branch 'main' into fix-moe-v5
3outeille 0399e13
fix qwen like model
3outeille af29eee
fix flex olmo
3outeille bf66927
revert lfm2 moe config
3outeille 4d6e993
Merge branch 'main' into fix-moe-v5
3outeille 144ec86
Merge branch 'main' into fix-moe-v5
3outeille ede2116
make fixup
3outeille 3132b5f
fix docstring
3outeille 2e04f12
fix conversion mapping
3outeille 61d1b87
Merge branch 'main' into fix-moe-v5
3outeille cdb3eb1
fix inference of gpt-oss
3outeille 5edd375
add some fixes to gpt-oss (but still not good)
3outeille 8cc40f0
Merge branch 'main' into fix-moe-v5
3outeille a02f8bf
fix modular
ArthurZucker d213808
we need errors I think
ArthurZucker 1317b4d
fix config issue
ArthurZucker 51d4b52
this was fixed
ArthurZucker File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nice