Skip to content

Pull requests: hiyouga/LLaMA-Factory

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

support new processor arg video_maxlen_ttl pending This problem is yet to be addressed
#7810 opened Apr 22, 2025 by Luffy-ZY-Wang Draft
1 of 2 tasks
Distributed improvement of Muon implementation
#7808 opened Apr 22, 2025 by tianshijing Loading…
1 task
fit the cases of mmdata in sys_msg
#7694 opened Apr 12, 2025 by Luffy-ZY-Wang Loading…
1 of 2 tasks
在setup.py 中修改了deepspeed在内的几个包,现在可以从某个checkpoint继续训练了 pending This problem is yet to be addressed
#7609 opened Apr 5, 2025 by ZevineXu Loading…
2 tasks done
Update PR #3976: Add dataset % sampling with equal distribution pending This problem is yet to be addressed
#7445 opened Mar 23, 2025 by Katehuuh Loading…
1 task done
updated phi4 template-update sync-unsloth-bugs pending This problem is yet to be addressed
#7413 opened Mar 21, 2025 by ankitcoder123 Loading…
Support decision tasks by providing reward wrapper for Gym-like RL environment pending This problem is yet to be addressed
#7347 opened Mar 17, 2025 by MA-Wenhui Loading…
add Sequence Parallelism (reopened #6506) pending This problem is yet to be addressed
#7338 opened Mar 17, 2025 by HaoshengZou Loading…
Use Unsloth FastVisionModel for VLM
#7295 opened Mar 13, 2025 by PLoic Draft
This change adds support for Intel Gaudi HPUs. pending This problem is yet to be addressed
#7275 opened Mar 12, 2025 by emascarenhas Loading…
refactor(data): refactor train mask, supports more fine-grained mask for ShareGPT pending This problem is yet to be addressed
#7264 opened Mar 12, 2025 by zzc0430 Loading…
2 tasks done
ProTip! Updated in the last three days: updated:>2025-04-24.