generated from fastai/nbdev_template
Pull requests: huggingface/trl
Pull requests list
Fix docstring interlink to parent class for NashMDTrainer and XPOTrainer
#4179 opened Sep 30, 2025 by albertvillanova
Have vLLM return processed (temperature scaled) log probs
#4163 opened Sep 29, 2025 by YonatanGideoni
🧺 [5/N] Refactor _generate in GRPO/RLOO: Insert images in the prompt
#4155 opened Sep 26, 2025 by qgallouedec
5 tasks
🧺 [4/N] Refactor _generate in GRPO/RLOO: Move forward_kwargs outside generation method
#4154 opened Sep 26, 2025 by qgallouedec
5 tasks
🧺 [3/N] Refactor _generate in GRPO/RLOO: Rely on generator for prompt truncation
#4153 opened Sep 26, 2025 by qgallouedec
5 tasks
🧺 [2/N] Refactor _generate in GRPO/RLOO: Use prompt_ids from generation
#4152 opened Sep 26, 2025 by qgallouedec
5 tasks
🧺 [1/N] Refactor _generate in GRPO/RLOO: list of ints instead of tensors
#4146 opened Sep 26, 2025 by qgallouedec
Disable cache in compute_loss only when checkpointing is enabled
#4133 opened Sep 24, 2025 by cyyever
update guided decoding param to structured outputs
#4117 opened Sep 22, 2025 by jiqing-feng
🎞️ Support sequence classification models in clone_chat_template
#4097 opened Sep 16, 2025 by qgallouedec
feat:add support for 'image_grid_thw'(QwenVL) in DPOTrainer
#4091 opened Sep 15, 2025 by ycma8
2 of 5 tasks