generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.4k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix(grpo-trainer): init self.args before use
#4801
opened Jan 9, 2026 by
carlyou
Loading…
1 of 5 tasks
Monkey patch for
HybridCache in Liger-Kernel with transformers v5
#4798
opened Jan 9, 2026 by
qgallouedec
Loading…
Refactor KTO coordinated with DPO [a/N]: Remove encoder-decoder support
#4792
opened Jan 8, 2026 by
albertvillanova
Loading…
Temporarily Work Around init_communicator self.device Initialization Failure with vLLM-Ascend Server and TRL 0.26.2 in Training
#4789
opened Jan 8, 2026 by
ShareableXue
Loading…
Refactor KTO [3/N]: Extract dataset processing to _prepare_dataset method
#4788
opened Jan 8, 2026 by
albertvillanova
Loading…
Refactor KTO [2/N]: Improve config validation in KTOConfig
#4787
opened Jan 8, 2026 by
albertvillanova
Loading…
add support for GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
#4785
opened Jan 7, 2026 by
nbasyl
Loading…
feat(sft): add generation-based evaluation support to SFTTrainer
#4768
opened Jan 2, 2026 by
CodersAcademy006
Loading…
Add a config to limit the number of tool calling iterations.
#4761
opened Dec 29, 2025 by
pramodith
Loading…
4 of 5 tasks
fix: handle None eval_dataset in example code
#4756
opened Dec 27, 2025 by
ciaoyizhen
Loading…
1 of 4 tasks
perf: avoid output_hidden_states when only last_hidden_state is used
#4755
opened Dec 27, 2025 by
ciaoyizhen
Loading…
2 of 5 tasks
Fix GRPO
scale_rewards type specification to fix __post_init__ validation
#4752
opened Dec 26, 2025 by
apalmas-saifh
Loading…
1 of 5 tasks
Clarify Accelerate usage in SFTTrainer documentation
#4744
opened Dec 23, 2025 by
Likhita-17
Loading…
1 task done
[GRPOTrainer]: Agent Training Supports Async Tool Calls
#4742
opened Dec 23, 2025 by
pramodith
Loading…
5 tasks done
feat: Bidirectional masked importance sampling ratio (MIS) for IcePop
#4732
opened Dec 20, 2025 by
casinca
Loading…
5 tasks
Previous Next
ProTip!
no:milestone will show everything without a milestone.