Issues · huggingface/trl

[Tracking issue] General dataset support

#2071 opened Sep 15, 2024 by qgallouedec

Open

[Tracking issue] Integrate native liger-kernel losses

#2495 opened Dec 17, 2024 by qgallouedec

Open 4

[Tracking issue] Wrong loss scaling when accumulating gradient

#2617 opened Jan 23, 2025 by qgallouedec

Open

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

190 Open 1,243 Closed

⚡accelerate 🐛 bug ⚡ PEFT

#2781 opened Feb 6, 2025 by zhourunlong

5 tasks done

🐛 bug ⚡ PEFT

#2780 opened Feb 6, 2025 by zhangguoxin1

5 tasks done

✨ enhancement 🏋 GRPO

#2775 opened Feb 5, 2025 by cfpark00

🐛 bug 🏋 GRPO

#2774 opened Feb 5, 2025 by tyler-romero

5 tasks done

⚡accelerate ✨ enhancement 🏋 GRPO

#2768 opened Feb 5, 2025 by andrewsiah

5 tasks done

✨ enhancement 🏋 GRPO 🏋 Online DPO 🏋 Reward 🏋 RLOO

#2767 opened Feb 4, 2025 by xzuyn

🐛 bug 📚 documentation 🏋 PPO

#2764 opened Feb 4, 2025 by elliot-zzh

⚡accelerate ⚡ PEFT 🏋 Reward

#2758 opened Feb 4, 2025 by JohnGiorgi

5 tasks done

🐛 bug 🏋 GRPO ❓ question

#2752 opened Feb 3, 2025 by liranringel

feat(GRPOTrainer): reward_func return None to skip ✨ enhancement 🏋 GRPO

#2737 opened Feb 2, 2025 by ctjlewis

PLZ make padding_free for DataCollatorForChatML. ✨ enhancement 🏋 GKD 🙋 help from community wanted

#2736 opened Feb 2, 2025 by YooSungHyun

✨ enhancement 🏋 GRPO

#2734 opened Feb 2, 2025 by sunildkumar

🏋 GKD ❓ question

#2732 opened Feb 2, 2025 by YooSungHyun

5 tasks done

🐛 bug 🏋 GRPO

#2731 opened Feb 2, 2025 by abacaj

5 tasks done

🐛 bug

#2719 opened Jan 31, 2025 by JohnConnor123

5 tasks done

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issues: huggingface/trl

Issues list