AttributeError: 'Qwen2ForCausalLM' object has no attribute 'optimizer' during GRPO training with ZERO-3 #2782

Co1lin · 2025-02-06T05:57:07Z

Reproduction

See here, basically training qwen-2.5 models with GRPO using the latest trl library.

It is also mentioned here.

System Info

trl  0.15.0.dev0
transformers  4.49.0.dev0
deepspeed  0.16.3

Checklist

I have checked that my issue isn't already filed (see open issues)
I have included my system information
Any code provided is minimal, complete, and reproducible (more on MREs)
Any code provided is properly formatted in code blocks, (no screenshot, more on code blocks)
Any traceback provided is complete

The text was updated successfully, but these errors were encountered:

Co1lin changed the title ~~AttributeError: 'Qwen2ForCausalLM' object has no attribute 'optimizer' during DRPO training~~ AttributeError: 'Qwen2ForCausalLM' object has no attribute 'optimizer' during GRPO training with ZERO-3 Feb 6, 2025

github-actions bot added 🐛 bug Something isn't working 🏋 GRPO Related to GRPO labels Feb 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AttributeError: 'Qwen2ForCausalLM' object has no attribute 'optimizer' during GRPO training with ZERO-3 #2782

AttributeError: 'Qwen2ForCausalLM' object has no attribute 'optimizer' during GRPO training with ZERO-3 #2782

Co1lin commented Feb 6, 2025

AttributeError: 'Qwen2ForCausalLM' object has no attribute 'optimizer' during GRPO training with ZERO-3 #2782

AttributeError: 'Qwen2ForCausalLM' object has no attribute 'optimizer' during GRPO training with ZERO-3 #2782

Comments

Co1lin commented Feb 6, 2025

Reproduction

System Info

Checklist