🌀 Fix GRPO default completion length doc (#2662)
andyl98 authored Jan 26, 2025
1 parent 4720656 commit 55a329e
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion trl/trainer/grpo_config.py
@@ -48,7 +48,7 @@ class GRPOConfig(TrainingArguments):
             Number of generations per prompt to sample.
         temperature (`float`, *optional*, defaults to `0.9`):
             Temperature for sampling. The higher the temperature, the more random the completions.
-        max_completion_length (`int` or `None`, *optional*, defaults to `None`):
+        max_completion_length (`int` or `None`, *optional*, defaults to `256`):
             Maximum length of the generated completion.

         > Parameters that control the training
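
The change above brings a documented default back in line with the code. As a minimal sketch of how such drift could be caught automatically, the snippet below compares the `defaults to` values in a docstring against the dataclass field defaults. `ToyGRPOConfig`, `DOC_DEFAULT`, and `find_doc_mismatches` are hypothetical names made up for illustration. They are not part of TRL, and the stand-in class assumes the code default is `256`, as the corrected docstring states.

```python
import re
from dataclasses import MISSING, dataclass, fields
from typing import Optional


@dataclass
class ToyGRPOConfig:
    """
    Hypothetical stand-in for illustration only; not TRL's GRPOConfig.

    temperature (`float`, *optional*, defaults to `0.9`):
        Temperature for sampling.
    max_completion_length (`int` or `None`, *optional*, defaults to `None`):
        Maximum length of the generated completion.
    """

    temperature: float = 0.9
    # Assumed code default; the docstring above still carries the stale value.
    max_completion_length: Optional[int] = 256


# Matches docstring lines such as:
#   name (`type`, *optional*, defaults to `value`):
DOC_DEFAULT = re.compile(r"^\s*(\w+) \(.*defaults to `([^`]*)`\):", re.MULTILINE)


def find_doc_mismatches(cls):
    """Return {field: (documented, actual)} where docstring and code disagree."""
    documented = dict(DOC_DEFAULT.findall(cls.__doc__ or ""))
    mismatches = {}
    for f in fields(cls):
        # Skip fields that are undocumented or have no plain default.
        if f.name not in documented or f.default is MISSING:
            continue
        actual = repr(f.default)
        if documented[f.name] != actual:
            mismatches[f.name] = (documented[f.name], actual)
    return mismatches


mismatches = find_doc_mismatches(ToyGRPOConfig)
print(mismatches)
```

Run against the stand-in class, the check flags only `max_completion_length`, the same field this commit corrects. A real check would need to handle defaults set through `field(default_factory=...)` and docstring formats other than the one assumed here.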