
Fix documentation for converting SFT/DPO weights back to HF Llama #1318

Merged

Conversation

jacobthebanana
Contributor

Quick documentation fix.

In the documentation post-training/README.md, the base weights were converted from HF Llama:

python tools/ckpts/convert_hf_llama_to_neox.py --tp 4 --model meta-llama/Meta-Llama-3-8B-Instruct --model_path checkpoints/neox_converted/llama3-8b-instruct

When converting fine-tuned weights from GPTNeoX format back to HF Llama, the --architecture llama flag appears to be required; otherwise the default value "neox" is selected. While this flag is specified in the RM conversion command, it is missing from the command for converting GPTNeoX SFT/DPO weights back to HF in this documentation.

https://github.com/EleutherAI/gpt-neox/blob/59a5236ddaf721890e3d6ef98fb8ca66c2266ce0/post-training/README.md?plain=1#L53-56

def convert(
input_checkpoint_path,
loaded_config,
output_checkpoint_path,
sequential: bool = True,
precision: Literal["auto", "fp16", "bf16", "fp32"] = "auto",
architecture: Literal["neox", "llama", "mistral"] = "neox",
is_rm: bool = False,
pad_token_id: int = -1,
):
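The pitfall can be sketched with a minimal stand-in for the signature above (this is not the real conversion code, just an illustration of the default-argument behavior):

```python
from typing import Literal

# Minimal sketch mirroring the convert() signature quoted above, not the
# actual implementation: if the caller never passes --architecture, the
# default "neox" is used, so Llama weights get converted with the wrong
# layout and the conversion fails (see issue #1317).
def convert(
    architecture: Literal["neox", "llama", "mistral"] = "neox",
) -> str:
    return architecture

assert convert() == "neox"          # flag omitted: silently treated as NeoX
assert convert("llama") == "llama"  # explicit flag selects the Llama path
```

This is why the SFT/DPO conversion command in the README must pass --architecture llama explicitly, just as the RM conversion command already does.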

Fixes #1317

@Quentin-Anthony Quentin-Anthony merged commit 6552654 into EleutherAI:main Nov 13, 2024
1 check passed
jahatef pushed a commit that referenced this pull request Nov 29, 2024

Successfully merging this pull request may close these issues.

KeyError when converting DPO weights from GPTNeoX format to HuggingFace Llama in post-training documentations