nvidia uses the LLaMAForCausalLM string in their config.json, example… #9485

csabakecskemeti · 2024-09-14T19:04:30Z

Nvidia uses the LLaMAForCausalLM string in their config.json so though there's support for LlamaForCausalLM
@Model.register("LlamaForCausalLM", "MistralForCausalLM", "MixtralForCausalLM")
the conversion failes on the case.

I've added the LLaMAForCausalLM

Example models with this arch string:

nvidia/Llama3-ChatQA-2-8B
nvidia/Llama3-ChatQA-2-70B
I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

… nvidia/Llama3-ChatQA-2-8B

Co-authored-by: Csaba Kecskemeti <csabakecskemeti@Csabas-Mac-Pro.local>

nvidia uses the LLaMAForCausalLM string in their config.json, example…

aaf7f53

… nvidia/Llama3-ChatQA-2-8B

github-actions bot added the python python script changes label Sep 14, 2024

ggerganov approved these changes Sep 15, 2024

View reviewed changes

ggerganov merged commit 3c7989f into ggerganov:master Sep 15, 2024
9 checks passed

dsx1986 pushed a commit to dsx1986/llama.cpp that referenced this pull request Oct 29, 2024

py : add "LLaMAForCausalLM" conversion support (ggerganov#9485)

575edd4

Co-authored-by: Csaba Kecskemeti <csabakecskemeti@Csabas-Mac-Pro.local>

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024

py : add "LLaMAForCausalLM" conversion support (ggerganov#9485)

217ff68

Co-authored-by: Csaba Kecskemeti <csabakecskemeti@Csabas-Mac-Pro.local>

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024

py : add "LLaMAForCausalLM" conversion support (ggerganov#9485)

ae4b72a

Co-authored-by: Csaba Kecskemeti <csabakecskemeti@Csabas-Mac-Pro.local>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nvidia uses the LLaMAForCausalLM string in their config.json, example… #9485

nvidia uses the LLaMAForCausalLM string in their config.json, example… #9485

csabakecskemeti commented Sep 14, 2024

nvidia uses the LLaMAForCausalLM string in their config.json, example… #9485

nvidia uses the LLaMAForCausalLM string in their config.json, example… #9485

Conversation

csabakecskemeti commented Sep 14, 2024