Made convert.py work with LLaMA 3 files distributed by meta #7568
Conversation
It should be noted that the
😭
Hmmm... as long as it doesn't break the legacy convert script, let's get this merged in quickly before we do the refactor.
That's not a reason to avoid doing a proper review. This looks useful, but there are some parts of it I'd like to review in more detail. Regarding the order of the merges, I think #7430 should be merged first, because it will be much easier to test this with the new file names and locations (because things will break, especially since
@Manaball123 Do not lose hope. Programming isn't easy, but that's what makes it fun 😉. @compilade If this is merged, it should be merged first, before PR #7430. I'm going to be a while. I did crack the vocab though. 🥲 My only issue with this PR is that the vocab wasn't properly handled. Other than that, I think it's a good start. It just needs some guidance.
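For context on why the vocab needs special handling: Meta's LLaMA 3 release ships a tiktoken-style BPE ranks file as tokenizer.model, instead of the binary SentencePiece protobuf used by LLaMA 2. A minimal sketch of reading such a file might look like the following (the function name and structure are illustrative, not the actual convert.py code):

```python
import base64

def load_llama3_vocab(path):
    # LLaMA 3's tokenizer.model is a plain-text tiktoken BPE ranks file:
    # each line is "<base64-encoded token> <rank>", unlike LLaMA 2's
    # binary SentencePiece protobuf. This sketch maps token bytes -> rank.
    vocab = {}
    with open(path, "r", encoding="utf-8") as f:
        for line in f:
            if not line.strip():
                continue
            token_b64, rank = line.split()
            vocab[base64.b64decode(token_b64)] = int(rank)
    return vocab
```

A real converter would additionally need to handle the special tokens (e.g. BOS/EOT) that LLaMA 3 defines outside this ranks file.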
You could probably merge
Heads up that #7430 has merged, so this PR will need to be refactored.
Force-pushed from 4f64f7e to 5921b8f
Uh oh, I think I messed something up :(
To fix this, check your
Sorry for the inconvenience, made a new PR here
The fix is overall pretty hacky and not very clean, but it does work and, as far as I know, doesn't have any of the previous vocab issues.
Improvements such as not relying on another script could be made, and detection for LLaMA 3 is basically nonexistent, but if someone just wants to get things working, this could be applicable.
A simple math question test can be seen here (CPU inference; I messed up the CUDA build):
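On the missing detection: one possible heuristic is to inspect tokenizer.model itself, since LLaMA 2's is a binary SentencePiece protobuf while LLaMA 3's is a plain-text tiktoken ranks file. The sketch below is illustrative only, and not the detection logic used by convert.py:

```python
import base64

def looks_like_llama3_tokenizer(path):
    # Heuristic sketch: a tiktoken BPE ranks file decodes as UTF-8 text
    # and its first line looks like "<base64 token> <rank>"; a binary
    # SentencePiece protobuf generally fails one of these checks.
    with open(path, "rb") as f:
        head = f.read(256)
    try:
        lines = head.decode("utf-8").splitlines()
    except UnicodeDecodeError:
        return False  # binary data -> likely SentencePiece
    if not lines:
        return False
    parts = lines[0].split()
    if len(parts) != 2:
        return False
    try:
        base64.b64decode(parts[0], validate=True)
        int(parts[1])
    except ValueError:  # binascii.Error subclasses ValueError
        return False
    return True
```

This only reads the first 256 bytes, so it stays cheap; a more robust check might also look at config.json metadata shipped alongside the weights.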