I've been merging LoRAs into quantized models for a while now with export_lora and have had good results. The models merge cleanly and performance appears to improve. Converting a LoRA to GGUF and then applying it to a model also results in a working model.
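For context, the merge itself is just the standard LoRA weight update, W' = W + (alpha / r) * (B @ A). Here's a minimal NumPy sketch of that math (not llama.cpp's actual export_lora code, and the shapes are toy values I picked for illustration):

```python
import numpy as np

def merge_lora(W: np.ndarray, A: np.ndarray, B: np.ndarray,
               alpha: float, r: int) -> np.ndarray:
    """Apply a LoRA update to a (dequantized) base weight matrix.

    A has shape (r, in_features), B has shape (out_features, r),
    so B @ A is a low-rank update with the same shape as W.
    """
    scale = alpha / r
    return W + scale * (B @ A)

# Toy shapes: a 64x64 weight with a rank-8 adapter.
rng = np.random.default_rng(0)
W = rng.standard_normal((64, 64)).astype(np.float32)
A = rng.standard_normal((8, 64)).astype(np.float32)
B = np.zeros((64, 8), dtype=np.float32)  # B is zero-initialized in training
W_merged = merge_lora(W, A, B, alpha=16.0, r=8)
```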
The same can't be said for Falcon. All Falcon tunes are released as PEFT adapters, and the base model is simply too large to download in FP16; it runs to several hundred GB unless quantized.
I applied PR #3333 and can successfully convert the LoRA to GGUF, then use export_lora to merge. However, the resulting models emit repeating gibberish and throw SentencePiece errors when used with HF sampling.
Looking over the code, I can't find anything llama-specific in it. Has anyone been able to load a LoRA onto any Falcon model, either live or as a merge? Any ideas about what's wrong?
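One sanity check I can think of: compare the tensor names in the converted LoRA against the base Falcon GGUF, since a naming mismatch (Falcon fuses QKV into a single tensor, unlike llama) would let the merge silently misapply. A rough sketch using the gguf Python package that ships with llama.cpp; the file names and the .loraA/.loraB suffix convention here are assumptions on my part, so adjust to whatever the converted file actually contains:

```python
from gguf import GGUFReader

def tensor_names(path: str) -> set[str]:
    """Collect all tensor names from a GGUF file."""
    return {t.name for t in GGUFReader(path).tensors}

base = tensor_names("falcon-base.gguf")  # placeholder paths
lora = tensor_names("falcon-lora.gguf")

# Assuming adapter tensors carry a .loraA / .loraB suffix: strip it and
# verify each target actually exists in the base model.
targets = {n.rsplit(".lora", 1)[0] for n in lora}
missing = targets - base
print("adapter targets missing from base:", sorted(missing) or "none")
```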