
RuntimeError: Unsloth: llama.cpp GGUF seems to be too buggy to install. #479

Closed · Jiar opened this issue May 16, 2024 · 4 comments
Jiar commented May 16, 2024

Prerequisites:

```python
%%capture
# Installs Unsloth, Xformers (Flash Attention) and all other packages!
!pip install "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git"
!pip install --no-deps "xformers<0.0.26" trl peft accelerate bitsandbytes
```

Run code:

```python
if True: model.push_to_hub_gguf(hf_model_name, tokenizer, quantization_method = "q4_k_m", token = token)
```

Error:

```
/usr/bin/ld: unicode-data.cpp:(.text._ZNSt8multimapIjjSt4lessIjESaISt4pairIKjjEEEC2ESt16initializer_listIS4_ERKS1_RKS5_[_ZNSt8multimapIjjSt4lessIjESaISt4pairIKjjEEEC5ESt16initializer_listIS4_ERKS1_RKS5_]+0xbe): undefined reference to `std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)'
collect2: error: ld returned 1 exit status
make: *** [Makefile:956: vdot] Error 1
make: Leaving directory '/content/llama.cpp'
---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
<ipython-input-19-4b22232c01dc> in <cell line: 3>()
      1 # Save to q4_k_m GGUF
      2 #if True: model.save_pretrained_gguf(model_name, tokenizer, quantization_method = "q4_k_m")
----> 3 if True: model.push_to_hub_gguf(hf_model_name, tokenizer, quantization_method = "q4_k_m", token = token)
      4 
      5 # Save to 16bit GGUF

2 frames
/usr/local/lib/python3.10/dist-packages/unsloth/save.py in install_llama_cpp_old(version)
    783     # Check if successful
    784     if not os.path.exists("llama.cpp/quantize"):
--> 785         raise RuntimeError(
    786             "Unsloth: llama.cpp GGUF seems to be too buggy to install.\n"\
    787             "File a report to llama.cpp's main repo since this is not an Unsloth issue."

RuntimeError: Unsloth: llama.cpp GGUF seems to be too buggy to install.
File a report to llama.cpp's main repo since this is not an Unsloth issue.
```

[screenshot attached showing the same error output]

danielhanchen (Contributor) commented:

@Jiar Might be relevant: #476

Please try uninstalling `peft==0.11.0`, then installing `peft==0.10.0` as a temporary fix.
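For anyone following along, a minimal sketch of that temporary fix (package versions taken from the comment above; in Colab, prefix each command with `!` and restart the runtime afterwards, since peft is usually already imported):

```bash
# Temporary fix: pin peft back to 0.10.0
# -y skips the uninstall confirmation prompt
pip uninstall -y peft
pip install peft==0.10.0
```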

Jiar (Author) commented May 16, 2024

> @Jiar Might be relevant: #476
>
> Please try uninstalling `peft==0.11.0`, then installing `peft==0.10.0` as a temporary fix.

Thanks!

Jiar closed this as completed May 16, 2024
joytsay commented Jun 28, 2024

@danielhanchen @Jiar
For what it's worth, this `RuntimeError: Unsloth: llama.cpp GGUF seems to be too buggy to install.` happened again, and it was less of a hassle to first save a merged 16-bit Hugging Face model (the vLLM-compatible format):

```python
model.save_pretrained_merged("/your/awesome/model/", tokenizer, save_method = "merged_16bit",)
```

and then use llama.cpp's convert-hf-to-gguf.py to do the GGUF conversion:

```bash
cd unsloth/llama.cpp
python3 convert-hf-to-gguf.py /your/awesome/model/
```
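To end up with the same `q4_k_m` artifact the failing call was producing, one extra step is needed after the conversion above: llama.cpp's quantize tool. A sketch, assuming the llama.cpp checkout builds cleanly; the file names are placeholders, and `--outtype`/`--outfile` are convert-hf-to-gguf.py's standard flags:

```bash
# Convert the merged HF model to an f16 GGUF first
python3 convert-hf-to-gguf.py /your/awesome/model/ --outtype f16 --outfile model-f16.gguf
# Then quantize it down to Q4_K_M (the same scheme as quantization_method = "q4_k_m")
./quantize model-f16.gguf model-q4_k_m.gguf Q4_K_M
```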

danielhanchen (Contributor) commented:

@joytsay So sorry for the delay! Yes, sadly going through Unsloth's GGUF export can sometimes be more fragile than using llama.cpp directly - if the direct route works for you, then that's good!
