
Is there any way to save lora-converted model? #12

Closed
Adamska1008 opened this issue Apr 3, 2024 · 5 comments

@Adamska1008

I tried to fine-tune TinyLlama with this crate. I used candle-lora/candle-lora-transformers/examples/llama.rs to load model.safetensors and did the training, but eventually found that there is no way to save the model in safetensors format.

I tried to implement a save method myself by wrapping candle_core::safetensors::save(), but how can I get the weights of the LoRA part? All I can access is the raw model from before it was converted to a LoRA model.

For example, if you run /candle-lora-macro/examples/linear.rs and add println!("{:?}", model.a);, you will see it printed as a Linear struct, not a LoraLinear struct, and you cannot get ff_a/ff_b from model.a, even though the model has been converted to a LoRA model.
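
Roughly what I tried — a minimal sketch, assuming the weights have to be collected into a HashMap by hand (the tensor names and shapes here are placeholders):

```rust
use std::collections::HashMap;

use candle_core::{safetensors, DType, Device, Result, Tensor};

fn save_weights(path: &str) -> Result<()> {
    let device = Device::Cpu;

    // Placeholder tensors: in the real code these would be the model's weights,
    // but all I can reach are the original layers, not the LoRA a/b matrices.
    let mut tensors: HashMap<String, Tensor> = HashMap::new();
    tensors.insert(
        "a.weight".to_string(),
        Tensor::zeros((8, 10), DType::F16, &device)?,
    );

    // candle_core::safetensors::save writes a HashMap<String, Tensor> to a .safetensors file.
    safetensors::save(&tensors, path)
}
```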

@EricLBuehler
Owner

EricLBuehler commented Apr 3, 2024

This is implemented and fixed in #13, which has been merged. Please note that the weight naming is incompatible with peft at the moment. If this is a problem, please feel free to raise an issue and I will fix it.

@Adamska1008
Author


Thank you very much! I tried this and got a 536 KB safetensors file with the header:

{"lora_llamaa0.weight":{"data_offsets":[0,512000],"dtype":"F16","shape":[8,32000]},"lora_llamab0.weight":{"data_offsets":[512000,544768],"dtype":"F16","shape":[2048,8]}}

Is this as expected? I would also like to know how to apply the LoRA tensors after loading a VarBuilder from the original model.

@EricLBuehler
Owner

No, the prefix was incorrect, but it should be fixed now. To load the LoRA tensors, pass get_lora_model the VarBuilder returned by from_mmaped_safetensors. Here is an example of loading the VarBuilder:

```rust
let vb = from_mmaped_safetensors(&filenames, dtype, &device, false)?;
```

That vb is then passed to get_lora_model:

```rust
if merge {
    this.get_merged_lora_model(
        lora_config,
        &vb.pp("lora_llama"),
        Some(linear_config),
        None,
        None,
        Some(embed_config),
    )
} else {
    this.get_lora_model(
        lora_config,
        &vb.pp("lora_llama"),
        Some(linear_config),
        None,
        None,
        Some(embed_config),
    )
}
```
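
For context, a sketch of how the filenames list might be built when the saved adapter is loaded alongside the base weights, so that the "lora_llama"-prefixed tensors are visible under vb.pp("lora_llama") (the adapter file name is hypothetical):

```rust
// Sketch: include both the base model and the saved LoRA adapter in the list that
// is memory-mapped, so get_lora_model can find the "lora_llama"-prefixed tensors.
let filenames = vec![
    std::path::PathBuf::from("model.safetensors"),        // original TinyLlama weights
    std::path::PathBuf::from("lora_adapter.safetensors"), // hypothetical name for the saved adapter
];
let vb = from_mmaped_safetensors(&filenames, dtype, &device, false)?;
```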

@Adamska1008
Author

Really helpful, thanks again!

@EricLBuehler
Owner

Glad to help!
