Creating a LoRa of Your Own Data #469

olihough86 · 2023-03-21T10:08:06Z

olihough86
Mar 21, 2023

I'm knocked back at the speed things have developed int he past couple of weeks.

Now with the ability to create Alpaca as a LoRA which is great, how much of a step is this from being able to create LoRAs with a bunch of your own data? for example feeding in a whole bunch of papers you wrote or niche subjects that you know the base models will be lacking on, or even trying to push the model to lean towards a certain writing style or subject? I'm seeing a lot of guides on instruct fine tuning but nothing on just cramming in more text.

So I guess my question is, can someone put together a short guide for those of us that don't have any experience training models? (I'm very tech savvy, but many AI subjects are witchcraft to me)

olihough86 · 2023-03-21T11:12:12Z

olihough86
Mar 21, 2023
Author

Looks like I'm not the only one :D seems progress was made here

tloen/alpaca-lora#45

0 replies

oobabooga · 2023-03-22T02:11:14Z

oobabooga
Mar 22, 2023
Maintainer

See here, it's super easy to create a LoRA: https://github.com/oobabooga/text-generation-webui/wiki/Using-LoRAs#training-a-lora

Formatting and cleaning the dataset is the hardest part.

0 replies

olihough86 · 2023-03-22T08:58:35Z

olihough86
Mar 22, 2023
Author

Thanks, I might have got the wrong end of the stick, this assumes you have your data as questions and answers, I guess I could generate a bunch of question answer pairs based off my data set, do you think this would be the best method of getting new knowledge in?

0 replies

bbecausereasonss · 2023-03-22T13:05:58Z

bbecausereasonss
Mar 22, 2023

Is question and answer data-pairs the only way to fine-tune? Is there anyway of having the LLM parse and make sense of data?

0 replies

oobabooga · 2023-03-22T13:23:50Z

oobabooga
Mar 22, 2023
Maintainer

The line that generates the training data is this one: https://github.com/tloen/alpaca-lora/blob/main/finetune.py#L164

You could try using a single row with a very long string (like a book) to see if it works. I am not experienced with training neural networks, hopefully someone can tell us the right way to do this.

0 replies

St33lMouse · 2023-03-26T06:32:37Z

St33lMouse
Mar 26, 2023

It would be really cool to train a lora based on a novel. For example, you train on Lord of the Rings, then load attach the lora to your favorite adventure model and boom!

You've got a ready made adventure with strong knowledge of Lord of the Rings. Could this work? Is it working already, or has anyone tried it?

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Creating a LoRa of Your Own Data #469

{{title}}

Replies: 6 comments

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply

Creating a LoRa of Your Own Data #469

olihough86 Mar 21, 2023

Replies: 6 comments

olihough86 Mar 21, 2023 Author

oobabooga Mar 22, 2023 Maintainer

olihough86 Mar 22, 2023 Author

bbecausereasonss Mar 22, 2023

oobabooga Mar 22, 2023 Maintainer

St33lMouse Mar 26, 2023

olihough86
Mar 21, 2023

olihough86
Mar 21, 2023
Author

oobabooga
Mar 22, 2023
Maintainer

olihough86
Mar 22, 2023
Author

bbecausereasonss
Mar 22, 2023

oobabooga
Mar 22, 2023
Maintainer

St33lMouse
Mar 26, 2023