Fix nit in LoRA doc #1054
Conversation
Hi @awni, according to OpenAI's official documentation, this may not be a bug. For details, see: https://platform.openai.com/docs/guides/fine-tuning/fine-tuning-examples
Yes, it is true that it should be like that according to the OpenAI docs, but MLX does not fine-tune well if you adhere to that format...
If you check the chat_template configuration in the meta-llama/Llama-3.1-8B-Instruct repository, you may find that the reason is the required format of the function's return value, which is inconsistent with the OpenAI format: Llama-3.1-8B-Instruct requires the tool result to be returned as a dictionary.
So if you format your data according to OpenAI's documentation, issues may arise, because the fine-tuning data format is then inconsistent with the base model's format. When I reviewed the Hugging Face chat_template documentation again, I found that it also highlights this point; the difference is sketched below.
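For illustration, here is a minimal sketch of the difference. The message shapes and field values are assumptions chosen for illustration, not copied verbatim from either doc:

```python
# Illustration only: the exact message shapes are assumptions, not taken
# verbatim from the OpenAI or Llama 3.1 documentation.

# OpenAI-style fine-tuning data: the tool result is a JSON *string*.
openai_style_tool_message = {
    "role": "tool",
    "content": '{"temperature": 22, "unit": "celsius"}',
}

# Llama-3.1-style data: the chat template expects the tool result as a
# *dictionary*, which the template serializes itself.
llama_style_tool_message = {
    "role": "tool",
    "content": {"temperature": 22, "unit": "celsius"},
}
```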
When I checked the mistral-finetune project again, I found that its data format is consistent with the OpenAI format. So I think there may not be one strictly correct format here; the key point is to make sure that the format of your fine-tuning dataset matches the base model's format, otherwise problems will arise. I think this part could be explained in the documentation. A quick way to check the match is sketched below.
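As a rough sketch of that consistency check (assuming the illustrative messages above, that `transformers` is installed, and that the model repo is accessible), you can render your data through the base model's own chat template and eyeball the result:

```python
# Sketch only: sanity-check fine-tuning data by rendering it through the
# base model's own chat template. The message list is illustrative.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.1-8B-Instruct")

messages = [
    {"role": "user", "content": "What is the weather in Paris?"},
    {"role": "tool", "content": {"temperature": 22, "unit": "celsius"}},
]

# If the data shape and the template disagree (e.g. string vs. dict tool
# results), the rendered text will look wrong or this call may raise.
rendered = tokenizer.apply_chat_template(messages, tokenize=False)
print(rendered)
```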
I think the right call here is to leave a pointer in the doc noting that the data format that works well for tool calling can differ between models, and to link to the page about tool use in HF chat templates that you found, @madroidmaq. Thanks for digging into that. I will update the doc and leave the current example as is.
Very tiny fix, closes #1053