Fine-Tuning Chat Model with Domain-Specific Data for custom dataset #1877

anantgupta129 · 2024-12-18T08:55:45Z

I have domain-specific data that I've preprocessed from PDFs into a c4_demo.json format, where each entry looks like [{"text": "<data from file1.pdf"}]. Now, I want to fine-tune my model for chat-based interactions to generate structured chat outputs. My dataset is currently in the following format: [{"role": "system", "content": ""}, {"role": "user", "content": ""}, {"role": "assistant", "content": ""}, ...]. Which fine-tuning method should I use for this?

The text was updated successfully, but these errors were encountered:

anantgupta129 added the question Further information is requested label Dec 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fine-Tuning Chat Model with Domain-Specific Data for custom dataset #1877

Fine-Tuning Chat Model with Domain-Specific Data for custom dataset #1877

anantgupta129 commented Dec 18, 2024

Fine-Tuning Chat Model with Domain-Specific Data for custom dataset #1877

Fine-Tuning Chat Model with Domain-Specific Data for custom dataset #1877

Comments

anantgupta129 commented Dec 18, 2024