This repository contains a deep learning-based model for solving various types of math problems.
The purpose of this project is to develop a robust model capable of solving a wide range of mathematical problems using deep learning techniques. We first experimented with several open-source models to establish a performance baseline. We then fine-tuned the DeepSeek-Math model with QLoRA to improve its accuracy.
- Solves diverse types of math problems.
- Uses QLoRA fine-tuning on top of the DeepSeek-Math model.
- Integrates easily into existing systems or applications.
To use the DeepSeek-Math model fine-tuned with QLoRA, download the LoRA adapters from Kaggle and follow the integration instructions in the model card.
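As a rough illustration of that integration step, the sketch below loads a base checkpoint, attaches downloaded LoRA adapters with the `peft` library, and generates an answer. The base checkpoint ID `deepseek-ai/deepseek-math-7b-base`, the adapter directory, the prompt format, and the helper names `build_prompt`, `load_model`, and `solve` are assumptions made for illustration. The model card remains the authoritative source for the actual setup.

```python
# Assumed base checkpoint; use the one named in the model card.
BASE_MODEL_ID = "deepseek-ai/deepseek-math-7b-base"


def build_prompt(problem: str) -> str:
    """Wrap a math problem in a simple step-by-step prompt (assumed format)."""
    return (
        f"{problem.strip()}\n"
        "Please reason step by step, and put your final answer within \\boxed{}."
    )


def load_model(adapter_dir: str, base_id: str = BASE_MODEL_ID):
    """Load the base model and attach the LoRA adapters stored in adapter_dir."""
    # Heavy imports are deferred so build_prompt works without a GPU stack.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base = AutoModelForCausalLM.from_pretrained(
        base_id,
        torch_dtype=torch.bfloat16,
        device_map="auto",  # requires the `accelerate` package
    )
    model = PeftModel.from_pretrained(base, adapter_dir)
    model.eval()
    return tokenizer, model


def solve(problem: str, tokenizer, model, max_new_tokens: int = 512) -> str:
    """Generate a greedy answer and return only the newly generated text."""
    import torch

    inputs = tokenizer(build_prompt(problem), return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(
            **inputs, max_new_tokens=max_new_tokens, do_sample=False
        )
    # Drop the prompt tokens so only the model's answer is decoded.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

A typical call would be `tokenizer, model = load_model("path/to/adapters")` followed by `solve("What is 12 * 7?", tokenizer, model)`, where `path/to/adapters` is a placeholder for wherever the Kaggle download was unpacked.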
- Open-Source Models: We experimented with various open-source models available in the literature.
- Fine-Tuned Model: The DeepSeek-Math model was fine-tuned with QLoRA to improve its accuracy and performance. For a detailed fine-tuning report, refer to the W&B report.
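For readers unfamiliar with QLoRA, the sketch below shows what a typical setup looks like with `transformers`, `bitsandbytes`, and `peft`: the base model is loaded in 4-bit NF4 precision and small trainable LoRA matrices are attached on top. Every hyperparameter here (rank, alpha, dropout, target modules) and the helper names are illustrative assumptions, not the values used in this project. The W&B report is the place to look for the actual configuration.

```python
# Illustrative LoRA hyperparameters; NOT the values used in this project.
QLORA_HPARAMS = {
    "r": 16,
    "lora_alpha": 32,
    "lora_dropout": 0.05,
    # Assumed attention projection names for a Llama-style architecture.
    "target_modules": ["q_proj", "k_proj", "v_proj", "o_proj"],
}


def lora_scaling(hparams: dict = QLORA_HPARAMS) -> float:
    """Standard LoRA scales the low-rank update by lora_alpha / r."""
    return hparams["lora_alpha"] / hparams["r"]


def build_qlora_model(base_id: str):
    """Load base_id in 4-bit and wrap it with trainable LoRA adapters."""
    # Heavy imports are deferred; this function needs a CUDA-capable setup.
    import torch
    from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training
    from transformers import AutoModelForCausalLM, BitsAndBytesConfig

    bnb_config = BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_quant_type="nf4",
        bnb_4bit_use_double_quant=True,
        bnb_4bit_compute_dtype=torch.bfloat16,
    )
    base = AutoModelForCausalLM.from_pretrained(
        base_id, quantization_config=bnb_config, device_map="auto"
    )
    # Prepares the quantized model for training (e.g. casts norm layers).
    base = prepare_model_for_kbit_training(base)
    lora_config = LoraConfig(task_type="CAUSAL_LM", **QLORA_HPARAMS)
    return get_peft_model(base, lora_config)
```

Only the adapter weights are trained in this setup, which is why the result can be distributed as a small set of LoRA adapters rather than a full model checkpoint.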
Data used for training and fine-tuning the models are not included in this repository due to licensing restrictions. Instructions for obtaining the data can be found here.
- We thank the developers of the open-source models used in this project for their contributions.
- Special thanks to the team behind QLoRA for providing an efficient method for fine-tuning large language models.
This project is licensed under the MIT License.
Contributions are welcome! Please feel free to open issues or pull requests.