GitHub - xz259/aimo: This project utilizes large language models to solve challenging math problems.

This repository contains the source code and resources for an LLM-based mathematical problem solver. It combines the generative capabilities of large language models with the discriminative power of classical machine learning techniques.

Project Structure

inference.py: Contains the main inference pipeline for solving mathematical problems using the two-stage approach.
qlora_training.py: Script for fine-tuning the base language model using QLoRA (Quantized Low-Rank Adaptation).
statistical_features_classifier.py: Trains the second-stage logistic regression classifier using aggregated statistics from the QLoRA model's outputs.
model/: Directory containing the base language model and fine-tuned QLoRA adapters.
data/: Contains training, validation, and test datasets.

Setup

Clone the Repository:

git clone https://github.com/yourusername/AIMO.git
cd AIMO

Install Dependencies:
```
pip install -r requirements.txt
```

Download the Base Model:

Download the base language model (e.g., DeepSeek-Math-7B-RL) and place it in the model/ directory.

Running the Project

Fine-tune the QLoRA Model:
```
python qlora_training.py
```

Train the statistical features classifier:

python aggregated_statistics_classifier.py

Run inference:
```
python inference.py
```

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
data		data
model_checkpoints		model_checkpoints
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
inference.py		inference.py
qlora fine-tuning.py		qlora fine-tuning.py
requirements.txt		requirements.txt
statistical_features_classifier.py		statistical_features_classifier.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project Structure

Setup

Running the Project

About

Releases

Packages

Languages

License

xz259/aimo

Folders and files

Latest commit

History

Repository files navigation

Project Structure

Setup

Running the Project

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages