Monte Carlo Tree Search with Self-Refine

This repository contains an implementation of Monte Carlo Tree Search (MCTS) with self-refinement, using a locally hosted LLaMA instance for generating answers. This implementation specifically focuses on handling gsm8k and MATH datasets using the Hugging Face datasets library.

❗Note: This repository is very much based on Trotsky1997's implementation, which is (presumably) much better tested than mine. If you're looking for working code I recommend using theirs, but if you're looking to learn more by getting your hands dirty, I would love to collaborate!

Installation

Clone the repository:

git clone https://github.com/naivoder/MCTSr.git
cd MCTSr/

Set up a virtual environment and activate it:

conda create -n mctsr python=3.11     # Use conda like a sane person
python -m venv env                    # Otherwise use venv
source env/bin/activate               # On Windows use `env\Scripts\activate`

Please use conda. Why does anyone use venv anymore??

Install the required dependencies:
```
pip install -r requirements.txt
```
Download and set up the LLaMA model:

Follow the instructions provided by Hugging Face to download and set up the LLaMA model.

Usage

Modify the MODEL_NAME and DATA_NAME variables in main.py as needed:

MODEL_NAME = "meta-llama/Meta-Llama-3-8B"
DATA_NAME = "gsm8k-rs-mistral7B"

Run the main script:
```
python main.py
```

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
README.md		README.md
extractor.py		extractor.py
generate.py		generate.py
main.py		main.py
mctsr.py		mctsr.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Monte Carlo Tree Search with Self-Refine

Installation

Usage

About

Releases

Packages

Languages

naivoder/MCTSr

Folders and files

Latest commit

History

Repository files navigation

Monte Carlo Tree Search with Self-Refine

Installation

Usage

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages