Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning

This repository (deep-spin/translation_llm) contains the code for the paper "Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning". The code is based on the lit-llama repository.

Quick Installation

To install the dependencies and then the package itself in editable mode, run:

pip install -r requirements.txt
pip install -e .

Convert LLaMA checkpoint

Before finetuning the pre-trained model, convert the original LLaMA checkpoint into the format this codebase expects. To do this, run:

python translation_llm/convert_checkpoint.py \
    --output_dir <directory to save converted checkpoint> \
    --checkpoint_dir <directory with original checkpoint> \
    --model_size <7B/13B>
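
When the conversion has to be scripted, for example once per model size, it can help to assemble the command programmatically. The following is a minimal sketch, not part of the repository: the helper name build_convert_command and the example paths are hypothetical, and only the three flags shown above are used. It assumes the command is launched from the repository root.

```python
import shlex

# Sizes listed in the usage line above (<7B/13B>).
SUPPORTED_SIZES = ("7B", "13B")


def build_convert_command(checkpoint_dir, output_dir, model_size):
    """Return the argument list for translation_llm/convert_checkpoint.py.

    Hypothetical helper: it only assembles the command, it does not run it.
    """
    if model_size not in SUPPORTED_SIZES:
        raise ValueError(
            f"model_size must be one of {SUPPORTED_SIZES}, got {model_size!r}"
        )
    return [
        "python",
        "translation_llm/convert_checkpoint.py",
        "--output_dir", str(output_dir),
        "--checkpoint_dir", str(checkpoint_dir),
        "--model_size", model_size,
    ]


if __name__ == "__main__":
    # Hypothetical paths; replace them with your own directories.
    cmd = build_convert_command(
        checkpoint_dir="checkpoints/llama-original",
        output_dir="checkpoints/llama-converted",
        model_size="7B",
    )
    # Print the command so it can be inspected before running it.
    print(shlex.join(cmd))
    # To run it from the repository root:
    # import subprocess; subprocess.run(cmd, check=True)
```

Rejecting unsupported sizes before launching the script avoids starting a long conversion with a mistyped argument.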

Finetune

See the finetune_lora.sh and finetune_no_lora.sh scripts for examples of how to finetune the models with and without LoRA.

Generate

See the generate_lora.sh and generate_no_lora.sh scripts for examples of how to generate translations from the finetuned models.

Evaluate

Evaluation requires the sacrebleu and unbabel-comet packages, which must be installed in the evaluation environment.
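
Before running an evaluation, it may be useful to confirm that both packages are importable in the active environment. The sketch below is an illustration, not part of the repository: the helper name missing_eval_packages is hypothetical, and it relies on unbabel-comet being imported under the module name comet.

```python
from importlib.util import find_spec

# pip package name -> importable module name.
# unbabel-comet installs a module called "comet".
EVAL_PACKAGES = {
    "sacrebleu": "sacrebleu",
    "unbabel-comet": "comet",
}


def missing_eval_packages():
    """Return the pip names of evaluation packages that cannot be imported."""
    return [
        pip_name
        for pip_name, module_name in EVAL_PACKAGES.items()
        if find_spec(module_name) is None
    ]


if __name__ == "__main__":
    missing = missing_eval_packages()
    if missing:
        print("Missing packages; install with: pip install " + " ".join(missing))
    else:
        print("Evaluation packages found.")
```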

See the eval.sh script for an example of how to evaluate the models.
