- mbazaNLP/NMT_Tourism_parallel_data_en_kin
- mbazaNLP/NMT_Education_parallel_data_en_kin
- mbazaNLP/Kinyarwanda_English_parallel_dataset
- en
- rw
- transformers
This is a machine translation model finetuned from NLLB-200's distilled 1.3B model. It is intended for machine translation of education-related data.
- Finetuning code repository: the code used to finetune this model can be found here
Use the code below to get started with the model.
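Below is a minimal usage sketch with the `transformers` library. The repository ID and the example sentence are assumptions to be replaced with the actual checkpoint name; the language codes follow the standard NLLB FLORES-200 convention (`eng_Latn` for English, `kin_Latn` for Kinyarwanda).

```python
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# Hypothetical repository ID; substitute the actual Hub checkpoint name.
checkpoint = "mbazaNLP/Nllb_finetuned_education_en_kin"

# NLLB models use FLORES-200 language codes: "eng_Latn" (English), "kin_Latn" (Kinyarwanda).
tokenizer = AutoTokenizer.from_pretrained(checkpoint, src_lang="eng_Latn")
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

text = "The school year starts in September."  # placeholder input sentence
inputs = tokenizer(text, return_tensors="pt")

# Force the decoder to start with the Kinyarwanda language token so the model
# translates into Kinyarwanda rather than copying the source language.
generated = model.generate(
    **inputs,
    forced_bos_token_id=tokenizer.convert_tokens_to_ids("kin_Latn"),
    max_length=128,
)
print(tokenizer.batch_decode(generated, skip_special_tokens=True)[0])
```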
The model was finetuned on three datasets: a general-purpose dataset, a tourism dataset, and an education dataset.
The model was finetuned in two phases.

Phase one:
- General purpose dataset
- Education dataset
- Tourism dataset

Phase two:
- Education dataset or Tourism dataset (depending on the downstream task)
Other than the change of datasets between the phase-one and phase-two finetuning, no hyperparameters were modified. In both phases, the model was trained on an A100 40 GB GPU for two epochs.
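The sketch below illustrates what a phase-two finetuning run of this kind could look like with the `Seq2SeqTrainer` API. Only the base checkpoint, the dataset name, and the two-epoch setting come from this card; the learning rate, batch size, column names, and other arguments are illustrative assumptions.

```python
from datasets import load_dataset
from transformers import (
    AutoModelForSeq2SeqLM,
    AutoTokenizer,
    DataCollatorForSeq2Seq,
    Seq2SeqTrainer,
    Seq2SeqTrainingArguments,
)

base = "facebook/nllb-200-distilled-1.3B"
tokenizer = AutoTokenizer.from_pretrained(base, src_lang="eng_Latn", tgt_lang="kin_Latn")
model = AutoModelForSeq2SeqLM.from_pretrained(base)

# Phase two: continue training on the in-domain (education) parallel data only.
raw = load_dataset("mbazaNLP/NMT_Education_parallel_data_en_kin")

def preprocess(batch):
    # Column names ("en", "rw") are assumptions; adjust to the dataset schema.
    return tokenizer(batch["en"], text_target=batch["rw"], truncation=True, max_length=256)

# Assumes the dataset ships with a "train" split.
tokenized = raw.map(preprocess, batched=True, remove_columns=raw["train"].column_names)

args = Seq2SeqTrainingArguments(
    output_dir="nllb-education-en-kin",
    num_train_epochs=2,             # stated in the card: two epochs per phase
    per_device_train_batch_size=8,  # assumption, sized for a single 40 GB A100
    learning_rate=2e-5,             # assumption
    fp16=True,
)

trainer = Seq2SeqTrainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
)
trainer.train()
```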
Model performance was measured using BLEU, spBLEU, TER, and chrF++ metrics.
| Lang. Direction | BLEU | spBLEU | chrF++ | TER |
|---|---|---|---|---|
| Eng -> Kin | 28.37 | 40.62 | 56.48 | 59.71 |
| Kin -> Eng | 42.54 | 44.84 | 61.54 | 43.87 |

| Lang. Direction | BLEU | spBLEU | chrF++ | TER |
|---|---|---|---|---|
| Eng -> Kin | 45.96 | 59.20 | 68.79 | 41.61 |
| Kin -> Eng | 43.98 | 44.94 | 63.05 | 41.41 |
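Scores of this kind can be computed on a held-out test set with the `sacrebleu` library, as in the sketch below. The hypothesis and reference lists are placeholders, and the `flores200` tokenizer used for spBLEU requires a recent sacrebleu release.

```python
import sacrebleu

# Placeholder lists: one model output per sentence, and one list of references
# covering all sentences (sacrebleu expects a list of reference streams).
hypotheses = ["model translation of sentence 1"]
references = [["reference translation of sentence 1"]]

bleu = sacrebleu.corpus_bleu(hypotheses, references)                          # BLEU
spbleu = sacrebleu.corpus_bleu(hypotheses, references, tokenize="flores200")  # spBLEU (needs sacrebleu >= 2.2)
chrf = sacrebleu.corpus_chrf(hypotheses, references, word_order=2)            # chrF++ (word_order=2)
ter = sacrebleu.corpus_ter(hypotheses, references)                            # TER

print(f"BLEU {bleu.score:.2f}  spBLEU {spbleu.score:.2f}  chrF++ {chrf.score:.2f}  TER {ter.score:.2f}")
```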