Skip to content

Latest commit

 

History

History
53 lines (31 loc) · 1.53 KB

README.md

File metadata and controls

53 lines (31 loc) · 1.53 KB

KLEA

An open-source Khmer Word to Speech Model. Just single word not sentence!

Open In Colab

1. Setup

pip install -r requirements.txt

2. Download Checkpoint

G_60000.pth

wget https://huggingface.co/spaces/seanghay/KLEA/resolve/main/G_60000.pth

Place the checkpoint in the current directory.

3. Inference

python infer.py "មនុស្សខ្មែរ"

This will output a file called audio.wav in the current directory. Output audio sample rate is 22.05 kHz.

Gradio

python app.py

Colab

image

Dataset

This model was trained on kheng.info dataset. You can find it on http://kheng.info or at https://hf.co/datasets/seanghay/khmer_kheng_info_speech

Reference