Skip to content

Latest commit

 

History

History
73 lines (47 loc) · 4.85 KB

README.md

File metadata and controls

73 lines (47 loc) · 4.85 KB

ESPnet Notebooks

Demo

ASR (Speech recognition)

SE (Speech enhancement/separation)

SLU (Spoken language understanding)

TTS (Text-to-speech)

Other utilities

ESPnet-EZ

ASR (Speech recognition)

ST (Speech-to-text translation)

  • integrate_huggingface.ipynb: Integrating the weakly-supervised model (OWSM) and huggingface's pre-trained language model with ESPnet-EZ on MuST-C-v2.
  • ST_finetune_owsm.ipynb: Fine-tuning the weakly-supervised model (OWSM) with ESPnet-EZ on MuST-C-v2.

SLU (Spoken language understanding)

Course

CMU SpeechProcessing Spring2023

CMU SpeechRecognition Fall2022

CMU SpeechRecognition Fall2021

ESPnet1 (Legacy)