Open source speech to text models for Indic Languages
-
Updated
Sep 16, 2022
Open source speech to text models for Indic Languages
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
Code related to the Dutch instance and user groups of the KALDI speech recognition toolkit
Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch
It is a system through which various audio speech files are classified into different emotions such as happy, sad, anger and neutral by computer. SER can be used in areas such as the medical field or customer call centers.
Forced alignment of Nda‘ Nda’ a Cameroonian language
Offline free automatic speech recognition
Fine-tuning Multilingual Large Speech Recognition Models: Wav2vec and Whisper
Chinese Mandarin Synthesis Corpus-Customer Sevice
Proyecto de reconocimiento de voz utilizando el modelo VOSK, desarrollado por GEO.VOICE-TECH, enfocado en aplicaciones de campo donde la transcripción automática es esencial.
Sara :- The Personal Voice Assistant
Add a description, image, and links to the speech-recognition-model topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition-model topic, visit your repo's landing page and select "manage topics."