ASR, speech transcription system on Librispeech data

lndip/speech_recognition

Introduction

This project addresses automatic speech recognition (ASR), specifically speech transcription, using a deep neural network (DNN) architecture. The model was trained and tested on 10% of train-clean-100 and test-clean from LibriSpeech. The implementation follows the AssemblyAI tutorial on end-to-end (E2E) speech recognition systems.

Model architecture

This project employs a CRNN structure with convolutional and GRU blocks to process the input spectrogram. The model outputs prediction probabilities for the letters at each time step.
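As a rough illustration of the structure described above, the following is a minimal PyTorch sketch of a CRNN: one convolutional block over the spectrogram, a bidirectional GRU over the time axis, and a linear layer that yields per-frame log-probabilities over characters. It is not the code in this repository. The class name `CRNN`, the layer sizes, the 64 mel bins, and the 29 output classes (assumed to be 26 letters, space, apostrophe, and a blank symbol) are all hypothetical choices made for the example.

```python
import torch
import torch.nn as nn


class CRNN(nn.Module):
    """Illustrative CRNN: conv block -> bidirectional GRU -> per-frame character scores.

    All sizes are assumptions for this sketch, not the repository's settings.
    """

    def __init__(self, n_mels=64, n_classes=29, hidden=128):
        super().__init__()
        # Convolution over (frequency, time); stride 2 on frequency only,
        # so the number of time steps is preserved.
        self.conv = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=3, stride=(2, 1), padding=1),
            nn.BatchNorm2d(32),
            nn.ReLU(),
        )
        # Frequency bins left after the strided convolution.
        freq_out = (n_mels + 2 * 1 - 3) // 2 + 1
        self.gru = nn.GRU(32 * freq_out, hidden, batch_first=True, bidirectional=True)
        self.fc = nn.Linear(2 * hidden, n_classes)

    def forward(self, spec):
        # spec: (batch, 1, n_mels, time)
        x = self.conv(spec)                       # (batch, 32, freq_out, time)
        b, c, f, t = x.shape
        # Merge channels and frequency into one feature vector per time step.
        x = x.permute(0, 3, 1, 2).reshape(b, t, c * f)
        x, _ = self.gru(x)                        # (batch, time, 2 * hidden)
        return self.fc(x).log_softmax(dim=-1)     # (batch, time, n_classes)
```

With these assumed sizes, a batch of two spectrograms of shape `(2, 1, 64, 50)` produces an output of shape `(2, 50, 29)`, that is, one distribution over characters per time step.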

Installation

To run the code, you need Python, PyTorch, and NumPy.

How to run

asr_main.py incorporates the training loop and the testing stage of the speech transcription model.
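At test time, the per-frame letter probabilities have to be turned into text. This README does not say how asr_main.py does that. The sketch below assumes CTC-style outputs with a blank symbol and uses greedy decoding: pick the best class per frame, merge consecutive repeats, then drop blanks. The names `greedy_decode`, `ALPHABET`, and `BLANK`, and the index-to-character table, are hypothetical.

```python
# Hypothetical index -> character table; index 0 is assumed to be the CTC blank.
BLANK = 0
ALPHABET = ["_", "'", " "] + list("abcdefghijklmnopqrstuvwxyz")


def greedy_decode(frame_scores):
    """Decode a sequence of per-frame score lists into a string.

    frame_scores: list of lists, one score per class for each time step.
    """
    # Best class index for each frame.
    best = [max(range(len(frame)), key=frame.__getitem__) for frame in frame_scores]

    chars = []
    prev = None
    for idx in best:
        # Keep a symbol only if it differs from the previous frame and is not blank.
        if idx != prev and idx != BLANK:
            chars.append(ALPHABET[idx])
        prev = idx
    return "".join(chars)
```

Under this scheme a blank between two identical letters keeps them separate, so frames whose best classes spell `l`, blank, `l` decode to "ll" rather than "l".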

Authors

  • Diep Luong
  • Fareeda Mohammad
