Skip to content

This codebase utilizes the Whisper ASR model to transcribe audio files. It segments longer audio into 20-minute chunks, and it can also convert MP4 videos to MP3 for transcription.

Notifications You must be signed in to change notification settings

ShobhitVishnoi30/whisper-transcript-translation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

whisper_transcription

This codebase utilizes the Whisper ASR model to transcribe audio files. It segments longer audio into 20-minute chunks, and it can also convert MP4 videos to MP3 for transcription.

Table of Contents

Installation

Clone the repository:

git clone https://github.com/svrapidinnovation/whisper_transcription
cd whisper_transcription

Create and activate a virtual environment

python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install Dependecies

pip install -r requirements.txt

It also requires the command-line tool ffmpeg to be installed on your system, which is available from most package managers:

# on Ubuntu or Debian
sudo apt update && sudo apt install ffmpeg

# on Arch Linux
sudo pacman -S ffmpeg

# on MacOS using Homebrew (https://brew.sh/)
brew install ffmpeg

# on Windows using Chocolatey (https://chocolatey.org/)
choco install ffmpeg

# on Windows using Scoop (https://scoop.sh/)
scoop install ffmpeg

Run the project

python app.py

About

This codebase utilizes the Whisper ASR model to transcribe audio files. It segments longer audio into 20-minute chunks, and it can also convert MP4 videos to MP3 for transcription.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages