Transcript an entire playlist from YouTube, based on OpenAI Whisper.
This repo's scripts have since been modified to use faster-whisper. Note: pytube seems to be a bit borked,
Example for subtitles (in Hebrew) obtained from the regular whisper model is Tutorial 10 - Cross Validation
- You need a GPU that uses CUDA, so that means a modern Nvidia GPU with a decent amount of VRAM (check whisper's and faster-whisper's requirements).
- You need to make sure you have met the requirements of Whisper. That means having an updated version of CUDA Toolkit, adding its bin folder to your system path (CRUCIAL!), etc.
git clone
cd YouTubeTranscripter
pip install -r requirements.txt
- Run
python download.py
.
download-faster.ipynb is meant for Google Colab.
There are several .py files:
- TranscribeAudioFile.py is meant to transcribe a single audio file. Simply run the script with an argument that contains the path of the audio file you wish to transcribe (Useage example: python TranscribeAudioFile.py "SomeAudioFile.wav").
- TranscribeYouTubePlaylist.py is the one that can transcribe an entire YouTube playlist. It has no arguments, just follow the instructions.
- download.py is an older, deprecated version of the playlist script that uses the original whisper model.