Speech To Text

This simple python project lets you convert the audio of a file into searchable text by using cloud computing resources from Azure Cognitive Services.

Requirements

Python 3
Instance of Azure Speech Service
Recommended audio format:
- type: WAV (required)
- precision: 16-bit
- sample rate: 8kHz or 16kHz
- channel: mono

Getting started

Setup the Azure Speech service

Create free Azure Subscripition
Create free instance of Speech service (5 audio hours per month)

Prepare the audio

The default audio format for the recognition to work is WAV (16 kHz or 8 kHz, 16-bit, and mono PCM). You can convert your audio with this Online Audio Converter.

Setup the environment

Create virutal environment for installing the dependencies
```
python3 -m venv venv
```

Activate virtual environment

# Linux
source venv/bin/activate

# Windows
.\venv\Scripts\activate

Install dependencies
```
pip install -r requirements.txt
```

Provide your configuration

Get API key and region of your Speech service resource
Enter API key and location into env_sample.txt
Enter input path, output path and language of your audio file into env_sample.txt
Rename the file to .env

Run the transcription

python3 transcription.py

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.gitignore		.gitignore
README.md		README.md
env_sample.txt		env_sample.txt
requirements.txt		requirements.txt
transcription.py		transcription.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speech To Text

Requirements

Getting started

Setup the Azure Speech service

Prepare the audio

Setup the environment

Provide your configuration

Run the transcription

About

Languages

flumi3/speech-to-text

Folders and files

Latest commit

History

Repository files navigation

Speech To Text

Requirements

Getting started

Setup the Azure Speech service

Prepare the audio

Setup the environment

Provide your configuration

Run the transcription

About

Topics

Resources

Stars

Watchers

Forks

Languages