A .NET Core console application to transcribe audio using the Google Speech-to-Text API.
Features
- Supports long-running transcription
- Temporarily uploads audio for analysis to Google Cloud Storage
- Supports WAV/PCM/FLAC audio formats
- Outputs transcribed text in .json and .text formats
Supported Platforms
- Windows
- Linux
- Unix, macOS
Usage
Flag | Description | Required |
---|---|---|
-c "credentials.json" |
The path to the "credentials.json" file | True |
-a "audio.flac" |
The path to the audio file to transcribe | True |
-b "bucket" |
The bucket to store the uploaded audio, defaults to "gcs-tool" | False |
-t "en-US" |
The language code of the supplied audio, defaults to "en-US" | False |
Requirements
Enable the Google Speech-to-Text API as described here.
- Go to the Google API Console.
- Select a project.
- In the sidebar on the left, expand APIs & auth and select APIs.
- In the displayed list of available APIs, click the Speech-to-Text link and click Enable API.
Disclaimer
The material embodied in this software is provided to you "as-is" and without warranty of any kind, express, implied or otherwise, including without limitation, any warranty of fitness for a particular purpose. In no event shall the author be liable to you or anyone else for any direct, special, incidental, indirect or consequential damages of any kind, or any damages whatsoever, including without limitation, loss of profit, loss of use, savings or revenue, or the claims of third parties, whether or not the author has been advised of the possibility of such loss, however caused and on any theory of liability, arising out of or in connection with the possession, use or performance of this software.