This project allows for using GCP (Google Cloud Provider) to create text transcriptions of large audio files. This could be useful for things like podcasts, meeting minutes etc.
This project is code that can be deployed to Google Cloud as two Google Functions. It also relies on Google Storage and the Google Speech API.
Note that if you exceed your free introductory stipend by Google, you will have to pay for your usage. However, a decent amount can be transcribed within the introductory free money Google proides.
- Create a Google Cloud account and activate the
Cloud Functions API
, theCloud Pub/Sub API
, theCloud Logging API
, theCloud Speech-to-Text API
and Google Storage. You can activate the services by clicking 'Go To Console' then using the top search bar to find each service and click 'Enable'. - In Google Storage, create three buckets. Name them
transcriber-input
,transcriber-processed
andtranscriber-output
specifically. (You can technically name them different things, but you'd have to update parts of the code and deployment scripts). - Download Node
- Download the google cloud sdk and initialize it and log in
- Then run
yarn deploy
in thetranscribe-function
directory as well as theaudio-file-splitter-function
directory to deploy the two necessary Google Functions.
To transcribe an audio file, upload the file via the Google Storage UI to the transcriber-input
bucket.
This will automatically trigger the Google Function that converts it to the flac format and splits up large files. When the function is done, another function will be triggered to transcribe the audio files to text. When that second function completes, a full transcript will appear in the transcriber-output
folder. You can download it via the UI and use it however you want.