Currently, the IBM Watson API only lets you implement Speech to Text (STT) and Text to Speech (TTS) with audio files stored on your computer.
WARVIS works directly with your microphone and provides the functionality of IBM Watson STT without having to store audio files of your voice recordings.
- You can customise the loudness threshold for your voice, so WARVIS knows when to start 'listening'
- You can also change the number of seconds WARVIS waits after you stop speaking, so you are not cut off when you pause between sentences
- Just type in the words that you would like your computer to 'say' and it will do it in a matter of seconds, without having to save an audio file of the generated voice.
- Go to the IBM Cloud website and create credentials for TTS and STT (it's free!)
- Insert them into main.py
The program might take some time on its initial launch, so please be patient!
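
To illustrate how the loudness threshold and the wait-after-speaking setting described above can work together, here is a minimal sketch in plain Python. It is not taken from WARVIS's `main.py`: the function names `rms` and `find_utterance` and the parameters `threshold`, `chunk_seconds` and `silence_seconds` are hypothetical, and the loudness measure (RMS over a chunk of samples) is an assumption about how such a threshold might be applied.

```python
import math
from typing import Iterable, Optional, Sequence, Tuple


def rms(samples: Sequence[int]) -> float:
    """Root-mean-square loudness of one chunk of PCM samples (hypothetical helper)."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def find_utterance(
    loudness: Iterable[float],
    threshold: float,
    chunk_seconds: float,
    silence_seconds: float,
) -> Optional[Tuple[int, int]]:
    """Return (start, end) chunk indices of the first utterance, or None.

    'Listening' starts at the first chunk louder than `threshold` and stops
    once the input has stayed at or below `threshold` for `silence_seconds`.
    `end` is exclusive and does not include the trailing silence.
    """
    start = None
    last_loud = None
    quiet_time = 0.0
    for i, level in enumerate(loudness):
        if level > threshold:
            if start is None:
                start = i          # first loud chunk: start listening
            last_loud = i
            quiet_time = 0.0       # any loud chunk resets the pause timer
        elif start is not None:
            quiet_time += chunk_seconds
            if quiet_time >= silence_seconds:
                break              # pause was long enough: stop listening
    if start is None:
        return None
    return start, last_loud + 1


# Assumed per-chunk loudness values, one value per 0.5 s of audio.
levels = [10, 20, 900, 1200, 50, 800, 30, 20, 10, 950]
print(find_utterance(levels, threshold=500, chunk_seconds=0.5, silence_seconds=1.5))
```

With a longer `silence_seconds`, the short quiet chunk in the middle of the example is treated as a pause inside one utterance. With a shorter value, the same pause would end the recording early.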