A small script that types what you say using whisper while holding a hotkey
I have only tested this on Linux, but it should work on Windows and Mac as well
Prerequisites for macOS users to use PyAudio
brew install portaudio
Install python requirements:
pip3 install -r requirements.txt
python3 main.py
python3 main.py --help
usage: main.py [-h] [--model_name {tiny.en,tiny,base.en,base,small.en,small,medium.en,medium,large-v1,large-v2,large}] [--mic_device MIC_DEVICE] [--inference_device INFERENCE_DEVICE] [--activation_key ACTIVATION_KEY] [--keep_leading_whitespace] [--zoomer_mode]
a small script that types what you say using whisper while holding a hotkey
options:
-h, --help show this help message and exit
--model_name {tiny.en,tiny,base.en,base,small.en,small,medium.en,medium,large-v1,large-v2,large-v3,large}
the name of the whisper model to use
--mic_device MIC_DEVICE
the name of the pyaudio input device to use
--inference_device INFERENCE_DEVICE
the device to run the inference on. can be cpu, cuda, or cuda:<device number> will automatically select the best device if not specified
--activation_key ACTIVATION_KEY
the key to use for push to talk. can be like <ctrl_r> or <alt_l> or <f1> or e or 1 or 2 or 3 etc
--keep_leading_whitespace
keep the leading space that whisper outputs
--zoomer_mode makes everything lowercase and removes all trailing periods
Defaults
model_name: "base.en"
mic_device: "default"
inference_device: None
activation_key: "<ctrl_r>"
keep_leading_whitespace: False
zoomer_mode: False