Streaming SenseVoice processes inference in chunks of SenseVoice.
- transcribe wav file
$ python main.py
- transcribe from microphone
$ python realtime.py
- transcribe from websocket
A basic WebSocket service built with Recorder
and FastAPI
; the frontend uses MP3
format to transmit audio information to reduce latency and increase stability.
pip install -r requirements-ws-demo.txt
python realtime_ws_server_demo.py
# check cli options
python realtime_ws_server_demo.py --help