Speech2text #39

mgv99 · 2023-10-19T14:27:16Z

add Speech2Text NLP component

solves #25

- add audio recorder component in streamlit_ui; it sends the recorded bytes to the websocket server as a base64-encoded string
- add voice handling in TelegramPlatform and WebSocketPlatform
- add WEBSOCKET_MAX_SIZE property to adjust the maximum message size (voice messages were too big for the previous limit, so the size limit can now be lifted to allow sending audio)
- add new Speech2Text component in NLPEngine
- add HFSpeech2Text implementation of Speech2Text, which loads a Hugging Face model (currently tested only with openai/whisper-* models)
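For reviewers who want to picture the voice transport described above, here is a minimal sketch of the base64 round trip. It is not the code in this PR. The JSON schema (`action`, `message`), the `"user_voice"` value, and the helper names are hypothetical, and `fits_limit` only imitates the kind of check a maximum message size implies. Base64 turns every 3 bytes into 4 characters, which is one reason voice messages can run into a size limit.

```python
import base64
import json
from typing import Optional


def encode_voice_message(audio: bytes) -> str:
    """Wrap raw audio bytes in a JSON text message (hypothetical schema)."""
    payload = {
        "action": "user_voice",  # hypothetical field name and value
        "message": base64.b64encode(audio).decode("ascii"),
    }
    return json.dumps(payload)


def decode_voice_message(raw: str) -> bytes:
    """Recover the original audio bytes from the JSON text message."""
    payload = json.loads(raw)
    return base64.b64decode(payload["message"])


def fits_limit(raw: str, max_size: Optional[int]) -> bool:
    """Return True if the encoded message fits; None is treated as 'no limit'."""
    return max_size is None or len(raw.encode("utf-8")) <= max_size


# Placeholder bytes standing in for a recording.
audio = bytes(range(256)) * 4
wire = encode_voice_message(audio)
restored = decode_voice_message(wire)
```

In this sketch the sender would call `encode_voice_message` before writing to the socket, and the receiver would call `decode_voice_message` before handing the bytes to a speech-to-text component.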

mgv99 added 3 commits October 19, 2023 14:25

hide tensorflow and huggingface warnings

279d19d

add Speech2Text docs

6c041e6

Aran30 approved these changes Oct 20, 2023


Aran30 mentioned this pull request Oct 20, 2023

Specify different speech2text models based on language. #41

Open

mgv99 merged commit 84672b9 into dev Oct 20, 2023

Aran30 deleted the speech2text branch August 26, 2024 14:37
