Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Speech2text #39

Merged
merged 3 commits into from
Oct 20, 2023
Merged

Speech2text #39

merged 3 commits into from
Oct 20, 2023

Conversation

mgv99
Copy link
Collaborator

@mgv99 mgv99 commented Oct 19, 2023

add Speech2Text NLP component

solves #25

mgv99 added 3 commits October 19, 2023 14:25
add audio recorder component in streamlit_ui, it sends the recorded bytes to the websocket server as a string encoded in base64

add voice handling in TelegramPlatform and WebSocketPlatform

add WEBSOCKET_MAX_SIZE property to adjust the max size of messages (voice messages were too big before, so now we can ignore the size limit and send audios)

add new Speech2Text component in NLPEngine

add HFSpeech2Text implementation of Speech2Text, that loads a Hugging Face model (currently tested only with openai/whisper-* models)
@mgv99 mgv99 merged commit 84672b9 into dev Oct 20, 2023
@Aran30 Aran30 deleted the speech2text branch August 26, 2024 14:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants