[Voice-QuickFix] Start speaking filler immediately #496

Merged: 2 commits into main from farzad/filler-sentence-quickfix on Nov 15, 2023


farzadab
Contributor

I noticed that the filler sentences (or "interjections", as Peter calls them, which I'm not a total fan of) don't start until after a long pause, and seem to start only once the corpus response is ready, which defeats the purpose. The cause appears to be that the filler was not recognized as a separate sentence, so TTS was never called for it.

This PR adds a newline between separate AssistantMessage responses from Fixie to solve this issue for Fixie calls.
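
To make the failure mode concrete, here is a minimal sketch, not the actual ai-jsx code. It assumes a sentence chunker that only releases a sentence to TTS once terminal punctuation is followed by whitespace. `completeSentences` and `joinMessages` are hypothetical names, and appending the separator after each message is an assumption about how the fix is applied.

```typescript
// Hypothetical stand-in for the sentence chunker in front of TTS: a sentence
// counts as complete only when ., ! or ? is followed by whitespace.
function completeSentences(buffer: string): string[] {
  const sentences: string[] = [];
  const boundary = /[.!?]\s+/g;
  let start = 0;
  let match: RegExpExecArray | null;
  while ((match = boundary.exec(buffer)) !== null) {
    const end = match.index + 1; // keep the punctuation mark
    sentences.push(buffer.slice(start, end).trim());
    start = boundary.lastIndex; // skip the whitespace after it
  }
  return sentences;
}

// Hypothetical helper: concatenate the text of separate assistant messages,
// appending `separator` after each one (assumed placement).
function joinMessages(messages: string[], separator: string): string {
  return messages.map((m) => m + separator).join("");
}

const filler = "Let me check on that.";

// Without a separator the filler never reaches a sentence boundary,
// so nothing is handed to TTS while the real answer is still pending.
const stalled = completeSentences(joinMessages([filler], ""));

// With a trailing newline the filler is released as soon as it arrives.
const released = completeSentences(joinMessages([filler], "\n"));
```

Under these assumptions any whitespace separator produces the boundary, so the choice between a newline and a space is about how the joined text should read rather than about whether TTS gets triggered.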

Note that I did not investigate what happens in the other cases (LLM not from Fixie).

Logs before: notice no chunk is being requested from TTS for a long time.
[Screenshot 2023-11-15 at 9 47 08 AM]

Logs after: TTS is now called before the actual content is ready. It's still hard to tell what's happening underneath just by listening, though, because there's no gap between the end of the filler TTS and the start of the actual content.
[Screenshot 2023-11-15 at 9 43 31 AM]

vercel bot commented Nov 15, 2023

The latest updates on your projects.

| Name | Status | Updated (UTC) |
| --- | --- | --- |
| ai-jsx-docs | ✅ Ready | Nov 15, 2023 7:19pm |
| ai-jsx-nextjs-demo | ✅ Ready | Nov 15, 2023 7:19pm |
| ai-jsx-tutorial-nextjs | ✅ Ready | Nov 15, 2023 7:19pm |
| ai-jsx-voice | ✅ Ready | Nov 15, 2023 7:19pm |

Contributor

juberti left a comment

A newline seems kind of artificial here; perhaps this should just be a space?

Contributor Author

farzadab commented Nov 15, 2023

Sure, either works. I kinda see different assistant messages as being more separate (i.e., totally separate paragraphs even; otherwise why separate them?), but I don't have a strong opinion here. I'll change it to a space.

Contributor

petersalas left a comment

FWIW I don't really love "interjection" either as it's more of a description of the implementation (the code is interjecting w.r.t. the LLM) rather than the result. But that was the best I came up with 🤷‍♂️

farzadab merged commit 03f2c72 into main on Nov 15, 2023
7 checks passed
farzadab deleted the farzad/filler-sentence-quickfix branch on November 15, 2023 19:50
3 participants