Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Supports OpenAI's TTS and STT APIs #324

Open
zhanghx0905 opened this issue Oct 11, 2024 · 4 comments
Open

Supports OpenAI's TTS and STT APIs #324

zhanghx0905 opened this issue Oct 11, 2024 · 4 comments
Labels

Comments

@zhanghx0905
Copy link

I'm wondering if the project currently supports OpenAI's TTS and STT APIs, or if there are any plans to integrate them.

@plutoless
Copy link
Contributor

the one realtime api use? or a separate one.

@zhanghx0905
Copy link
Author

the one realtime api use? or a separate one.

Separate TTS and STT api,

@plutoless
Copy link
Contributor

@zhanghx0905 openai's STT/TTS is not stream based, they can only process files. so they are not that ideal in realtime cases.

@zhanghx0905
Copy link
Author

@zhanghx0905 openai's STT/TTS is not stream based, they can only process files. so they are not that ideal in realtime cases.

You may take a look at the livekit-agent GitHub repository. I tried their OpenAI plugin and adapt it to Chinese. I found it works just like a streaming service.

By the way, I have locally deployed TTS (Text-to-Speech) / STT (Speech-to-Text) services. In order to integrate them into applications compatible with the OpenAI API, I wrapped them in the OpenAI API format. Therefore, I hope you will also consider these APIs.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants