automatic reference text transcription #28

Closed
cocktailpeanut opened this issue Nov 26, 2024 · 2 comments
Labels
enhancement (New feature or request), todo

Comments

@cocktailpeanut

Currently, the voice cloning feature doesn't seem to work unless you provide an accurate transcription of the reference audio, which is tedious.

This could be fixed by incorporating Whisper to transcribe the reference audio automatically, instead of making users enter the reference text manually. It would make a huge difference, since most people are too lazy to transcribe audio clips.

@edwko
Owner

edwko commented Nov 26, 2024

Thanks for the suggestion, I’ll look into adding this.

edwko added the enhancement (New feature or request) and todo labels Nov 26, 2024
edwko added a commit that referenced this issue Nov 30, 2024
Added Whisper-based transcription for speaker creation when `transcript` is None (#28).
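
For readers who want to see the shape of that fallback, here is a minimal sketch, not the project's actual code. It assumes the `openai-whisper` package (`whisper.load_model` and `model.transcribe`) and uses hypothetical helper names, `whisper_transcriber` and `resolve_transcript`, that do not come from this repository. The only behavior taken from the commit message is that transcription runs when `transcript` is None.

```python
from typing import Callable, Optional


def whisper_transcriber(model_name: str = "base") -> Callable[[str], str]:
    """Build a transcriber backed by openai-whisper (assumed to be installed)."""
    # Imported lazily so the rest of the module works without Whisper.
    import whisper

    model = whisper.load_model(model_name)

    def transcribe(audio_path: str) -> str:
        # model.transcribe returns a dict; the full text is under "text".
        return model.transcribe(audio_path)["text"].strip()

    return transcribe


def resolve_transcript(
    audio_path: str,
    transcript: Optional[str] = None,
    transcriber: Optional[Callable[[str], str]] = None,
) -> str:
    """Return the reference text, transcribing only when none was supplied."""
    if transcript is not None:
        # A user-provided transcript always wins; no model is loaded.
        return transcript
    if transcriber is None:
        # Hypothetical default: fall back to a Whisper-backed transcriber.
        transcriber = whisper_transcriber()
    return transcriber(audio_path)
```

Passing the transcriber in as a callable keeps the Whisper dependency optional and makes the fallback easy to exercise with a stub.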
@edwko
Owner

edwko commented Nov 30, 2024

Added in the 0.2.1 release :)

edwko closed this as completed Nov 30, 2024