Does the model support other languages? #149

Closed
jmmaa opened this issue Sep 1, 2022 · 5 comments

Comments

@jmmaa

jmmaa commented Sep 1, 2022

Just curious about this project: can the model also be extended to other languages, such as Chinese or Japanese?

@jmmaa jmmaa changed the title Does the language support other languages? Does the model support other languages? Sep 1, 2022
@akshhack

@jmmaa wanna collaborate on building that together? If so, write me at akshatp.cs@gmail.com -- I'm a CMU AI grad with tremendous interest in this space. Always looking for cool people to work with.

@neonbjb
Owner

neonbjb commented Sep 14, 2022

It does not support other languages, but the techniques used to build it are general and could extend to other languages if someone had a dataset big enough and the compute/time to train it.

There are a couple of people interested in this @akshhack, try looking around in some of the other issues or discussions if @jmmaa doesn't get back to you.

In general, I would be happy to support anyone training a version of Tortoise for another language provided they will commit to open sourcing the inference models like I did with English.

@neonbjb neonbjb closed this as completed Sep 14, 2022
@akshhack

@neonbjb I'd be happy to commit to that and open source my work. I really want to democratize access to this brilliant technology in my mother tongue, Hindi. Of course, I'm light years away from such sophisticated work, but I'd love your help in getting something started. Thank you.

What's the best place for me to reach out to you? jbetker@gmail.com?

@neonbjb
Owner

neonbjb commented Sep 15, 2022

Hey there,
Yeah, sure - feel free to reach out. To be clear, I don't really have the time to do much hand-holding, but I can definitely provide high-level guidance and answer blocking questions as you work on this. The first step will be gathering a dataset for your target language and figuring out how to transcribe it. It looks like there is a wav2vec model for Hindi, so a good starting point might be integrating that with ocotillo and running some experiments to see what quality of transcriptions you get.
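
A common way to put a number on transcription quality in experiments like these is word error rate (WER): the word-level edit distance between a hand-checked reference transcript and the model's output, divided by the number of reference words. The thread does not name a metric, so treat this as an assumption; the function name `word_error_rate` and the sample strings are hypothetical, and this minimal sketch uses only the Python standard library rather than anything from ocotillo or wav2vec.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words to match an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: a hand-checked transcript versus a model's output.
reference = "the quick brown fox"
hypothesis = "the quack brown"
print(word_error_rate(reference, hypothesis))
```

Whitespace splitting is a simplification. For Hindi or other languages, text normalization (punctuation, numerals, Unicode forms) would likely need to be applied to both strings before comparing them.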

@mathigatti

mathigatti commented Oct 18, 2022

Wouldn't it be possible to use a multilingual wav2vec model and a phonetic representation of the text (such as IPA) to make it work for any language at once?
