Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How you have trained the LID model #1

Closed
Durgesh92 opened this issue May 15, 2021 · 5 comments
Closed

How you have trained the LID model #1

Durgesh92 opened this issue May 15, 2021 · 5 comments

Comments

@Durgesh92
Copy link

Can you share your LID training recipe and data preparation guide?

@igorsitdikov
Copy link
Owner

Yes, sure. It's not a secret. I used https://github.com/kaldi-asr/kaldi/blob/master/egs/sre16/v2/run.sh. In utt2spk file I used utt_id lang instead of utt_id speaker

@Durgesh92
Copy link
Author

Durgesh92 commented May 15, 2021

Thanks, and to use the trained model with your vosk modified src what's the model structure? Can you please share your trained model to test?

@igorsitdikov
Copy link
Owner

@Durgesh92
Copy link
Author

Thanks for the quick response. Also, I have a question about training. How much data do you recommend for each language? also is it necessary to have an even distribution of data volume for each language?

@igorsitdikov
Copy link
Owner

https://arxiv.org/abs/2011.12998

This issue was closed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants