Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

v0.6.2 #1353

Merged
merged 40 commits into from
Apr 20, 2022
Merged

v0.6.2 #1353

merged 40 commits into from
Apr 20, 2022

Conversation

erogol
Copy link
Member

@erogol erogol commented Mar 8, 2022

No description provided.

Edresson and others added 19 commits March 9, 2022 16:18
* Fix recipes -> #1366

* Fix trainer docs
…ctrograms (#1348)

* Add support for the speaker encoder training using torch spectrograms

* Remove useless function in speaker encoder dataset class
* Add alphas to control language and speaker balancer

* Add docs for speaker and language samplers

* Change the Samplers weights to float for save memory

* Change the test_samplers to unittest format

* Add get_sampler method in BaseTTS

* Fix rebase issues

* Add language and speaker samplers support for DDP training

* Rename distributed sampler wrapper

* Remove the DistributedSamplerWrapper and use the one from Trainer

* Bugfix after rebase

* Move the samplers config to tts config
* Add support for voice conversion inference

* Cache d_vectors_by_speaker for fast inference using a bigger speakers.json

* Rebase bug fix

* Use the average d-vector for inference
* Add bug_report template

* Fix typos
* Add support for voice conversion inference

* Cache d_vectors_by_speaker for fast inference using a bigger speakers.json

* Rebase bug fix

* Use the average d-vector for inference

* Fix the bug in find unique chars script

* Add OpenBible formatter

Co-authored-by: Eren Gölge <erogol@hotmail.com>
… Emotion Encoder training support (#1349)

* Rename Speaker encoder module to encoder

* Add a generic emotion dataset formatter

* Transform the Speaker Encoder dataset to a generic dataset and create emotion encoder config

* Add class map in emotion config

* Add Base encoder config

* Add evaluation encoder script

* Fix the bug in plot_embeddings

* Enable Weight decay for encoder training

* Add argumnet to disable storage

* Add Perfect Sampler and remove storage

* Add evaluation during encoder training

* Fix lint checks

* Remove useless config parameter

* Active evaluation in speaker encoder test and use multispeaker dataset for this test

* Unit tests fixs

* Remove useless tests for speedup the aux_tests

* Use get_optimizer in Encoder

* Add BaseEncoder Class

* Fix the unitests

* Add Perfect Batch Sampler unit test

* Add compute encoder accuracy in a function
* Update model file ext to ```.pth```

* Update docs

* Rename more

* Find model files
@erogol
Copy link
Member Author

erogol commented Mar 22, 2022

@erogol : please add luxembourgish (lb) in the gruut language list. Thank you.

I don't need to add it manually. It is already available as Gruut supports it.

erogol and others added 8 commits March 23, 2022 12:57
* Fix manager

* Make style
* Add style checks to `make lint`

* Bump target-version in black config
Replace webrtcvad by silero-vad
* Fix the bug in freeze encoder

* Remove emb_l definition for non-multilingual training

* Fix unit tests
* Enforce phonemizer definition for synthesis

* Fix train_tts, tokenizer init can now edit config

* Add small change to trigger CI pipeline

* fix wrong output path for one tts_test

* Fix style

* Test config overides by args and tokenizer

* Fix style
* Fix enforce phonemizer

* Add new models

* Fix .model.json
@erogol erogol merged commit c410bc5 into main Apr 20, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants