Update #1

freds0 · 2023-11-29T20:23:08Z

No description provided.

commit dd612fd Author: JiangCheng <jiangcheng@kezaihui.com> Date: Mon Jun 5 16:04:54 2023 +0800 Failed to download the file and need to delete the created file path

Resolve conflicts

* Don't install MeCab by default * Add optional [ja] deps, like [dev] etc * Add JA requirements file * Add JA requirements to requirements_all This should help the tests run.

* Separate API tests and only run when uplifted * Make style

* add configs * Update config file * Add model configs * Add model layers * Add layer files * Add layer modules * change config names * Add emotion manager * fIX missing ap bug * Fix missing ap bug * Add base TTS e2e class * Fix wrong variable name in load_tts_samples * Add training script * Remove range predictor and gaussian upsampling * Add helper function * Add vctk recipe * Add conformer docs * Fix linting in conformer.py * Add Docs * remove duplicate import * refactor args * Fix bugs * Removew emotion embedding * remove unused arg * Remove emotion embedding arg * Remove emotion embedding arg * fix style issues * Fix bugs * Fix bugs * Add unittests * make style * fix formatter bug * fix test * Add pyworld compute pitch func * Update requirments.txt * Fix dataset Bug * Chnge layer norm to instance norm * Add missing import * Remove emotions.py * remove ssim loss * Add init layers func to aligner * refactor model layers * remove audio_config arg * Rename loss func * Rename to delightful-tts * Rename loss func * Remove unused modules * refactor imports * replace audio config with audio processor * Add change sample rate option * remove broken resample func * update recipe * fix style, add config docs * fix tests and multispeaker embd dim * remove pyworld * Make style and fix inference * Split tts tests * Fixup * Fixup * Fixup * Add argument names * Set "random" speaker in the model Tortoise/Bark * Use a diff f0_cache path for delightfull tts * Fix delightful speaker handling * Fix lint * Make style --------- Co-authored-by: loganhart420 <loganartpersonal@gmail.com> Co-authored-by: Eren Gölge <erogol@hotmail.com>

* Remove key prunning in tortoise * Make lint

… when speaker_id is None or not passed, fixes onnx exporting for models with init_discriminator=false (#2816)

* Changes from jhlfrfufyfn <jhlfrfufyfn@gmail.com> * Recipe for Belarusian TTS --------- Co-authored-by: jhlfrfufyfn <jhlfrfufyfn@gmail.com>

* fix: wrong import class * fix: formatter name missing * feat: get rid of clearml

* Fix tests * Make style

…rors (#2831)

* Handle missing JA phonemizer * Make style

Update versions

…ing (#3241) * Ensures that only GPT model is in training mode during training * Fix parallel wavegan unit test

Run CI for v0.20.6

Remove duplicate/unused code

…tic class variable (#3297)

…ment (#3294) In multilingual models, the target language is specified via the `--language_idx` argument. However, the `tts` CLI also accepts a `--language` argument for use with Coqui Studio, so it is easy to choose the wrong one, resulting in the following confusing error at synthesis time: ``` AssertionError: ❗ Language None is not supported. Supported languages are ['en', 'es', 'fr', 'de', 'it', 'pt', 'pl', 'tr', 'ru', 'nl', 'cs', 'ar', 'zh-cn', 'hu', 'ko', 'ja'] ``` This commit adds a better error message when `--language` is passed for a non-studio model. Fixes #3270, fixes #3291

Previously, the text was wrapped in an additional set of quotes that was passed to Espeak. This could result in different phonemization in certain edges and caused the insertion of an initial separator "_" that had to be removed. Compare: $ espeak-ng -q -b 1 -v en-us --ipa=1 '"A"' _ˈɐ $ espeak-ng -q -b 1 -v en-us --ipa=1 'A' ˈeɪ Fixes #2619

* Revert "fix for issue 3067" This reverts commit 041b4b6. Fixes #3143. The original issue (#3067) was people trying to use tts.tts_with_vc_to_file() with XTTS and was "fixed" in #3109. But XTTS has integrated VC and you can just do tts.tts_to_file(..., speaker_wav="..."), there is no point in passing it through FreeVC afterwards. So, reverting this commit because it breaks tts.tts_with_vc_to_file() for any model that doesn't have integrated VC, i.e. all models this method is meant for. * fix: support multi-speaker models in tts_with_vc/tts_with_vc_to_file * fix: only compute spk embeddings for models that support it Fixes #1440. Passing a `speaker_wav` argument to regular Vits models failed because they don't support voice cloning. Now that argument is simply ignored.

… `model_path` (#3273) * load multilingual model by path * use config to assert multi lingual or not

* Moved Dockerfile, COPY at the end This change should prevent re-installation of the dependencies upon every change of the repository's contents. Typically if Docker detects that something changed in a layer, all downstream layers are invalidated and rebuilt. * Moved Dockerfile back to main directory Main dockerfile in a separate directory can cause issues with the current CI/CD setup. This can be a good change for later. * Introduced Dockerfile.dev, updated CONTRIBUTING Dockerfile.dev can be used as a separate development environment for anyone that does not wish to install the dependencies locally.

ZhouGongZaiShi and others added 30 commits July 4, 2023 11:38

delete meaningless print() (#2662)

d5f16d7

fixed small spelling mistakes (#2551)

d611067

Update README.md

229cfbd

Fix typo

e42a72e

Squashed commit of the following:

53938e2

commit dd612fd Author: JiangCheng <jiangcheng@kezaihui.com> Date: Mon Jun 5 16:04:54 2023 +0800 Failed to download the file and need to delete the created file path

Merge pull request #2741 from coqui-ai/merge_2651

08bc758

Resolve conflicts

Export multispeaker onnx (#2743)

7b5c842

Fix #2745 (#2748)

a2984fb

Bump up to v0.15.6

b5cd644

Fix #2749 (#2750)

672ec3b

Fix share model page URL (#2757)

e5fb0d9

Make Japanese-specific dependencies optional (#2776)

c0aabb8

* Don't install MeCab by default * Add optional [ja] deps, like [dev] etc * Add JA requirements file * Add JA requirements to requirements_all This should help the tests run.

API tests (#2790)

0de12ec

* Separate API tests and only run when uplifted * Make style

Test synthesize api separately

1652598

Update README

f24c5e0

Update README.md

b3472a7

Fix Tortoise load (#2791)

8aacb81

* Remove key prunning in tortoise * Make lint

Bump up to v0.16.0

b739326

Adds multi-language support for VITS onnx, fixes onnx inference error…

c140df5

… when speaker_id is None or not passed, fixes onnx exporting for models with init_discriminator=false (#2816)

Recipe for Belarusian TTS (#2756)

d124f78

* Changes from jhlfrfufyfn <jhlfrfufyfn@gmail.com> * Recipe for Belarusian TTS --------- Co-authored-by: jhlfrfufyfn <jhlfrfufyfn@gmail.com>

Delightful TTS VCTK recipe fixes (#2808)

9e74b51

* fix: wrong import class * fix: formatter name missing * feat: get rid of clearml

Add kwargs to ignore extra arguments w/o error (#2822)

483888b

Fix DelightfulTTS (#2823)

69f080e

* Fix tests * Make style

Please p3.11

17ddd65

Bump up to v0.16.1

dc04baa

add post functionality to /api/tts (#2836)

52a528c

Add fairseq onnx support and strict configuration, fixes some onnx er…

4e7f8cd

…rors (#2831)

Fix imports (#2845)

48f8133

Handle missing JA phonemizer (#2843)

4186f42

* Handle missing JA phonemizer * Make style

eginhard and others added 29 commits November 17, 2023 01:18

refactor: use save_checkpoint()/save_best_model() from Trainer

0fb0d67

Update versions

63d7145

Update CI version

08d11e9

Make k_diffusion optional

26efdf6

Make style

44880f0

Merge pull request #3248 from coqui-ai/slacker_deps

14579a4

Update versions

Ensures that only GPT model is in training mode during XTTS GPT train…

11283fc

…ing (#3241) * Ensures that only GPT model is in training mode during training * Fix parallel wavegan unit test

Update versions

c864acf

Update CI version

44494da

Make k_diffusion optional

f21067a

Make style

a3279f9

Ensures that only GPT model is in training mode during XTTS GPT train…

6075fa2

…ing (#3241) * Ensures that only GPT model is in training mode during training * Fix parallel wavegan unit test

Update model hash for v2.0.2

52cb1e2

Update to v0.20.6

c011ab7

Merge pull request #3249 from coqui-ai/run_ci_for_v0.20.6

29dede2

Run CI for v0.20.6

Merge pull request #3243 from idiap/checkpoints

b47d9c6

Remove duplicate/unused code

Made the tqdm progress_bar objects of static download methods a sta…

64f391b

…tic class variable (#3297)

Misjudgment of is_multi_lingual When Loading Multilingual Model via…

4d0f53d

… `model_path` (#3273) * load multilingual model by path * use config to assert multi lingual or not

update deepspeed version (#3281)

a55755c

Update to XTTS v2.0.3

6dd43b0

Update to v0.21.0

1542a50

Simple text cleaner for "hi"

3206513

Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev

7e57506

Update to v0.21.1

00a870c

Add hi in config defaults

11ec9f7

freds0 merged commit 77c2155 into freds0:dev Nov 29, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update #1

Update #1

freds0 commented Nov 29, 2023

Update #1

Update #1

Conversation

freds0 commented Nov 29, 2023