Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

XTTS v1.1 #3089

Merged
merged 8 commits into from
Oct 20, 2023
Merged

XTTS v1.1 #3089

merged 8 commits into from
Oct 20, 2023

Conversation

WeberJulian
Copy link
Contributor

@WeberJulian WeberJulian commented Oct 20, 2023

XTTS v1.1

This model is trained on top of XTTS v1, using output masking. We mask the part of the output that is used as the audio prompt while training and don't compute loss for that segment. This helps us to resolve the hallucination issue that V1 experienced.

Changes

  • Add Japanese
  • Resolve the hallucination issue (repeating the audio prompt)
  • Increased expressivity
  • Hash check to control model version
  • Added ne_hifigan that was trained without denoising that brought some EQ and compression profile that might be unwanted for some use-cases

@CLAassistant
Copy link

CLAassistant commented Oct 20, 2023

CLA assistant check
All committers have signed the CLA.

TTS/tts/models/xtts.py Outdated Show resolved Hide resolved
@Edresson Edresson self-requested a review October 20, 2023 12:27
@erogol erogol merged commit cf97116 into dev Oct 20, 2023
48 checks passed
@erogol erogol deleted the xtts_v1.1 branch October 20, 2023 14:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants