OuteTTS v0.2.0 Release

@edwko edwko released this 25 Nov 12:08

OuteTTS v0.2.0 Release Notes

Major Changes

  • New Model Support: Added support for OuteTTS-0.2-500M model
  • Speaker Management: Introduced default speaker presets for each supported language
  • Breaking Changes:
    • Speaker files from previous versions (<0.2.0) are not compatible
    • Interface usage has been significantly revised (see README.md for new implementation)

New Features

  • Added voice cloning guidelines and interface usage recommendations in README.md
  • Implemented Gradio example playground for OuteTTS-0.2-500M
  • Multi-language alignment support
  • Enhanced speaker management:
    • New methods: interface.print_default_speakers() and interface.load_default_speaker(name="male_1")
    • Switched from pickle to JSON format for speaker saving
    • Added speaker language information in saved files
  • Option to load WavTokenizer from custom path (resolves issue #24)
  • Added a single function that initializes any supported interface version
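
To illustrate why the move from pickle to JSON matters, here is a minimal sketch of saving and loading a speaker profile as JSON with a language field. It is not the library's implementation: the function names `save_speaker` and `load_speaker`, and the keys `language`, `text`, and `words`, are hypothetical and chosen only for illustration. The point is that a JSON file is human-readable and can be loaded without executing arbitrary code, unlike a pickle.

```python
import json
import os
import tempfile


def save_speaker(speaker: dict, path: str) -> None:
    """Write a speaker profile to a UTF-8 JSON file (hypothetical schema)."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(speaker, f, ensure_ascii=False, indent=2)


def load_speaker(path: str) -> dict:
    """Read a speaker profile and check that it records its language."""
    with open(path, "r", encoding="utf-8") as f:
        speaker = json.load(f)
    if "language" not in speaker:
        # Assumption: files without a language field are treated as
        # incompatible, mirroring the breaking change noted above.
        raise ValueError(f"{path} has no 'language' field")
    return speaker


# Round trip through a temporary directory.
example = {"language": "en", "text": "hello world", "words": []}
with tempfile.TemporaryDirectory() as tmp:
    speaker_path = os.path.join(tmp, "speaker.json")
    save_speaker(example, speaker_path)
    restored = load_speaker(speaker_path)
```

A file written this way can be inspected or edited in any text editor, which is not possible with a pickle.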

Improvements

  • Restructured library files for better organization
  • Implemented hash verification for WavTokenizer downloads (resolves issue #3)
  • Reworked interface for better usability
  • Made sounddevice optional with improved error handling for sound playback
  • Added data preparation examples for training
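
As a rough sketch of what hash verification for a downloaded file can look like, the snippet below compares a file's digest against an expected value. The release notes do not state which hash algorithm the library uses, so SHA-256 is an assumption here, and the function names `sha256_of_file` and `verify_download` are hypothetical.

```python
import hashlib
import os
import tempfile


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read fixed-size chunks so large checkpoints do not fill memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path: str, expected_sha256: str) -> None:
    """Raise ValueError if the file does not match the expected digest."""
    actual = sha256_of_file(path)
    if actual != expected_sha256.strip().lower():
        raise ValueError(
            f"hash mismatch for {path}: expected {expected_sha256}, got {actual}"
        )


# Demonstration on a small temporary file standing in for a download.
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "checkpoint.bin")
    with open(demo_path, "wb") as f:
        f.write(b"hello")
    demo_digest = sha256_of_file(demo_path)
    verify_download(demo_path, demo_digest)  # passes silently
```

A mismatch usually means a truncated or corrupted download, so the natural response is to delete the file and fetch it again.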

Error Handling

  • Added validation for audio token detection
  • Improved error messages for long input text and early EOS cases
  • Enhanced overall library error handling and feedback

How to Upgrade

  • Update your library via pip:
    pip install --upgrade outetts
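
After upgrading, one way to confirm that the installed version is at least 0.2.0 is to query the package metadata from Python. This is a minimal sketch using only the standard library; the helper names `parse_release` and `installed_at_least` are hypothetical, and the version parsing is deliberately simple (it looks only at the leading numeric parts).

```python
from importlib.metadata import PackageNotFoundError, version
from itertools import takewhile


def parse_release(version_string: str) -> tuple:
    """Turn '0.2.0' or '0.2' into a 3-part tuple such as (0, 2, 0)."""
    parts = []
    for piece in version_string.split("."):
        digits = "".join(takewhile(str.isdigit, piece))
        if not digits:
            break  # stop at the first non-numeric component
        parts.append(int(digits))
    # Pad so that '0.2' and '0.2.0' compare as equal.
    return tuple((parts + [0, 0, 0])[:3])


def installed_at_least(package: str, minimum: str) -> bool:
    """Return True if the package is installed at or above the minimum."""
    try:
        installed = version(package)
    except PackageNotFoundError:
        return False
    return parse_release(installed) >= parse_release(minimum)


if __name__ == "__main__":
    if installed_at_least("outetts", "0.2.0"):
        print("outetts is at version 0.2.0 or newer")
    else:
        print("outetts is missing or older than 0.2.0")
```

Because speaker files from versions before 0.2.0 are not compatible, a check like this can be useful before attempting to load saved speakers.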