OuteTTS v0.2.0 Release

edwko released this 25 Nov 12:08

· 71 commits to main since this release

c2d413b

OuteTTS v0.2.0 Release Notes

Major Changes

New Model Support: Added support for OuteTTS-0.2-500M model
Speaker Management: Introduced default speaker presets for each supported language
Breaking Changes:
- Speaker files from previous versions (<0.2.0) are not compatible
- Interface usage has been significantly revised (see README.md for new implementation)

New Features

Added voice cloning guidelines and interface usage recommendations in README.md
Implemented Gradio example playground for OuteTTS-0.2-500M
Multi-language alignment support
Enhanced speaker management:
- New methods: interface.print_default_speakers() and interface.load_default_speaker(name="male_1")
- Switched from pickle to JSON format for speaker saving
- Added speaker language information in saved files
Option to load WavTokenizer from custom path (resolves issue #24)
Multiple interface version initialization in a single function

Improvements

Restructured library files for better organization
Implemented hash verification for WavTokenizer downloads (resolves issue #3)
Reworked interface for better usability
Made sounddevice optional with improved error handling for sound playback
Added data preparation examples for training

Error Handling

Added validation for audio token detection
Improved error messages for long input text and early EOS cases
Enhanced overall library error handling and feedback

How to Upgrade

Update your library via pip:
```
pip install --upgrade outetts
```

Assets 2