v0.6.2 #1353

Merged 40 commits on Apr 20, 2022

Commits
c670365
Fix VCTK recipe and formatter
erogol Mar 8, 2022
d792b78
Fix multilingual recipe (#1354)
Edresson Mar 9, 2022
48f6bb4
Fix recipes as to the recent API changes. (#1367)
erogol Mar 10, 2022
8a007c8
feat: add docsqa to docs website (#1363)
nomagick Mar 10, 2022
07d96f7
Fix DocQA title
erogol Mar 10, 2022
f381e29
REBASED: Add support for the speaker encoder training using torch spe…
Edresson Mar 10, 2022
917f417
Add alphas to control language and speaker balancer (#1216)
Edresson Mar 10, 2022
dbe9da7
Add Voice conversion inference support (#1337)
Edresson Mar 10, 2022
b0be825
Update issue template (#1370)
erogol Mar 11, 2022
36e9ea2
Open bible dataset formatter (#1365)
Edresson Mar 11, 2022
f818924
REBASED: Transform Speaker Encoder in a Generic Encoder and Implement…
Edresson Mar 11, 2022
24b57f6
Fix typo workflow text (#1403)
WeberJulian Mar 16, 2022
f40b833
Add CITATION.cff (#1404)
erogol Mar 16, 2022
690c96e
Fix default phonemizer for ja and zh (#1399)
WeberJulian Mar 16, 2022
0870a4f
Make style (#1405)
erogol Mar 16, 2022
fd56fab
Fix #1380 (#1409)
erogol Mar 16, 2022
c7f9ec0
Hinge Gruut version to 2.2.3 (#1419)
erogol Mar 18, 2022
2e6e8f6
Update CheckSpectrograms notebook (#1418)
erogol Mar 18, 2022
ccdc230
Add eval_split and eval_split_size in the call of load_tts_samples fo…
Edresson Mar 22, 2022
72d85e5
Update model file extension (#1422)
erogol Mar 22, 2022
1c3623a
Fix model manager (#1436)
erogol Mar 23, 2022
3c7c146
Add formatting tests (#1437)
WeberJulian Mar 23, 2022
3af01cf
Update base model wrt 👟 (#1406)
erogol Mar 23, 2022
ea53d6f
Replace webrtcvad by silero-vad
Edresson Mar 22, 2022
0ae1e02
Fix the bug for emptly audio files
Edresson Mar 22, 2022
3435bc8
Fix style tests
Edresson Mar 23, 2022
464dc65
Merge pull request #1431 from coqui-ai/silero-vad
Edresson Mar 24, 2022
37896e1
Bug fix in freeze encoder (#1391)
Edresson Mar 24, 2022
c66a624
Enforce phonemizer definition for synthesis (#1441)
WeberJulian Mar 25, 2022
1b22f03
Fix G2P backend of the released models (#1461)
WeberJulian Mar 30, 2022
060e0f9
Add EmbeddingManager and BaseIDManager (#1374)
Edresson Mar 31, 2022
164c7dd
Update requirements coqui_trainer -> trainer (#1478)
erogol Apr 8, 2022
27fcb5d
Add Dockerfile and build/push CI
reuben Apr 14, 2022
c18100d
Merge branch 'docker-ci' into dev (Fixes #1498)
reuben Apr 15, 2022
e8573bf
Update CONTRIBUTING.md (#1463)
Jackiexiao Apr 15, 2022
4953636
Add African models (#1511)
WeberJulian Apr 19, 2022
7133f8f
Print Model's license when downloading (#1512)
erogol Apr 19, 2022
b45d5c5
Improve docsQA default questions (#1411)
nomagick Apr 19, 2022
30bea7d
Update manage.py (#1514)
WeberJulian Apr 19, 2022
c410bc5
Bump to v0.6.2
erogol Apr 20, 2022
3 changes: 2 additions & 1 deletion .dockerignore
@@ -1 +1,2 @@
-.git/
+.git/
+Dockerfile
58 changes: 0 additions & 58 deletions .github/ISSUE_TEMPLATE/bug_report.md

This file was deleted.

85 changes: 85 additions & 0 deletions .github/ISSUE_TEMPLATE/bug_report.yaml
@@ -0,0 +1,85 @@
name: "🐛 Bug report"
description: Create a bug report to help 🐸 improve
title: '[Bug] '
labels: [ "bug" ]
body:
  - type: markdown
    attributes:
      value: |
        Welcome to the 🐸TTS! Thanks for taking the time to fill out this bug report!

  - type: textarea
    id: bug-description
    attributes:
      label: Describe the bug
      description: A clear and concise description of what the bug is. If you intend to submit a PR for this issue, tell us in the description. Thanks!
      placeholder: Bug description
    validations:
      required: true

  - type: textarea
    id: reproduction
    attributes:
      label: To Reproduce
      description: |
        Please share your code to reproduce the error.

        Issues are fixed faster if you can provide a working example.

        The best place for sharing code is colab. https://colab.research.google.com/
        So we can directly run your code and reproduce the issue.

        In the worse case, provide steps to reproduce the behavior.

        1. Run the following command '...'
        2. ...
        3. See error
      placeholder: Reproduction
    validations:
      required: true

  - type: textarea
    id: expected-behavior
    attributes:
      label: Expected behavior
      description: "Write down what the expected behaviour"

  - type: textarea
    id: logs
    attributes:
      label: Logs
      description: "Please include the relevant logs if you can."
      render: shell

  - type: textarea
    id: system-info
    attributes:
      label: Environment
      description: |
        You can either run `TTS/bin/collect_env_info.py`

        ```bash
        wget https://raw.githubusercontent.com/coqui-ai/TTS/main/TTS/bin/collect_env_info.py
        python collect_env_info.py
        ```

        or fill in the fields below manually.
      render: shell
      placeholder: |
        - 🐸TTS Version (e.g., 1.3.0):
        - PyTorch Version (e.g., 1.8)
        - Python version:
        - OS (e.g., Linux):
        - CUDA/cuDNN version:
        - GPU models and configuration:
        - How you installed PyTorch (`conda`, `pip`, source):
        - Any other relevant information:
    validations:
      required: true
  - type: textarea
    id: context
    attributes:
      label: Additional context
      description: Add any other context about the problem here.
    validations:
      required: false
56 changes: 56 additions & 0 deletions .github/workflows/docker.yaml
@@ -0,0 +1,56 @@
name: "Docker build and push"
on:
  pull_request:
  push:
    branches:
      - main
      - dev
    tags:
      - v*
jobs:
  docker-build:
    name: "Build and push Docker image"
    runs-on: ubuntu-20.04
    strategy:
      matrix:
        arch: ["amd64"]
    steps:
      - uses: actions/checkout@v2
      - name: Log in to the Container registry
        uses: docker/login-action@v1
        with:
          registry: ghcr.io
          username: ${{ github.actor }}
          password: ${{ secrets.GITHUB_TOKEN }}
      - name: Compute Docker tags, check VERSION file matches tag
        id: compute-tag
        run: |
          set -ex
          base="ghcr.io/coqui-ai/tts"
          tags="" # PR build
          if [[ "${{ startsWith(github.ref, 'refs/heads/') }}" = "true" ]]; then
            # Push to branch
            github_ref="${{ github.ref }}"
            branch=${github_ref#*refs/heads/} # strip prefix to get branch name
            tags="${base}:${branch},${base}:${{ github.sha }},"
          elif [[ "${{ startsWith(github.ref, 'refs/tags/') }}" = "true" ]]; then
            VERSION="v$(cat TTS/VERSION)"
            if [[ "${{ github.ref }}" != "refs/tags/${VERSION}" ]]; then
              echo "Pushed tag does not match VERSION file. Aborting push."
              exit 1
            fi
            tags="${base}:${VERSION},${base}:latest,${base}:${{ github.sha }}"
          fi
          echo "::set-output name=tags::${tags}"
      - name: Set up QEMU
        uses: docker/setup-qemu-action@v1
      - name: Set up Docker Buildx
        id: buildx
        uses: docker/setup-buildx-action@v1
      - name: Build and push
        uses: docker/build-push-action@v2
        with:
          context: .
          platforms: linux/${{ matrix.arch }}
          push: ${{ github.event_name == 'push' }}
          tags: ${{ steps.compute-tag.outputs.tags }}
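Reviewer note: the tag logic in the `compute-tag` step can be tried locally without GitHub Actions. The following is a minimal sketch, not part of this PR. The function name `compute_tags` and its three positional arguments (the Git ref, the commit SHA, and the contents of `TTS/VERSION`) are hypothetical stand-ins for `github.ref`, `github.sha`, and `cat TTS/VERSION`. It mirrors the workflow's behaviour as written, including the trailing comma on branch pushes, and is written in POSIX `sh` rather than the bash used by the workflow.

```shell
# Hypothetical local re-implementation of the workflow's tag computation.
# Usage: compute_tags <git-ref> <commit-sha> <contents-of-TTS/VERSION>
# The body runs in a subshell (parentheses), so variables stay local
# and `exit 1` only ends the function, not the calling shell.
compute_tags() (
  ref="$1"
  sha="$2"
  version_file="$3"
  base="ghcr.io/coqui-ai/tts"
  tags=""  # PR build: no tags
  case "$ref" in
    refs/heads/*)
      # Push to a branch: tag with the branch name and the commit SHA.
      branch="${ref#refs/heads/}"
      tags="${base}:${branch},${base}:${sha},"
      ;;
    refs/tags/*)
      # Push of a tag: it must equal "v" + the VERSION file contents.
      version="v${version_file}"
      if [ "$ref" != "refs/tags/${version}" ]; then
        echo "Pushed tag does not match VERSION file. Aborting push." >&2
        exit 1
      fi
      tags="${base}:${version},${base}:latest,${base}:${sha}"
      ;;
  esac
  printf '%s\n' "$tags"
)
```

For example, `compute_tags refs/tags/v0.6.2 "$(git rev-parse HEAD)" "$(cat TTS/VERSION)"` would list the version, `latest`, and SHA tags when the `VERSION` file contains `0.6.2`, and would fail with a non-zero status otherwise.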
2 changes: 1 addition & 1 deletion .github/workflows/text_tests.yml
@@ -1,4 +1,4 @@
-name: tts-tests
+name: text-tests
 
 on:
   push:
1 change: 1 addition & 0 deletions .gitignore
@@ -115,6 +115,7 @@ venv.bak/
 *.swo
 
 # pytorch models
+*.pth
 *.pth.tar
 result/
 
20 changes: 20 additions & 0 deletions CITATION.cff
@@ -0,0 +1,20 @@
cff-version: 1.2.0
message: "If you want to cite 🐸💬, feel free to use this (but only if you loved it 😊)"
title: "Coqui TTS"
abstract: "A deep learning toolkit for Text-to-Speech, battle-tested in research and production"
date-released: 2021-01-01
authors:
  - family-names: "Eren"
    given-names: "Gölge"
  - name: "The Coqui TTS Team"
version: 1.4
doi: 10.5281/zenodo.6334862
license: "MPL-2.0"
url: "https://www.coqui.ai"
repository-code: "https://github.com/coqui-ai/TTS"
keywords:
  - machine learning
  - deep learning
  - artificial intelligence
  - text to speech
  - TTS
4 changes: 3 additions & 1 deletion CONTRIBUTING.md
@@ -26,7 +26,8 @@ If you like to contribute code, squash a bug but if you don't know where to star
 We list all the target improvements for the next version. You can pick one of them and start contributing.
 
 - Also feel free to suggest new features, ideas and models. We're always open for new things.
-#####Call for sharing language models
+
+## Call for sharing language models
 If possible, please consider sharing your pre-trained models in any language (if the licences allow for you to do so). We will include them in our model catalogue for public use and give the proper attribution, whether it be your name, company, website or any other source specified.
 
 This model can be shared in two ways:
@@ -36,6 +37,7 @@ This model can be shared in two ways:
 Models are served under `.models.json` file and any model is available under TTS CLI or Server end points.
 
 Either way you choose, please make sure you send the models [here](https://github.com/coqui-ai/TTS/issues/380).
+
 ## Sending a ✨**PR**✨
 
 If you have a new feature, a model to implement, or a bug to squash, go ahead and send a ✨**PR**✨.
11 changes: 11 additions & 0 deletions Dockerfile
@@ -0,0 +1,11 @@
FROM nvcr.io/nvidia/pytorch:22.03-py3
RUN apt-get update && apt-get install -y --no-install-recommends espeak && rm -rf /var/lib/apt/lists/*
WORKDIR /root
COPY requirements.txt /root
COPY requirements.dev.txt /root
COPY requirements.notebooks.txt /root
RUN pip install -r <(cat requirements.txt requirements.dev.txt requirements.notebooks.txt)
COPY . /root
RUN make install
ENTRYPOINT ["tts"]
CMD ["--help"]
1 change: 1 addition & 0 deletions MANIFEST.in
@@ -1,6 +1,7 @@
 include README.md
 include LICENSE.txt
 include requirements.*.txt
+include *.cff
 include requirements.txt
 include TTS/VERSION
 recursive-include TTS *.json
2 changes: 2 additions & 0 deletions Makefile
@@ -44,6 +44,8 @@ style: ## update code style.
 
 lint: ## run pylint linter.
 	pylint ${target_dirs}
+	black ${target_dirs} --check
+	isort ${target_dirs} --check-only
 
 system-deps: ## install linux system deps
 	sudo apt-get install -y libsndfile1-dev
8 changes: 4 additions & 4 deletions README.md
@@ -159,13 +159,13 @@ If you are on Windows, 👑@GuyPaddock wrote installation instructions [here](ht
 - Run your own TTS model (Using Griffin-Lim Vocoder):
 
 ```
-$ tts --text "Text for TTS" --model_path path/to/model.pth.tar --config_path path/to/config.json --out_path output/path/speech.wav
+$ tts --text "Text for TTS" --model_path path/to/model.pth --config_path path/to/config.json --out_path output/path/speech.wav
 ```
 
 - Run your own TTS and Vocoder models:
 ```
-$ tts --text "Text for TTS" --model_path path/to/config.json --config_path path/to/model.pth.tar --out_path output/path/speech.wav
-    --vocoder_path path/to/vocoder.pth.tar --vocoder_config_path path/to/vocoder_config.json
+$ tts --text "Text for TTS" --model_path path/to/config.json --config_path path/to/model.pth --out_path output/path/speech.wav
+    --vocoder_path path/to/vocoder.pth --vocoder_config_path path/to/vocoder_config.json
 ```
 
 ### Multi-speaker Models
@@ -185,7 +185,7 @@ If you are on Windows, 👑@GuyPaddock wrote installation instructions [here](ht
 - Run your own multi-speaker TTS model:
 
 ```
-$ tts --text "Text for TTS" --out_path output/path/speech.wav --model_path path/to/config.json --config_path path/to/model.pth.tar --speakers_file_path path/to/speaker.json --speaker_idx <speaker_id>
+$ tts --text "Text for TTS" --out_path output/path/speech.wav --model_path path/to/config.json --config_path path/to/model.pth --speakers_file_path path/to/speaker.json --speaker_idx <speaker_id>
 ```
 
 ## Directory Structure