Add TransformerVC implementation #90

ArkhamImp · 2024-01-05T02:50:36Z

The code has been tested, and the model is under training.

lmxue · 2024-01-05T08:02:38Z

Thanks for your efforts. There are some suggestions for a standard PR, especially for the PR that integrates a new model into Amphion:

Please format the code using black
Please detail PR descriptions, including the PR purpose, supported model, etc.
Please attach the pre-trained models as well as some samples in this PR.
Please submit a PR from a specific branch instead of the main branch.

RMSnow · 2024-01-06T04:00:48Z

Thanks @ArkhamImp a lot to integrate our first VC model. For the "New Feature" PR, the review stage is more strict and usually takes several request-change loops.

You can follow these two PRs to provide the inference samples of TransformerVC, so that we can ensure the code is bug-free:

A perfect "New Feature" PR will contain:

(1) Code;
(2) Generated Samples;
(3) Recipe and Doc;
(4) Pretrained Models in our Huggingface;
(5) Gradio Demo in our Huggingface.

I think for this PR, you need to provide (1)-(2) at least. Besides, you can schedule an expected date for (3) to inform others. After (1)-(3), we can assign other developers to cooperate with you to accomplish (4) and (5).

ArkhamImp · 2024-01-15T02:23:17Z

Voice Conversion Samples

Sample 1

Source

121_121726_000020_000001.mp4

Target

672_122797_000002_000002.mp4

Converted

121_121726_000020_000001_672_122797_000002_000002.mp4

Sample 2

Source

1089_134686_000018_000000.mp4

Target

lj001-0003.mp4

Converted

1089_134686_000018_000000_LJ001-0003.mp4

ArkhamImp · 2024-01-26T04:57:31Z

Samples of VitsVC: https://x8gvg3n7v3.feishu.cn/docx/SPbnd0gHcowovGxCyOLcndfanAb?from=from_copylink

RMSnow

Thanks @ArkhamImp ! The VitsVC samples sound better than TransformerVC. You can integrate the both two if you are ready!

ArkhamImp · 2024-01-29T01:06:45Z

Ready to integrate.

ArkhamImp and others added 3 commits January 5, 2024 10:37

Add using of speaker embedding

5ee1246

Add TransformerVC implementation

6e12991

Update env.sh

02b3068

ArkhamImp added 2 commits January 5, 2024 17:39

Fixed load vocoder checkpoint issue

73e34c8

Merge remote-tracking branch 'origin/main' into main

ee88b02

jiaqili3 requested a review from lmxue January 6, 2024 02:04

ArkhamImp and others added 3 commits January 13, 2024 12:52

Merge branch 'open-mmlab:main' into main

c318c6a

Change input to Hubert token and normalized pitch

87c0e09

Change input to Hubert token and normalized pitch

e7b1122

ArkhamImp added 2 commits January 25, 2024 09:10

Merge branch 'open-mmlab:main' into main

cddffd5

Add implement of VitsVC

da6bb0d

RMSnow self-requested a review January 27, 2024 02:08

RMSnow requested changes Jan 27, 2024

View reviewed changes

ArkhamImp requested a review from RMSnow January 29, 2024 01:18

RMSnow requested a review from Adorable-Qin January 29, 2024 03:51

ArkhamImp deleted the branch open-mmlab:main April 17, 2024 01:41

ArkhamImp closed this Apr 17, 2024

ArkhamImp deleted the main branch April 17, 2024 01:41

ArkhamImp mentioned this pull request Apr 17, 2024

Add TransformerVC & VITSVC implementation #183

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add TransformerVC implementation #90

Add TransformerVC implementation #90

ArkhamImp commented Jan 5, 2024

lmxue commented Jan 5, 2024 •

edited

Loading

RMSnow commented Jan 6, 2024

ArkhamImp commented Jan 15, 2024

ArkhamImp commented Jan 26, 2024

RMSnow left a comment

ArkhamImp commented Jan 29, 2024

Add TransformerVC implementation #90

Add TransformerVC implementation #90

Conversation

ArkhamImp commented Jan 5, 2024

lmxue commented Jan 5, 2024 • edited Loading

RMSnow commented Jan 6, 2024

ArkhamImp commented Jan 15, 2024

Voice Conversion Samples

Sample 1

Source

Target

Converted

Sample 2

Source

Target

Converted

ArkhamImp commented Jan 26, 2024

RMSnow left a comment

Choose a reason for hiding this comment

ArkhamImp commented Jan 29, 2024

lmxue commented Jan 5, 2024 •

edited

Loading