[Fix] Replace mispronounced words in TTS using hack method #350

6drf21e · 2024-06-19T01:28:07Z

This PR works around mispronounced characters in the TTS output by using a hack. The main idea is as follows:

1. Identify characters that ChatTTS pronounces correctly.
2. Replace each mispronounced character with a correctly pronounced homophone.

For example:

Original: 关关雎鸠，在河之洲。窈窕淑女，君子好逑。
Corrected: 关关居鸠，在河之洲。咬挑淑女，君子好求。

The replacement rule file ChatTTS/homophones_map.json contains 16,000 entries.
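
A minimal sketch of the replacement step, assuming the rule file is a flat JSON object that maps one character to one replacement character. The inline `SAMPLE_MAP_JSON` excerpt and the names `build_table` and `apply_homophones` are hypothetical and only illustrate the idea; they are not taken from the PR's code.

```python
import json

# Hypothetical excerpt: the real ChatTTS/homophones_map.json may be
# structured differently. Only the four pairs from the example are listed.
SAMPLE_MAP_JSON = '{"雎": "居", "窈": "咬", "窕": "挑", "逑": "求"}'


def build_table(map_json: str) -> dict[int, str]:
    """Parse the JSON rules and turn them into a str.translate table."""
    mapping = json.loads(map_json)
    # str.maketrans accepts a dict whose keys are single characters.
    return str.maketrans(mapping)


def apply_homophones(text: str, table: dict[int, str]) -> str:
    """Replace every character that has a rule; leave all others untouched."""
    return text.translate(table)


table = build_table(SAMPLE_MAP_JSON)
corrected = apply_homophones("关关雎鸠，在河之洲。窈窕淑女，君子好逑。", table)
print(corrected)
```

Building the translation table once and reusing it keeps the per-call cost proportional to the text length, which matters with a rule set of this size.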

Rule creation process:

1. Run ChatTTS inference over a word corpus.
2. Record any discrepancies between the inferred text and the input.
3. For each mismatched character, find a homophone that ChatTTS pronounces correctly.
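
The last two steps could be sketched roughly as below. This is an illustration under assumptions, not the script used to build the rule file: the inferred text is taken as a plain string, the alignment uses `difflib` from the standard library, and the `readings` table (character to pinyin) and the helper names are hypothetical.

```python
from difflib import SequenceMatcher


def find_mismatched_chars(original: str, inferred: str) -> set[str]:
    """Return input characters that were altered or dropped in the inferred text."""
    matcher = SequenceMatcher(None, original, inferred, autojunk=False)
    mismatched: set[str] = set()
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        # "replace" covers misread characters, "delete" covers skipped ones.
        if tag in ("replace", "delete"):
            mismatched.update(original[i1:i2])
    return mismatched


def pick_homophone(char, readings, well_pronounced):
    """Pick a correctly pronounced character with the same reading, if any."""
    target = readings.get(char)
    if target is None:
        return None
    for candidate in well_pronounced:
        if candidate != char and readings.get(candidate) == target:
            return candidate
    return None


# Hypothetical pronunciation table (pinyin with tone numbers).
readings = {"雎": "ju1", "居": "ju1", "鸠": "jiu1"}
skipped = find_mismatched_chars("关关雎鸠", "关关鸠")
rule = {ch: pick_homophone(ch, readings, ["鸠", "居"]) for ch in skipped}
print(rule)
```

A character with no same-reading candidate yields `None`, which corresponds to the case noted under Limitations where no suitable homophone exists.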

Corpus used: Tencent AI Lab Embedding Corpora for Chinese and English Words and Phrases

Limitations:

Some characters and words might not be covered.
Some characters do not have correctly pronounced homophones, e.g., “sǒu 叟”.

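Chinese (translated):

This PR uses a hack to work around characters that the TTS system mispronounces or skips. The main idea is as follows:

1. Find characters that ChatTTS pronounces correctly.
2. Replace the mispronounced characters with correctly pronounced ones.

For example:

Original: 关关雎鸠，在河之洲。窈窕淑女，君子好逑。
After replacement: 关关居鸠，在河之洲。咬挑淑女，君子好求。

The replacement rule file ChatTTS/homophones_map.json contains 16,000 rules.

Rule creation process:

1. Use a word list and have ChatTTS infer the text.
2. Record the inconsistencies between the inferred text and the input text.
3. For each inconsistent character, find a homophone with the correct pronunciation.

Word list used in the current version: Tencent AI Lab Embedding Corpora for Chinese and English Words and Phrases

Limitations:

Some characters and words may not be covered.
For some characters, no correctly pronounced substitute can be found, e.g., “sǒu 叟”.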

ChatTTS/utils/infer_utils.py

fumiama · 2024-06-19T03:49:38Z

P.S. I suggest noting in the code where the loaded JSON comes from.

fumiama

Thanks!

aliencaocao · 2024-06-19T08:37:21Z

I'm curious whether this hack has been verified not to affect the output quality in any way. By swapping words, you are scrambling phrases into meaningless individual characters. By right this should affect the linguistic ability of the model.

fumiama · 2024-06-19T08:43:28Z

By right this should affect the linguistic ability of the model.

It is better than no sound when the model cannot recognize a character. We will remove (or shrink) this replacement map once the model is updated and can recognize new characters.

6drf21e · 2024-06-19T09:16:42Z

The replacements only target characters that the model outputs incorrectly. In modern novels, the replacement rate is low. For example, in "Twenty Thousand Leagues Under the Sea":

Total Chinese characters: 214,526
Replaced characters: 316
Replacement rate: 0.15%

Top 10 Replaced Characters:

Original	Replacement	Count
鲛	教	43
鳃	塞	32
桅	维	27
舷	闲	22
颚	恶	16
呷	嘎	12
锨	先	12
獭	塔	8
岖	区	7
囱	聪	6

In classical novels, the rate is higher. For example, in "Romance of the Three Kingdoms":

Total Chinese characters: 493,054
Replaced characters: 6,946
Replacement rate: 1.41%

Top 10 Replaced Characters:

Original	Replacement	Count
郃	和	249
岱	带	189
惇	蹲	161
褚	楚	154
傕	觉	136
汜	四	134
讫	气	133
赍	机	119
綝	陈	109
瑁	帽	103
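
Statistics like these could be gathered with a short script. The sketch below is an assumption about how one might do it, not the script used for the numbers above: it counts only characters in the basic CJK Unified Ideographs block (U+4E00 to U+9FFF) as Chinese characters, and `replacement_stats` is a hypothetical helper name.

```python
from collections import Counter


def is_cjk(ch: str) -> bool:
    """True for characters in the basic CJK Unified Ideographs block."""
    return "\u4e00" <= ch <= "\u9fff"


def replacement_stats(text: str, homophones: dict[str, str], top_n: int = 10):
    """Return (total Chinese chars, replaced chars, rate in percent, top rows)."""
    total = sum(1 for ch in text if is_cjk(ch))
    counts = Counter(ch for ch in text if ch in homophones)
    replaced = sum(counts.values())
    rate = 100 * replaced / total if total else 0.0
    # Each row mirrors the table columns: Original, Replacement, Count.
    top = [(ch, homophones[ch], n) for ch, n in counts.most_common(top_n)]
    return total, replaced, rate, top


# Tiny made-up input; the two rules are taken from the first table above.
total, replaced, rate, top = replacement_stats(
    "鲛鲛鳃abc，海", {"鲛": "教", "鳃": "塞"}
)
print(total, replaced, f"{rate:.2f}%", top)
```

The reported rates follow directly from the counts: 316 / 214,526 is about 0.15%, and 6,946 / 493,054 is about 1.41%.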

6drf21e added 2 commits June 19, 2024 09:11

[Fix] Replace mispronounced words in TTS using hack method

47ebf5e

[Refactor] Move homophones_map.json to res folder

c3e3aad

fumiama requested changes Jun 19, 2024

fumiama added the enhancement and good first issue labels Jun 19, 2024

6drf21e added 2 commits June 19, 2024 14:53

[Refactor] Cache homophones_map.json and document source

c793aa9

[Refactor] document process in class docstring.

78ffe75

fumiama approved these changes Jun 19, 2024

fumiama merged commit ce1c962 into 2noise:main Jun 19, 2024
