TTS/TTS/tts/layers/xtts/tokenizer.py", line 180, in expand_abbreviations_multilingual for regex, replacement in _abbreviations[lang]: KeyError: 'zh-cn'[Bug] #3189

lucasjinreal · 2023-11-10T07:08:23Z

Describe the bug

TTS/TTS/tts/layers/xtts/tokenizer.py", line 180, in expand_abbreviations_multilingual
for regex, replacement in _abbreviations[lang]:
KeyError: 'zh-cn'

To Reproduce

TTS/TTS/tts/layers/xtts/tokenizer.py", line 180, in expand_abbreviations_multilingual
for regex, replacement in _abbreviations[lang]:
KeyError: 'zh-cn'

Expected behavior

TTS/TTS/tts/layers/xtts/tokenizer.py", line 180, in expand_abbreviations_multilingual
for regex, replacement in _abbreviations[lang]:
KeyError: 'zh-cn'

Logs

TTS/TTS/tts/layers/xtts/tokenizer.py", line 180, in expand_abbreviations_multilingual
    for regex, replacement in _abbreviations[lang]:
KeyError: 'zh-cn'

Environment

TTS/TTS/tts/layers/xtts/tokenizer.py", line 180, in expand_abbreviations_multilingual
    for regex, replacement in _abbreviations[lang]:
KeyError: 'zh-cn'

Additional context

No response

douhaohaode · 2023-11-10T15:01:27Z

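If zh-cn and zh both represent Chinese, it would be better to use just one of them.

If you want to get it running in the meantime, you can manually edit lines 118 and 283 of tokenizer.py in the TTS package and change zh to zh-cn.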

lucasjinreal · 2023-11-10T15:41:56Z

I think the keys of these maps in the tokenizer should be consistent with the language codes.
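
The mismatch described here can be detected mechanically. The following is a minimal sketch, not code from the TTS repository: `missing_language_keys`, `supported_langs`, and the two small tables are hypothetical stand-ins for the model's language list and the tokenizer's lookup tables.

```python
def missing_language_keys(supported_langs, tables):
    """Return {table_name: sorted language codes that have no entry in that table}."""
    missing = {}
    for name, table in tables.items():
        absent = sorted(set(supported_langs) - set(table))
        if absent:
            missing[name] = absent
    return missing


# Hypothetical stand-ins: the model advertises "zh-cn", the tables are keyed by "zh".
supported_langs = ["en", "es", "zh-cn"]
abbreviations = {"en": [], "es": [], "zh": []}
symbols = {"en": [], "es": [], "zh": []}

report = missing_language_keys(
    supported_langs,
    {"abbreviations": abbreviations, "symbols": symbols},
)
print(report)  # {'abbreviations': ['zh-cn'], 'symbols': ['zh-cn']}
```

A check like this could run as a unit test so that a language code added to the config without matching table keys fails early rather than at inference time.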

AIFSH · 2023-11-11T11:21:39Z

Until there is an official fix:

pip uninstall TTS
pip install TTS==0.20.2

This works!

jbang2004 · 2023-11-12T00:04:29Z

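> If zh-cn and zh both represent Chinese, it would be better to use just one of them.
>
> If you want to get it running in the meantime, you can manually edit lines 118 and 283 of tokenizer.py in the TTS package and change zh to zh-cn.

Nice, that works, thanks. By the way, do you know how to save a speaker's latents and embedding and reuse them to generate multiple dialogue segments? Right now I have to compute the features first and then run inference every time, which is very inefficient.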

lucasjinreal · 2023-11-12T02:01:26Z

@jbang2004 It is possible, but the maintainers don't seem to have considered this use case at all.

jbang2004 · 2023-11-12T04:22:00Z

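> @jbang2004 It is possible, but the maintainers don't seem to have considered this use case at all.

I spent the morning looking into it. The official docs describe a way to extract the features directly from the model and then write the wav with torchaudio. With that approach you can keep reusing the same features across conversions. However, the output quality is somewhat worse than with the API, and I don't know why.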
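
The code for this approach was not posted in the thread. The following is only a minimal sketch of the reuse pattern itself: compute the speaker features once per reference file and reuse them for every sentence. `SpeakerFeatureCache` and `fake_extract` are hypothetical names. In a real setup the `compute` callable is assumed to wrap the model's own feature-extraction call (for XTTS, presumably its conditioning-latent extraction method), which is not invoked here. The sketch does not address the quality difference mentioned above.

```python
from typing import Any, Callable, Dict


class SpeakerFeatureCache:
    """Compute speaker features once per reference file and reuse them afterwards."""

    def __init__(self, compute: Callable[[str], Any]):
        self._compute = compute          # expensive feature-extraction function
        self._cache: Dict[str, Any] = {}
        self.misses = 0                  # how many times extraction actually ran

    def get(self, reference_wav: str) -> Any:
        if reference_wav not in self._cache:
            self.misses += 1
            self._cache[reference_wav] = self._compute(reference_wav)
        return self._cache[reference_wav]


def fake_extract(path: str):
    # Placeholder for the real extraction step (an assumption, not the TTS API).
    return (f"latent:{path}", f"embedding:{path}")


cache = SpeakerFeatureCache(fake_extract)
for sentence in ["Hello.", "How are you?", "Goodbye."]:
    gpt_cond_latent, speaker_embedding = cache.get("speaker_a.wav")
    # A real pipeline would pass both values to the model's inference call here.

print(cache.misses)  # 1
```

To keep the features across process restarts, the cached values could additionally be written to disk, for example with `torch.save` if they are tensors.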

lucasjinreal · 2023-11-12T04:40:05Z

@jbang2004 Could you share the code?

Edresson · 2023-11-14T16:33:39Z

I fixed it in #3216. "zh-cn" is what we have in the config and docs, so I renamed "zh" to "zh-cn".

genglinxiao · 2023-11-15T09:12:14Z

I think the language code used for Chinese is inconsistent in two places:
The model uses "zh-cn" for Chinese (Simplified). However, the key defined for Chinese in both _abbreviations and _symbols_multilingual is "zh". These two structures are used in expand_abbreviations_multilingual() and expand_symbols_multilingual() respectively, so passing "zh-cn" raises a KeyError in each of them.

In my case, I mapped "zh-cn" to "zh" inside these two functions by adding the following lines to each of them:

    # Map the model's language code to the key used by the lookup tables.
    if lang == "zh-cn":
        lang = "zh"

But I think there ought to be a cleaner solution.
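
One possible cleaner variant is to normalize the language code in a single place instead of patching each function. The following is a minimal sketch under stated assumptions, not the fix that was merged: `_LANG_ALIASES`, `normalize_lang`, and `expand_abbreviations` are hypothetical names, and `_abbreviations` here is a toy stand-in for the tokenizer's real table.

```python
import re

# Hypothetical alias table: codes a caller may pass -> keys used by the lookup tables.
_LANG_ALIASES = {"zh-cn": "zh"}

# Toy stand-in for the tokenizer's abbreviation table (the real one is much larger).
_abbreviations = {
    "en": [(re.compile(r"\bmr\.", re.IGNORECASE), "mister")],
    "zh": [],
}


def normalize_lang(lang):
    """Translate an aliased language code to its table key; pass others through."""
    return _LANG_ALIASES.get(lang, lang)


def expand_abbreviations(text, lang):
    """Apply the abbreviation rules for `lang`; unknown languages leave text unchanged."""
    for regex, replacement in _abbreviations.get(normalize_lang(lang), []):
        text = regex.sub(replacement, text)
    return text


print(expand_abbreviations("Mr. Smith", "en"))  # mister Smith
print(expand_abbreviations("你好", "zh-cn"))     # 你好 (no KeyError)
```

Using `.get(..., [])` makes unknown languages a silent no-op. Raising a descriptive error instead would be the stricter choice if a missing table should be treated as a bug.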

stale · 2023-12-16T01:45:59Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look at our discussion channels.

lucasjinreal added the "bug" label Nov 10, 2023

Edresson mentioned this issue Nov 14, 2023 in "Fix XTTS GPT padding and inference issues" #3216 (merged)

Edresson self-assigned this Nov 14, 2023

stale bot added the "wontfix" label Dec 16, 2023

stale bot closed this as completed Dec 23, 2023
