[tts] BERT-based polyphone prediction for the speech synthesis text frontend #1283

yt605155624 · 2022-01-06T11:45:51Z

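Polyphonic characters are currently handled with pypinyin or g2pM, whose accuracy is limited. I would like to build a polyphone prediction model based on BERT (or ERNIE). Put simply: suppose a language has 100 polyphonic characters, each with at most 3 pronunciations. We can then attach 100 three-way classifiers (plain fc layers are enough) on top of BERT and, at prediction time, look up the classifier that belongs to the character in question and classify with it.
Reference paper: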
tencent_polyphone.pdf

For data, the dataset provided by https://github.com/kakaobrain/g2pM can be used.

Advanced: multi-task BERT
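
The "one classifier per polyphonic character" idea above can be sketched roughly as follows. This is only a toy illustration, not an existing PaddleSpeech implementation: `encode` is a stand-in for BERT that returns deterministic pseudo-random vectors, the three-character `POLYPHONES` table, the `Head` class, and `predict` are all hypothetical names, and the weights are untrained, so the chosen readings are arbitrary. It only shows the routing: find the head for the character, then classify among that character's readings.

```python
import math
import random

# Hypothetical toy inventory: polyphonic character -> candidate readings.
POLYPHONES = {
    "行": ["xing2", "hang2"],
    "长": ["chang2", "zhang3"],
    "脏": ["zang1", "zang4"],
}
MAX_READINGS = 3  # the proposal assumes at most 3 readings per character
HIDDEN = 8        # toy hidden size; a real BERT-base encoder uses 768


def encode(sentence):
    """Stand-in for BERT (assumption): one fixed pseudo-random vector per character.

    A real system would return contextual hidden states here.
    """
    vectors = []
    for pos, ch in enumerate(sentence):
        rng = random.Random(ord(ch) * 1000 + pos)
        vectors.append([rng.uniform(-1.0, 1.0) for _ in range(HIDDEN)])
    return vectors


class Head:
    """One fully connected layer mapping HIDDEN features to MAX_READINGS logits."""

    def __init__(self, seed):
        rng = random.Random(seed)
        self.w = [[rng.uniform(-0.1, 0.1) for _ in range(HIDDEN)]
                  for _ in range(MAX_READINGS)]
        self.b = [0.0] * MAX_READINGS

    def logits(self, h):
        return [sum(wi * hi for wi, hi in zip(row, h)) + b
                for row, b in zip(self.w, self.b)]


# One (untrained) classifier head per polyphonic character.
HEADS = {ch: Head(seed=ord(ch)) for ch in POLYPHONES}


def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def predict(sentence):
    """Return (position, character, reading, probability) for each polyphone found."""
    hidden = encode(sentence)
    results = []
    for pos, ch in enumerate(sentence):
        if ch not in HEADS:
            continue  # not a polyphone in the toy table: nothing to classify
        readings = POLYPHONES[ch]
        # Keep only the class slots this character actually uses.
        logits = HEADS[ch].logits(hidden[pos])[:len(readings)]
        probs = softmax(logits)
        best = max(range(len(readings)), key=probs.__getitem__)
        results.append((pos, ch, readings[best], probs[best]))
    return results


if __name__ == "__main__":
    for item in predict("这里的肉脏"):
        print(item)
```

In a trained version, each head would be optimized with cross-entropy only on the positions where its character occurs. A single shared classifier over all readings with a per-character mask is a common alternative design.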

Jzow · 2022-01-18T09:26:38Z

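However, I could not find any example for English synthesis. Objectively speaking, Paddle's docs in this area fall far short of other open-source projects; the Mozilla and TensorFlow TTS projects have clear documentation.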

yt605155624 · 2022-01-18T09:37:15Z

ljspeech and vctk are both English synthesis datasets, and both come with examples.

Jzow · 2022-01-18T09:50:40Z

@yt605155624 Thank you very much for the prompt reply. I will take a look.

GloryRoadWangzh · 2022-07-29T08:56:17Z

Is there a code implementation of BERT-based polyphone prediction for the speech synthesis text frontend?

yt605155624 · 2022-08-08T12:17:47Z

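@GloryRoadWangzh Not at the moment. You could model it on the punctuation prediction example, which is built on paddlenlp. A developer is currently adding g2pw, which is BERT-based, to our frontend, so we will probably not build polyphone prediction ourselves. #2230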

lucasjinreal · 2022-10-28T08:16:04Z

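@yt605155624 A question: why is polyphone prediction no longer needed once g2pw is available? For example, can it predict the following sentence correctly: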

孩子，别吃了，这里的肉脏，走，跟我去太平间

yt605155624 · 2022-11-18T08:24:39Z

@jinfagang Because g2pw is itself a BERT-based polyphone prediction model.

yt605155624 added the good first issue label Jan 6, 2022

zh794390558 assigned yt605155624 Jan 12, 2022

zh794390558 added this to PaddleSpeech Jan 12, 2022

stale bot added the Stale label Mar 5, 2022

zh794390558 changed the title ~~BERT-based polyphone prediction for the speech synthesis text frontend~~ [tts] BERT-based polyphone prediction for the speech synthesis text frontend Mar 29, 2022

stale bot removed the Stale label Mar 29, 2022

stale bot added the Stale label Jun 11, 2022

stale bot closed this as completed Jul 12, 2022

stale bot moved this to Done in PaddleSpeech Jul 12, 2022

yt605155624 reopened this Jul 28, 2022

stale bot removed the Stale label Jul 28, 2022

PaddlePaddle deleted a comment from stale bot Sep 7, 2022

yt605155624 added the T2S label Sep 14, 2022

yt605155624 closed this as completed Nov 18, 2022
