G2p bigcidian #103

Open: wants to merge 5 commits into base: main

sanqianyuejia

Add bilingual Mandarin and English support using BigCiDian

  1. Add a phonemization backend, G2PBackend

  2. Import the compiled BigCiDian into g2p to support bilingual Mandarin and English

  3. Add userdict.txt (from BigCiDian) for Jieba to support segmentation of mixed Mandarin and English text

@sanqianyuejia (Author)

The file docs/userdict.txt used by jieba has a problem with its frequency settings: as written, they do not guarantee that the user dictionary takes priority over the system dictionary. @lifeiteng
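
One possible workaround, sketched here and not part of this PR: jieba's documentation states that when an entry's frequency is omitted, jieba computes a frequency high enough for that word to be segmented out. Stripping the frequency column from the user dictionary would therefore hand the priority decision back to jieba. The helper name `drop_frequencies` and the pattern `_ENTRY` are hypothetical, and the sketch assumes entries follow the usual `word [freq] [tag]` layout with a numeric frequency and a lowercase tag.

```python
import re

# Mirrors the "word [freq] [tag]" layout of a jieba user dictionary line.
# Assumption: the frequency is a run of digits and the tag is lowercase letters.
_ENTRY = re.compile(r"^(?P<word>.+?)(?: (?P<freq>[0-9]+))?(?: (?P<tag>[a-z]+))?$")


def drop_frequencies(lines):
    """Return user-dictionary lines with the frequency column removed.

    Hypothetical helper: the tag (if any) is kept, the frequency is dropped,
    and blank lines are skipped.
    """
    cleaned = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        match = _ENTRY.match(line)
        word, tag = match.group("word"), match.group("tag")
        cleaned.append(f"{word} {tag}" if tag else word)
    return cleaned
```

The cleaned lines could then be written back to a file and loaded with `jieba.load_userdict`. Note that a multi-word lowercase entry such as `new york` is ambiguous in this layout, because its last word looks like a tag, so such entries would need separate handling.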
