fix ordinals and conjunctions in tts normalizer #341

Spycsh · 2023-09-19T07:11:56Z

Type of Change

Bug fix

Description

fix ordinals and conjunctions in tts normalizer

correctly handle following:

conjunctions
CVPR-15 => cee vee pee ar fifteen
ordinals
1st 2nd 3rd 4th 5th 11th 12th 21st 22nd => first second third fourth fifth eleventh twelfth twenty first twenty second

Expected Behavior & Potential Risk

Make the normalizer more robust.

There still are some words such as i7, ffmpeg, BTW not spelled correctly and should be hardcoded maybe in an advanced Trie. Also, the potention number is only treated as a year when it has prepositions in front of it and between (1000,2999), which still is not an absolute mapping (e.g. in Tom's 1986 report indicated that..., 1986 should be a year but with no prepositions it is converted as cardinal number).

How has this PR been tested?

add two UTs

Dependency Change?

re should be built-in Python package

intel_extension_for_transformers/neural_chat/pipeline/plugins/audio/utils/english_normalizer.py

* fix ordinals and conjunctions in tts normalizer * fix comment

Spycsh added 2 commits September 19, 2023 00:00

fix ordinals and conjunctions in tts normalizer

f324832

fix comment

a4aaf58

Spycsh requested a review from lvliang-intel as a code owner September 19, 2023 07:11

hshen14 reviewed Sep 19, 2023

View reviewed changes

intel_extension_for_transformers/neural_chat/pipeline/plugins/audio/utils/english_normalizer.py Show resolved Hide resolved

hshen14 reviewed Sep 19, 2023

View reviewed changes

intel_extension_for_transformers/neural_chat/pipeline/plugins/audio/utils/english_normalizer.py Show resolved Hide resolved

hshen14 approved these changes Sep 20, 2023

View reviewed changes

lvliang-intel approved these changes Sep 20, 2023

View reviewed changes

hshen14 merged commit 0892f8a into main Sep 20, 2023
14 checks passed

hshen14 deleted the spycsh/fix_normalizer branch September 20, 2023 03:15

lvliang-intel pushed a commit that referenced this pull request Sep 20, 2023

fix ordinals and conjunctions in tts normalizer (#341)

3435463

* fix ordinals and conjunctions in tts normalizer * fix comment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix ordinals and conjunctions in tts normalizer #341

fix ordinals and conjunctions in tts normalizer #341

Spycsh commented Sep 19, 2023

fix ordinals and conjunctions in tts normalizer #341

fix ordinals and conjunctions in tts normalizer #341

Conversation

Spycsh commented Sep 19, 2023

Type of Change

Description

Expected Behavior & Potential Risk

How has this PR been tested?

Dependency Change?