Commit

Fixing English tokenization
WeberJulian committed Oct 6, 2023
1 parent 3063846 commit 1ec3418
Showing 1 changed file with 3 additions and 2 deletions.
5 changes: 3 additions & 2 deletions TTS/tts/layers/xtts/tokenizer.py
@@ -223,8 +223,9 @@ def preprocess_text(self, txt, lang):
             results = kks.convert(txt)
             txt = " ".join([result["kana"] for result in results])
             txt = basic_cleaners(txt)
-        elif lang == "en":
-            txt = english_cleaners(txt)
+        # elif lang == "en":
+        # txt = english_cleaners(txt)
+        # English cleaner remove the language tag [en]
         elif lang == "ar":
             txt = arabic_cleaners(txt)
         elif lang == "zh-cn":
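
The comment added in this commit says the English cleaner removes the `[en]` language tag, and the change works around that by skipping the cleaner for English. The sketch below only illustrates that kind of failure and one possible alternative. It does not reproduce the real `english_cleaners`: `toy_cleaner`, `clean_preserving_tag`, and `LANG_TAG` are hypothetical names, and the assumption that the tag sits at the start of the text is made purely for illustration.

```python
import re

# Hypothetical pattern for a leading language tag such as "[en]" or "[zh-cn]".
LANG_TAG = re.compile(r"^\[[a-z-]+\]")


def toy_cleaner(text: str) -> str:
    """Hypothetical stand-in for an aggressive cleaner (NOT english_cleaners).

    Lowercases the text and drops every character that is not a lowercase
    letter, digit, space, or basic punctuation, so square brackets are lost.
    """
    text = text.lower()
    return re.sub(r"[^a-z0-9 .,!?']", "", text)


def clean_preserving_tag(text: str) -> str:
    """Split off a leading language tag, clean the rest, then reattach the tag."""
    match = LANG_TAG.match(text)
    if match is None:
        return toy_cleaner(text)
    return match.group(0) + toy_cleaner(text[match.end():])


# The naive call mangles the tag; the guarded call keeps it intact.
print(toy_cleaner("[en]Hello, World!"))
print(clean_preserving_tag("[en]Hello, World!"))
```

Compared with commenting out the English branch, a guard like this would keep the cleaning step active for the text itself. Whether that matters depends on what the real cleaner does, which this commit does not show.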