LM tokenizer can't handle whitespace token #7406
Labels
area:rasa-oss 🎡
Anything related to the open source Rasa framework
type:bug 🐛
Inconsistencies or issues which will cause an issue or problem for users or implementors.
Rasa version: 2.1.1
Python version: 3.7.6
Operating system (windows, osx, ...): Ubuntu 18.04.5 LTS
Issue:
when the training sample has " " (whitespace char) inside, e.g. '你好,我是 Silly,一个专注天气预报的对话机器人。'.
rasa train
will raise ValueError: not enough values to unpack (expected 2, got 0)Error (including full traceback):
Command or request that led to error:
domain.yml:
I already know how to fix it, I will submit a PR for this later.
The text was updated successfully, but these errors were encountered: