echo 'Jođiheaddji guovttosges' | hfst-tokenise -g tokeniser-gramcheck-gt-desc.pmhfst
"<Jođiheaddji guovttosges>"
	"ges" Pcle Foc/ges <W:0.0> "<ges>"
		"jođiheaddji guovttos" N Coll Sem/Group_Hum Sg Loc <W:0.0> "<Jođiheaddji guovttos>"
	"ges" Pcle Foc/ges <W:0.0> "<ges>"
		"jođiheaddji guovttos" N Coll Sem/Group_Hum Sg Nom <W:0.0> "<Jođiheaddji guovttos>"
:\n

echo 'Jođiheaddji guovttosges' | hfst-tokenise -g tokeniser-gramcheck-gt-desc.pmhfst | cg-mwesplit
"<Jođiheaddji guovttos>"
	"jođiheaddji guovttos" N Coll Sem/Group_Hum Sg Loc <W:0.0>
	"jođiheaddji guovttos" N Coll Sem/Group_Hum Sg Nom <W:0.0>
"<ges>"
	"ges" Pcle Foc/ges <W:0.0>
:\n
After cg-mwesplit has been applied, there is an extra newline after the split cohorts that was not there in the input. Do you get the same, @unhammer?
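One way to check this without reading the output by eye is to count the empty lines in the stream before and after cg-mwesplit. The following is only a minimal sketch: `count_blank_lines` is a hypothetical helper name, not part of hfst or vislcg3, and it assumes the extra newline shows up as an empty line in the CG stream.

```shell
#!/bin/sh
# Hypothetical helper: print the number of empty lines in the stream on stdin.
count_blank_lines() {
    # grep -c still prints 0 when nothing matches, but exits with status 1,
    # so the exit status is masked here.
    grep -c '^$' || true
}

# Intended use (needs hfst-tokenise, cg-mwesplit and the tokeniser file
# from the report, so it is left commented out here):
#   echo 'Jođiheaddji guovttosges' \
#     | hfst-tokenise -g tokeniser-gramcheck-gt-desc.pmhfst \
#     | count_blank_lines
#   echo 'Jođiheaddji guovttosges' \
#     | hfst-tokenise -g tokeniser-gramcheck-gt-desc.pmhfst \
#     | cg-mwesplit \
#     | count_blank_lines
```

If the second count is higher than the first, the extra newline was introduced by cg-mwesplit rather than by the tokeniser.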
It just feels "dirty": the stream is changed in unintended ways. There was also a use case I had in mind when I reported this, but that was a long time ago and I have since forgotten it. I will add it if/when I remember what it was.
The example above uses giellalt/lang-sme.