Hungarian model with spaCy #17

Closed
oroszgy opened this issue Feb 4, 2019 · 2 comments

Comments

@oroszgy

oroszgy commented Feb 4, 2019

Hi,

Just letting you know that parsing Hungarian should be possible through spaCy: it can tokenize Hungarian, and PoS tagging and dependency parsing are also available through a third-party package.

Thanks,
Gyuri

@nikitakit
Owner

Thanks for letting me know!

By any chance, have you tried running benepar with your spaCy models for Hungarian? I'm always a bit nervous that the spaCy integration may have introduced some sort of error that would be immediately obvious to a speaker of the language. My parser also has a POS tagger (optional but enabled by default), though I'm not sure how good it is compared to the spaCy model you linked.

I'll update the README with instructions on how to use Hungarian spaCy models. Sadly, the tokenizer available via spacy.blank is not sufficient for benepar on its own because it doesn't mark sentence boundaries.

@nikitakit
Owner

Closing this now that I've added the link to the README. Thanks for telling me about the spaCy models (and let me know if you ever spot any issues with the Hungarian parser).
