-
Notifications
You must be signed in to change notification settings - Fork 597
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to deal with different languages #373
Comments
Hi @rickgoud, thanks for your comment! To get started with additional languages, see here. For best practices on developing new recognizers, see here. Each call to Presidio has a language parameter. Presidio assumes you know what language you're sending to it. If that's not the case, consider using a language detection mechanism prior to calling Presidio. In addition, each recognizer (in charge of detecting one or more PII entities), is configured to support specific languages. This is in a nutshell how Presidio can be set up to support multiple languages at once or call a subset of recognizers given the input language. |
i can not find the files there, can you provide me the links, if they are updated? Thanks |
Hi @Oheed911, please see the updated link here: https://microsoft.github.io/presidio/analyzer/languages/ |
Thanks! |
Hi,
Great repo! However only works for English. How would it work if we wanted to add classifiers for Dutch? How to add them the best way and how does presidio know which classifiers to use depending on the language, or will it always run all (which feels like to much overkill, so should be a more intelligent way I don't fully understand yet.
Any help would be greatly appreciated!!
Regards,
Rick
The text was updated successfully, but these errors were encountered: