Feature Request: Unicode Script property #11

Open
russcam opened this issue Apr 30, 2024 · 3 comments


russcam commented Apr 30, 2024

It can be useful to know the Unicode Script property of characters, for example, in language detection.

Would you be interested in a PR to add support for Unicode scripts, and to look up the script name for a given character?


Dubzer commented May 14, 2024

I'd like to see that, too 👀

Author

russcam commented Jun 20, 2024

@Dubzer in the meantime, you may be interested in using UnicodeScript.g.cs, which is generated by https://github.com/searchpioneer/lingua-dotnet/tree/main/build/Lingua.UnicodeScriptGenerator. It implements an extension method, GetScript(this char ch), that returns the Unicode script for the given char.
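
For readers who want a feel for the shape of such a lookup, here is a rough sketch of a range-table approach. It is not the contents of the generated file: the `UnicodeScript` enum, the `UnicodeScriptExtensions` class, and the handful of ranges are all hypothetical and hand-picked for illustration, whereas a real table would be generated from the Unicode `Scripts.txt` data file.

```csharp
using System;

// Hypothetical enum for illustration; a generated file would list every script.
public enum UnicodeScript { Unknown, Latin, Greek, Cyrillic, Hiragana, Han }

public static class UnicodeScriptExtensions
{
    // Small illustrative subset of ranges, sorted by Start and non-overlapping.
    // Assumption: a real table would be generated from Scripts.txt.
    private static readonly (char Start, char End, UnicodeScript Script)[] Ranges =
    {
        ('\u0041', '\u005A', UnicodeScript.Latin),    // A-Z
        ('\u0061', '\u007A', UnicodeScript.Latin),    // a-z
        ('\u03B1', '\u03C9', UnicodeScript.Greek),    // lowercase alpha to omega
        ('\u0410', '\u044F', UnicodeScript.Cyrillic), // basic Russian alphabet
        ('\u3041', '\u3096', UnicodeScript.Hiragana),
        ('\u4E00', '\u9FA5', UnicodeScript.Han),      // part of CJK Unified Ideographs
    };

    // Binary search over the sorted ranges; returns Unknown when no range matches.
    public static UnicodeScript GetScript(this char ch)
    {
        int lo = 0, hi = Ranges.Length - 1;
        while (lo <= hi)
        {
            int mid = lo + (hi - lo) / 2;
            var range = Ranges[mid];
            if (ch < range.Start) hi = mid - 1;
            else if (ch > range.End) lo = mid + 1;
            else return range.Script;
        }
        return UnicodeScript.Unknown;
    }
}
```

With this sketch, `'я'.GetScript()` would return `UnicodeScript.Cyrillic`, and any character outside the sample ranges falls through to `Unknown`. Note that a `char` is a single UTF-16 code unit, so a lookup keyed on `char` cannot classify characters outside the Basic Multilingual Plane; covering those would need a code point (`int`) or `System.Text.Rune` overload.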

Owner

hexawyz commented Jun 20, 2024

Hello

Thanks for providing this option 😊

I do plan to address this in the library at some point, but let's be honest, this project is not really at the top of my priorities at the moment. 😶‍🌫️
This change and the one needed to upgrade the Unicode version both seem to require slightly non-trivial changes, so I need to reserve a chunk of time to work on them. It will happen, but I can't promise when 😅


3 participants