Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Treat digits as a continuous word #8

Open
GregHib opened this issue Jan 28, 2022 · 1 comment
Open

Treat digits as a continuous word #8

GregHib opened this issue Jan 28, 2022 · 1 comment

Comments

@GregHib
Copy link
Contributor

GregHib commented Jan 28, 2022

If [WordSplitterConfig.handleCase] is set to true: if a lowercase character is followed by an uppercase character, a word boundary is considered to be prior to the uppercase character.
If [WordSplitterConfig.handleCase] is set to true: if multiple uppercase characters are followed by a lowercase character, a word boundary is considered to be prior to the last uppercase character.

Prior to the last digit is an odd place to be considered a boundary. As such any digits which are followed by letters get chopped up.

Input Expected Actual
option14b option 14 b option 1 4b
123abc 123 abc 12 3abc

Numbers should be considered continuous

@mrapplexz
Copy link
Member

mrapplexz commented Mar 15, 2022

Hello! Sorry, for a late reply, but I'm currently busy at work. I will look into this issue when I'll have some time :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants