[\u{0}-\u{10FFFF}] matches any byte instead of any unicode character #202
Comments
Additionally, adding |
This is interesting. I've never tried to use \u in my regular expressions, but then again my project is an assembler for a 16-bit processor, so ASCII is more likely to be used than Unicode. I'd like to know whether I could use this to match every extended ASCII character (e.g. \x00 through \xFF instead of just \x00 through \x7F) by writing, say, \u{0} through \u{FF}.
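For context on the \x00-\xFF versus \u{0}-\u{FF} question, here is a minimal sketch using only the Rust standard library (no Logos involved). It shows that code points above U+007F are not single bytes in UTF-8, so a byte range and a code point range only coincide for ASCII. The helper name `utf8_bytes` is hypothetical and exists only for illustration.

```rust
/// Hypothetical helper: returns the UTF-8 encoding of one Unicode scalar value.
fn utf8_bytes(c: char) -> Vec<u8> {
    // A char needs at most 4 bytes in UTF-8.
    let mut buf = [0u8; 4];
    c.encode_utf8(&mut buf).as_bytes().to_vec()
}

/// The raw byte 0xFF on its own is never valid UTF-8, so it cannot
/// appear in a `&str`; it can only be matched when lexing `&[u8]`.
fn is_valid_utf8(bytes: &[u8]) -> bool {
    std::str::from_utf8(bytes).is_ok()
}
```

Under this sketch, `utf8_bytes('\u{7F}')` is the single byte 0x7F, while `utf8_bytes('\u{FF}')` is the two bytes 0xC3 0xBF, and `is_valid_utf8(&[0xFF])` is false. Whether \u{0}-\u{FF} behaves like \x00-\xFF therefore depends on whether the lexer interprets the class as code points or as bytes, which is exactly what this issue is about.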
Just wanted to give this a bump, as this is still an issue.
Just had a potentially dumb thought, but would changing the regex to |
Add a workaround for maciejhirsz/logos#202. See #857.
Example:
The second assert fails:
The derive(Logos) is expanded as:
It looks like [\u{0}-\u{10FFFF}] matches any raw byte instead of any Unicode character.
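To make the byte-versus-character distinction concrete, here is a small standard-library sketch. It does not use Logos and does not reproduce the issue's original example. It only contrasts how much input an "any Unicode character" class would be expected to consume with what an "any byte" class consumes. Both function names are hypothetical.

```rust
/// Hypothetical model of an "any Unicode character" class: consumes the
/// whole first scalar value, which is 1 to 4 bytes long in UTF-8.
fn any_char_len(input: &str) -> Option<usize> {
    input.chars().next().map(char::len_utf8)
}

/// Hypothetical model of an "any byte" class: always consumes exactly
/// one byte, even if that byte is only part of a longer character.
fn any_byte_len(input: &[u8]) -> Option<usize> {
    if input.is_empty() { None } else { Some(1) }
}
```

For the input "€a", `any_char_len` gives `Some(3)` because "€" is three bytes in UTF-8, while `any_byte_len` gives `Some(1)`. A one-byte match ends in the middle of the character (`"€a".is_char_boundary(1)` is false), which is the kind of mismatch the report describes.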