Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CLDR-16034 Check for pairing bidi markup chars (illegal); remove 2 (unpaired) RLO in ff_Adlm #3606

Conversation

pedberg-icu
Copy link
Contributor

@pedberg-icu pedberg-icu commented Apr 3, 2024

CLDR-16034

  • This PR completes the ticket.

Add test to check for bidi markup chars that are intended to be used in pairs; these are illegal in CLDR values )only LRM, RLM, ALM are ok). Then run the test to find any such issues; remove two (unpaired) RLO found in ff_Adlm.xml

ALLOW_MANY_COMMITS=true

@pedberg-icu pedberg-icu requested review from srl295 and macchiati April 3, 2024 02:53
@pedberg-icu pedberg-icu self-assigned this Apr 3, 2024
@pedberg-icu pedberg-icu merged commit 9de9528 into unicode-org:main Apr 3, 2024
10 checks passed
@pedberg-icu pedberg-icu deleted the CLDR-16034-error-for-invalid-bidi-markup branch April 3, 2024 05:23
@fitzsim
Copy link

fitzsim commented Apr 3, 2024

Thank you for fixing these. An automated testing tool I am using (rpminspect) was flagging these characters in OpenJDK's import of ff_Adlm.xml, and I was just about to ask a Unicode CLDR maintainer about them.

While investigating this, I found two more RLO characters in common/annotationsDerived/ff_Adlm.xml, lines 93 and 94, which I suspect are also invalid:

<annotation cp="🇧🇼">𞤼𞤵𞤼𞤢𞤱𞤢𞤤 | 𞤼𞤵𞤼𞤢𞤱𞤢𞤤: ‮𞤄𞤮𞤼𞤧𞤵𞤱𞤢𞥄𞤲𞤢</annotation>

<annotation cp="🇧🇼" type="tts">𞤼𞤵𞤼𞤢𞤱𞤢𞤤: ‮𞤄𞤮𞤼𞤧𞤵𞤱𞤢𞥄𞤲𞤢</annotation>

@pedberg-icu
Copy link
Contributor Author

While investigating this, I found two more RLO characters in common/annotationsDerived/ff_Adlm.xml, lines 93 and 94, which I suspect are also invalid:

Those are actually generated from the display name for region "BW" which was fixed in this PR. So the next time the derived annotations are re-generated the problems in common/annotationsDerived/ff_Adlm.xml should go away.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants