Support slicing likely subtags and adding extended likely subtags #2903
Labels
C-data-infra
Component: provider, datagen, fallback, adapters
C-locale
Component: Locale identifiers, BCP47
S-medium
Size: Less than a week (larger bug fix or enhancement)
T-core
Type: Required functionality
CLDR is planning to expand the number of entries in likely subtags data from the current ~1000 to ~7000. This isn't sustainable with our current use of likely subtags for locale fallback.
I suggest we take an approach similar to the one we took with Japanese eras: the default key is defined to contain only the essentials, and an additional "extended" key contains the full set. The caller chooses which one to load at the constructor level.
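As a minimal sketch of the two-key idea: every type and function name below is hypothetical (not an existing ICU4X API), the data struct is a simplified stand-in, and the split of entries between the two keys is arbitrary, since the actual slicing rule would come from the CLDR spec text.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for a likely-subtags payload:
/// language -> (likely script, likely region).
struct SketchLikelySubtags {
    entries: HashMap<&'static str, (&'static str, &'static str)>,
}

/// Hypothetical default key: only the "essential" entries.
/// Which entries count as essential is an assumption made for illustration.
fn load_essentials_key() -> SketchLikelySubtags {
    let mut entries = HashMap::new();
    entries.insert("en", ("Latn", "US"));
    entries.insert("ja", ("Jpan", "JP"));
    SketchLikelySubtags { entries }
}

/// Hypothetical extended key: the long-tail entries not in the default key.
fn load_extended_key() -> SketchLikelySubtags {
    let mut entries = HashMap::new();
    entries.insert("chr", ("Cher", "US"));
    SketchLikelySubtags { entries }
}

/// Hypothetical expander; the constructor decides how much data is loaded.
struct SketchExpander {
    essentials: SketchLikelySubtags,
    extended: Option<SketchLikelySubtags>,
}

impl SketchExpander {
    /// Loads only the default (essentials) key.
    fn new_essentials() -> Self {
        Self {
            essentials: load_essentials_key(),
            extended: None,
        }
    }

    /// Loads the default key plus the extended key.
    fn new_extended() -> Self {
        Self {
            essentials: load_essentials_key(),
            extended: Some(load_extended_key()),
        }
    }

    /// Looks up the likely script and region for a language,
    /// checking the essentials first and then the extended data if loaded.
    fn likely_script_region(&self, language: &str) -> Option<(&'static str, &'static str)> {
        self.essentials
            .entries
            .get(language)
            .or_else(|| {
                self.extended
                    .as_ref()
                    .and_then(|ext| ext.entries.get(language))
            })
            .copied()
    }
}
```

With this shape, a caller that only needs common locales pays for the small key, while `SketchExpander::new_extended()` opts in to the full set; a lookup for a long-tail language simply returns `None` when only the essentials are loaded.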
Based on a discussion with @macchiati, CLDR can add some spec text to inform ICU4X of how to correctly slice the likely subtags data.
I would also like to use this opportunity to possibly consolidate the two copies of likely subtags: the one for fallback only, and the one for the LocaleExpander.
CC @zbraniecki @dminor who have worked on this component.