Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CLDR-17615 Update languages (v46 SBRS) #3700

Merged
merged 4 commits into from
May 15, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 4 additions & 4 deletions common/validity/language.xml
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@
<supplementalData>
<version number="$Revision$"/>
<idValidity>
<id type='language' idStatus='regular'> <!-- 7958 items -->
<id type='language' idStatus='regular'> <!-- 7961 items -->
aa aaa~i aak~l aan~q aas~x aaz
ab aba~j abl~z
aca~b acd~f ach~i ack~n acp~z
Expand Down Expand Up @@ -76,7 +76,7 @@
cia~e cih cik cim~n cip cir ciw ciy
cja cje cjh~i cjk cjm~p cjs cjv cjy
ckb ckh ckl~o ckq~v ckx~z
cla clc cle clh~m clo clt~u clw cly
cla clc cle clh~m clo cls~u clw cly
cma cmc cme cmg cmi cml~m cmo cmr~t
cna~c cng~i cnk~l cno~q cns~u cnw~x
co coa~h coj~q cot~x coz
Expand Down Expand Up @@ -427,7 +427,7 @@
rn rnb rnd rng rnl rnn rnp rnr rnw
ro roa~g rol~m roo~p ror rou row
rpn rpt
rri rro rrt
rri rrm rro rrt
rsb rsk~n rsw
rtc rth rtm rts rtw
ru rub~c rue~i ruk ruo~q rut~u ruy~z
Expand Down Expand Up @@ -522,7 +522,7 @@
vnk vnm vnp
vo vor vot
vra vro vrs~t
vsi vsl vsv
vsi vsl vsn vsv
vto
vum~n vut
vwa
Expand Down
6 changes: 3 additions & 3 deletions common/validity/variant.xml
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@
<supplementalData>
<version number="$Revision$"/>
<idValidity>
<id type='variant' idStatus='regular'> <!-- 110 items -->
<id type='variant' idStatus='regular'> <!-- 117 items -->
1606nict 1694acad 1901 1959acad 1994 1996
abl1943 akuapem alalc97 aluku anpezo ao1990 aranes arkaika asante auvern
baku1926 balanka barla basiceng bauddha bciav bcizbl biscayan biske blasl bohoric boont bornholm
Expand All @@ -29,10 +29,10 @@
metelko monoton
ndyuka nedis newfound nicard njiva nulik
osojs oxendict
pahawh2~4 pamaka peano petr1708 pinyin polyton provenc puter
pahawh2~4 pamaka peano pehoeji petr1708 pinyin polyton provenc puter
rigik rozaj rumgr
scotland scouse simple solba sotav spanglis surmiran sursilv sutsilv synnejyl
tarask tongyong tunumiit
tailo tarask tongyong tunumiit
uccor ucrcor ulster unifon
vaidika valbadia valencia vallader vecdruka vivaraup
wadegile
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -79,6 +79,9 @@ public class GenerateEnums {

private Set<String> languages = new TreeSet<>();

private final Set<String> ignoreMissingAlpha3 =
new TreeSet<>(Arrays.asList("EA", "EZ", "IC", "UN"));
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

More changes like this would belong in https://unicode-org.atlassian.net/browse/CLDR-17637


public static void main(String[] args) throws IOException {
GenerateEnums gen = new GenerateEnums();
gen.showLanguageInfo();
Expand Down Expand Up @@ -365,7 +368,7 @@ public void loadCLDRData() throws IOException {
enum_UN.put(enumName(region), region);
cldrCodes.add(region);
} else {
if (enum_alpha3.get(region) == null) {
if (enum_alpha3.get(region) == null && !ignoreMissingAlpha3.contains(region)) {
System.out.println("Missing alpha3 for: " + region);
}
}
Expand Down
Original file line number Diff line number Diff line change
@@ -1 +1 @@
20240207
20240415
Original file line number Diff line number Diff line change
Expand Up @@ -1553,7 +1553,7 @@ dgk I L Dagba
dgl I L Andaandi
dgn I E Dagoman
dgo I L Dogri (individual language)
dgr dgr dgr I L Dogrib
dgr dgr dgr I L Tlicho
dgs I L Dogoso
dgt I E Ndra'ngith
dgw I E Daungwurrung
Expand Down Expand Up @@ -2662,6 +2662,7 @@ iso I L Isoko
isr I L Israeli Sign Language
ist I L Istriot
isu I L Isu (Menchum Division)
isv I C Interslavic
ita ita ita it I L Italian
itb I L Binongan Itneg
itd I L Southern Tidung
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -1616,7 +1616,7 @@ dgl Dongolawi Dongolawi
dgn Dagoman Dagoman
dgo Dogri (individual language) Dogri (individual language)
dgr Dogrib Dogrib
dgr Tłı̨chǫ Tłı̨chǫ
dgr Tlicho Tlicho
dgs Dogoso Dogoso
dgt Ndra'ngith Ndra'ngith
dgw Daungwurrung Daungwurrung
Expand Down Expand Up @@ -2800,6 +2800,7 @@ iso Isoko Isoko
isr Israeli Sign Language Israeli Sign Language
ist Istriot Istriot
isu Isu (Menchum Division) Isu (Menchum Division)
isv Interslavic Interslavic
ita Italian Italian
itb Binongan Itneg Itneg, Binongan
itd Southern Tidung Tidung, Southern
Expand Down
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
File-Date: 2023-10-16
File-Date: 2024-03-07
%%
Type: language
Subtag: aa
Expand Down Expand Up @@ -882,6 +882,7 @@ Type: language
Subtag: sa
Description: Sanskrit
Added: 2005-10-16
Scope: macrolanguage
%%
Type: language
Subtag: sc
Expand Down Expand Up @@ -8028,6 +8029,12 @@ Description: Lowland Oaxaca Chontal
Added: 2009-07-29
%%
Type: language
Subtag: cls
Description: Classical Sanskrit
Added: 2024-03-04
Macrolanguage: sa
%%
Type: language
Subtag: clt
Description: Lautu Chin
Added: 2012-08-12
Expand Down Expand Up @@ -30916,6 +30923,11 @@ Description: Ririo
Added: 2009-07-29
%%
Type: language
Subtag: rrm
Description: Moriori
Added: 2024-03-04
%%
Type: language
Subtag: rro
Description: Waima
Added: 2009-07-29
Expand Down Expand Up @@ -37660,6 +37672,12 @@ Description: Venezuelan Sign Language
Added: 2009-07-29
%%
Type: language
Subtag: vsn
Description: Vedic Sanskrit
Added: 2024-03-04
Macrolanguage: sa
%%
Type: language
Subtag: vsv
Description: Valencian Sign Language
Description: Llengua de signes valenciana
Expand Down Expand Up @@ -47559,6 +47577,13 @@ Comments: Aluku dialect of the "Busi Nenge Tongo" English-based Creole
continuum in Eastern Suriname and Western French Guiana
%%
Type: variant
Subtag: anpezo
Description: Anpezo standard of Ladin
Added: 2024-03-04
Prefix: lld
Comments: Represents the standard written form of Ladin in Anpezo
%%
Type: variant
Subtag: ao1990
Description: Portuguese Language Orthographic Agreement of 1990 (Acordo
Ortográfico da Língua Portuguesa de 1990)
Expand Down Expand Up @@ -47779,6 +47804,22 @@ Added: 2012-02-05
Prefix: en
%%
Type: variant
Subtag: fascia
Description: Fascia standard of Ladin
Added: 2024-03-04
Prefix: lld
Comments: Represents the standard written form of Ladin in Fascia which
unified the three subvarieties Cazet, Brach and Moenat
%%
Comment on lines +47807 to +47812
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@DavidLRowe FYI. this may be a factor in your other PR

Type: variant
Subtag: fodom
Description: Fodom standard of Ladin
Added: 2024-03-04
Prefix: lld
Comments: Represents the standard written form of Ladin in Livinallongo
and Colle Santa Lucia
%%
Type: variant
Subtag: fonipa
Description: International Phonetic Alphabet
Added: 2006-12-11
Expand Down Expand Up @@ -47819,6 +47860,13 @@ Prefix: oc
Comments: Occitan variant spoken in Gascony
%%
Type: variant
Subtag: gherd
Description: Gherdëina standard of Ladin
Added: 2024-03-04
Prefix: lld
Comments: Represents the standard written form of Ladin in Gherdëina
%%
Type: variant
Subtag: grclass
Description: Classical Occitan orthography
Added: 2018-04-22
Expand Down Expand Up @@ -48120,6 +48168,15 @@ Comments: Peano’s Interlingua, created in 1903 by Giuseppe Peano as an
Added: 2020-03-12
%%
Type: variant
Subtag: pehoeji
Description: Hokkien Vernacular Romanization System
Description: Pe̍h-ōe-jī orthography/romanization
Added: 2024-03-04
Prefix: nan-Latn
Comments: Modern Hokkien Vernacular Romanization System, evolved from
the New Dictionary in the Amoy by John Van Nest Talmage in 1894
%%
Type: variant
Subtag: petr1708
Description: Petrine orthography
Added: 2010-10-10
Expand Down Expand Up @@ -48254,6 +48311,16 @@ Added: 2021-07-17
Prefix: da
%%
Type: variant
Subtag: tailo
Description: Taiwanese Hokkien Romanization System for Hokkien
languages
Description: Tâi-lô orthography/romanization
Added: 2024-03-04
Prefix: nan-Latn
Comments: Taiwanese Hokkien Romanization System (Tâi-lô) published in
2006 by the Taiwan Ministry of Education
%%
Type: variant
Subtag: tarask
Description: Belarusian in Taraskievica orthography
Added: 2007-04-27
Expand Down Expand Up @@ -48317,6 +48384,15 @@ Comments: The most ancient dialect of Sanskrit used in verse and prose
composed until about the 4th century B.C.E.
%%
Type: variant
Subtag: valbadia
Description: Val Badia standard of Ladin
Added: 2024-03-04
Prefix: lld
Comments: Represents the standard written form of Ladin in the Val
Badia, unifying the three variants Marô, Mesaval and Badiot spoken
in this valley
%%
Type: variant
Subtag: valencia
Description: Valencian
Added: 2007-03-06
Expand Down
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
# Version 2024022700, Last Updated Tue Feb 27 07:07:02 2024 UTC
# Version 2024050800, Last Updated Wed May 8 07:07:02 2024 UTC
AAA
AARP
ABB
Expand Down Expand Up @@ -85,7 +85,6 @@ AUSPOST
AUTHOR
AUTO
AUTOS
AVIANCA
AW
AWS
AX
Expand Down Expand Up @@ -510,7 +509,6 @@ GROUP
GS
GT
GU
GUARDIAN
GUCCI
GUGE
GUIDE
Expand Down
Loading