diff --git a/java/jv-examples.js b/java/jv-examples.js index 850ec0443..a06fab2cc 100644 --- a/java/jv-examples.js +++ b/java/jv-examples.js @@ -1,21 +1,38 @@ /* @ Source: -§ 'Yogyakarta' -# 200 -~ ltr -! jv + +native | meaning | IPA① | transcription | other transcriptions② | notes | wiktionary③ | rawIPA + + +① separate each text unit (consonant+combining_mark(s)) with §, separate phonetic syllables with . + use ‹ and › for a sound that spans 2 text units + use – for a text unit that doesn't produce sound + syllable boundary markers go inside previous unit, and emphasis markers begin emphasised unit + +② the previous transcription field is always Latin. + if there is another key orthography (eg. Mongolian cyrillic) it goes here. + vowelled/unvowelled alternatives also go here + +③ x means that no link to Wiktionary should be added + if Wiktionary spells the word differently, add the spelling here */ + +otherTranscription = true // kicks in the latest order of items + + + + autoExpandExamples.jv = ` ꧇꧑꧕꧇ꦎꦏ꧀ꦠꦺꦴꦧꦼꦂ꧇꧒꧐꧑꧒꧇|15 october 2012| -ꦄꦪꦸ|Ayu|ˈa§ju*|ayu|personal name +ꦄꦪꦸ|Ayu|ˈa§ju|ayu|personal name ꦄꦫꦶꦱ꧀ꦠꦺꦴꦠꦼꦭꦼꦱ꧀|Aristotle| ꦄꦭ꧀ꦭꦃ|Allah||allah ꦆꦤ꧀ꦠꦼꦫꦏ꧀ꦱꦶ|interaction||interaksi ꦉꦩꦸꦏ꧀ꦗꦸꦁ|| ꦉꦩꦸꦗꦸꦁ|| -ꦊꦩꦃꦊꦩ꧀ꦧꦸꦠ꧀|gentle|lə§mah §lə§m§bu§t*|lemah lembut +ꦊꦩꦃꦊꦩ꧀ꦧꦸꦠ꧀|gentle|lə§mah §lə§m§bu§t|lemah lembut ꦊꦩ꧀ꦧꦸ|| ꦊꦭꦸꦲꦸꦂ|| ꦏꦁꦩꦱ꧀|| @@ -223,7 +240,7 @@ autoExpandExamples.jv = ` ꦩꦪꦁ|| ꦩꦭꦲꦺꦏꦠ꧀|| ꦩꦱꦾꦫꦏꦠ꧀|| -ꦩꦲꦺꦴꦱ꧀|to read (polite)|ma§o§s*|maos +ꦩꦲꦺꦴꦱ꧀|to read (polite)|ma§o§s|maos ꦩꦶꦠꦿ|| ꦩꦸꦏ|| ꦩꦸꦝ|young||mudha @@ -312,7 +329,7 @@ autoExpandExamples.jv = ` ꦲꦧꦁ|red|ˈa.baŋ/ˈɔ.bɔŋ ꦲꦩ꧀ꦧꦿꦤꦁ|turn fiery red||ambranaŋ ꦲꦪꦤ꧀|| -ꦲꦪꦸ|graceful|ˈa§ju*|ayu +ꦲꦪꦸ|graceful|ˈa§ju|ayu ꦲꦭꦱ꧀|| ꦲꦭꦺꦴꦤ꧀ꦲꦭꦺꦴꦤ꧀|| ꦲꦱꦸ|| @@ -345,6 +362,13 @@ autoExpandExamples.jv = ` +@ other +ꦩꦔꦤ꧀ꦱꦺꦴꦠꦺꦴ||ma§ŋa§n §so§to|magnan soto + + + + + @https://en.wiktionary.org/wiki/Category:Javanese_terms_with_IPA_pronunciation ꦲꦶꦩ꧀ꦧ|brow|i§m.§ba|imba|imba||imba ꦲꦭꦶꦱ꧀|eyebrow|a.§li§s|alis|alis||alis diff --git a/java/jv.html b/java/jv.html index 990e870f9..0967ee56c 100755 --- a/java/jv.html +++ b/java/jv.html @@ -173,13 +173,13 @@
Syllable-initial clusters use 3 dedicated combining marks for the second consonant. ❯ onsets
-Word-final consonant sounds may be represented by 4 dedicated combining marks. Otherwise, if nothing follows, they are ordinary consonants followed by a visible ꧀ [U+A9C0 JAVANESE PANGKON]. ❯ finals
+Word-final consonant sounds may be represented by 4 dedicated combining marks. Otherwise, if nothing follows, they are ordinary consonants followed by a visible A9C0. ❯ finals
The Javanese orthography is an abugida with 2 inherent vowels. Other post-consonant vowels are written using 5 vowel signs, all combining marks. It is mandatory for 2 of the consonant+vowel sequences to be written using vocalics, rather than vowel signs. ❯ vowels
There is 1 pre-base glyph and no circumgraphs. Only one multipart vowel is used in modern Javanese orthography, although more exist when writing Sanskrit or Kawi in this script. The multipart vowel used in modern Javanese involves only 2 vowel signs, and places glyphs on either side of the base consonant(s). ❯ compositeV
-Standalone vowel sounds are written using independent vowels when there is a need to distinguish normal words from proper nouns or foreign words. The normal approach for standalone vowels is to apply vowel signs to ꦲ [U+A9B2 JAVANESE LETTER HA]. ❯ standalone
+Standalone vowel sounds are written using independent vowels when there is a need to distinguish normal words from proper nouns or foreign words. The normal approach for standalone vowels is to apply vowel signs to A9B2. ❯ standalone
Javanese has vocalics. ❯ vocalics
@@ -589,12 +589,12 @@Syllable-final consonants are usually a single sound.
@@ -713,8 +713,8 @@Additional information about archaic forms and variants of these characters and others mentioned below can be found in the character notes document.
-ꦴ [U+A9B4 JAVANESE VOWEL SIGN TARUNG] was originally a length mark, but in modern Javanese it is only used in the combination described in the section compositeV.
-Modern Javanese doesn't write the sounds rə and lə using the consonant+vowel combinations *ꦫꦼ [U+A9AB JAVANESE LETTER RA + U+A9BC JAVANESE VOWEL SIGN PEPET] or *ꦭꦼ [U+A9AD JAVANESE LETTER LA + U+A9BC JAVANESE VOWEL SIGN PEPET]. Vocalic letters are used instead,ws,#Swara eg. +
A9B4 was originally a length mark, but in modern Javanese it is only used in the combination described in the section compositeV.
+Modern Javanese doesn't write the sounds rə and lə using the consonant+vowel combinations *ꦫꦼ U+A9AB JAVANESE LETTER RA + U+A9BC JAVANESE VOWEL SIGN PEPET or *ꦭꦼ U+A9AD JAVANESE LETTER LA + U+A9BC JAVANESE VOWEL SIGN PEPET. Vocalic letters are used instead,ws,#Swara eg. ꦊꦩꦃꦊꦩ꧀ꦧꦸꦠ꧀
When a vowel sign follows a subjoined consonant it appears above the stack.
@@ -750,8 +750,7 @@In Javanese, unlike many other scripts, including Balinese, when a vowel is represented by multiple glyphs either side of a base character two separate combining marks need to be added to the base. Javanese has no circumgraphs (ie. a single code point that places glyphs around the base).
-Only one such multipart vowel is used for modern Javanese, ꦺꦴ -[U+A9BA JAVANESE VOWEL SIGN TALING + U+A9B4 JAVANESE VOWEL SIGN TARUNG], which represents the sound o. Both characters are typed and stored after the base character, and should be in the order shown.
+Only one such multipart vowel is used for modern Javanese, A9BA A9B4, which represents the sound o. Both characters are typed and stored after the base character, and should be in the order shown.
A two-part vowel associated with a consonant cluster appears before and after the whole cluster, whether it is conjoined or stacked. This is a reminder that vowel signs are applied to the orthographic syllable, rather than to a single letter. In fig_vs_conjoined this actually extends across a word boundary, since the last and first letters of the adjoining words form a conjunct cluster. This means that the pre-base part of the vowel sign appears to be within the previous word.
+A two-part vowel associated with a consonant cluster appears before and after the whole cluster, whether it is conjoined or stacked. This is a reminder that vowel signs are applied to the orthographic syllable, rather than to a single letter. In fig_composite_word_break this actually extends across a word boundary, since the last and first letters of the adjoining words form a conjunct cluster. This means that the pre-base part of the vowel sign appears to be within the previous word.
+The left-hand column contains basic Javanese consonant letters and consonants using the cecak telu; the right-hand column contains murda and mahaprana letters.
General: | -+ |
---|
If the final word in a phrase ends with ◌꧀ [U+A9C0 JAVANESE PANGKON], a space alone is sufficient to indicate a phrase boundary. Otherwise, ꧈ [U+A9C8 JAVANESE PADA LINGSA] is used.
-꧇ [U+A9C7 JAVANESE PADA PANGKAT] is equivalent to a colon.
-The sequence ꧀꧈ [U+A9C0 JAVANESE PANGKON + U+A9C8 JAVANESE PADA LINGSA] indicates a sentence boundary. If there is no pangkon, ꧉ [U+A9C9 JAVANESE PADA LUNGSI] is used.
+If the final word in a phrase ends with A9C0, a space alone is sufficient to indicate a phrase boundary. Otherwise, A9C8 is used.
+A9C7 is equivalent to a colon.
+ +The sequence A9C0 A9C8 indicates a sentence boundary. If there is no pangkon, A9C9 is used.
All of the above punctuation marks are optionally followed by a space, if they occur inside a paragraph.
-A paragraph and or a section typically begins with ꧋ [U+A9CB JAVANESE PADA ADEG ADEG]. This punctutation is also used before other short runs of text, such as subtitles, list items, etc.
-꧊ [U+A9CA JAVANESE PADA ADEG] is a general divider. See also quotations.
+A paragraph and or a section typically begins with A9CB. This punctutation is also used before other short runs of text, such as subtitles, list items, etc.
+A9CA is a general divider. See also quotations.
Letters may begin with ꧋꧆꧋ if the writer doesn't want to indicate a distinction regarding age or rank between themselves and the reader. Otherwise, for more formal letters, they can choose one of three alternatives provided as single characters in the Javanese Unicode block.
The end of a letter can be signaled using ꧉꧆꧉ This combination may also involve just ꧆꧉or may be repeated with spaces between to fill the linee, eg.
@@ -2830,7 +2831,7 @@
In poetry ꧅ꦧ꧀ꦖ꧅ or ꧅ꦧ꧀ꦕ꧅ (purwapada) introduces a poem; ꧅ꦟ꧀ꦢꦿ꧅ (madyapada) introduces a new song within a poem; and ꧅ꦆ꧅ (wasanapada) indicates the end of a poem.
-Optionally, ꧉ [U+A9C9 JAVANESE PADA LUNGSI] can be added to the above with some space around it. The spaces should be non-breaking, since there should be no line-breaks between the constituent partse, eg.
+
Optionally, A9C9 can be added to the above with some space around it. The spaces should be non-breaking, since there should be no line-breaks between the constituent partse, eg.
꧅ ꧉ ꦧ꧀ꦖ ꧉ ꧅
A9CA
A9CA
A9CC
A9CD
Typically a pair of ꧊ [U+A9CA JAVANESE PADA ADEG] characters are used.
-Alternatively, the pair of characters ꧌ [U+A9CC JAVANESE PADA PISELEH] and ꧍ [U+A9CD JAVANESE TURNED PADA PISELEH] may be used.
+Typically a pair of A9CA characters are used.
+Alternatively, the pair of characters A9CC and A9CD may be used.
@@ -2887,7 +2888,7 @@As for parentheses, Javanese text may use ꧊ [U+A9CA JAVANESE PADA ADEG] for quotation marks.
+As for parentheses, Javanese text may use A9CA for quotation marks.
top level | -- | + | A9CA |
+A9CA |
---|---|---|---|---|
alternative | -- | + | A9CC |
+A9CD |
Alternatively, the pair of characters ꧌ [U+A9CC JAVANESE PADA PISELEH] and ꧍ [U+A9CD JAVANESE TURNED PADA PISELEH] may be used.
+Alternatively, the pair of characters A9CC and A9CD may be used.
@@ -2922,8 +2923,8 @@To draw attention to text Javanese may use the same characters as are used for parentheses and quotations, ie. a paired set of ꧊ [U+A9CA JAVANESE PADA ADEG] characters around the relevant text, or the two characters ꧌ [U+A9CC JAVANESE PADA PISELEH] and ꧍ [U+A9CD JAVANESE TURNED PADA PISELEH] can be used similarly.
-Sometimes just ꧌ [U+A9CC JAVANESE PADA PISELEH] is repeated.
+To draw attention to text Javanese may use the same characters as are used for parentheses and quotations, ie. a paired set of A9CA characters around the relevant text, or the two characters A9CC and A9CD can be used similarly.
+Sometimes just A9CC is repeated.
@@ -2939,7 +2940,7 @@According to Everson ꧈ [U+A9C8 JAVANESE PADA LINGSA] is used for acronyms, eg. ꧈ꦢꦺ꧈ꦲ꧈ꦌꦭ꧀꧈
+According to Everson A9C8 is used for acronyms, eg. ꧈ꦢꦺ꧈ꦲ꧈ꦌꦭ꧀꧈
It is also used after initials in a name,e eg. ꦫ꧈ꦩ꧈ꦯꦸꦭꦂꦠ
A repeated syllable can be represented by ꧏ [U+A9CF JAVANESE PANGRANGKEP], which is derived from the arabic-indic digit for 2, eg. ꦧꦸꦏꦸꧏ buku-buku books It can be transcribed as buku2.
+A repeated syllable can be represented by A9CF, which is derived from the arabic-indic digit for 2, eg. ꦧꦸꦏꦸꧏ buku-buku books It can be transcribed as buku2.
-For 'ditto' marks in vertical lists, Javanese uses ꧉ [U+A9C9 JAVANESE PADA LUNGSI] .
+For 'ditto' marks in vertical lists, Javanese uses A9C9 .
According to Wikipedia ꧟ [U+A9DF JAVANESE PADA ISEN-ISEN] is used in handwriting to indicate a correction in Yogyakarta, eg. where a scribe wanted to write pada luhur but actually wrote pada wu.. they would use this mark as follows: ꦥꦢꦮꦸ꧟꧟꧟ꦭꦸꦲꦸꦂ
-In Yogyakarta they would use the character ꧞ [U+A9DE JAVANESE PADA TIRTA TUMETES] instead.
+According to Wikipedia A9DF is used in handwriting to indicate a correction in Yogyakarta, eg. where a scribe wanted to write pada luhur but actually wrote pada wu.. they would use this mark as follows: ꦥꦢꦮꦸ꧟꧟꧟ꦭꦸꦲꦸꦂ
+In Yogyakarta they would use the character A9DE instead.
Because there are no spaces between words, and because the end of one word and the beginning of another often form conjuncts (see fig_pangandika), Javanese doesn't wrap at word boundaries. Instead, it wraps at syllable boundaries where no conjuncts are involved.
-Unfortunately, modern browsers are often unable to detect appropriate break points for Javanese. In the sample text at the beginning of this page U+200B ZERO WIDTH SPACE] is used at places where the line could be broken. Otherwise, the line would continue, unbroken off the right side of the page. [
+Unfortunately, modern browsers are often unable to detect appropriate break points for Javanese. In the sample text at the beginning of this page 200B is used at places where the line could be broken. Otherwise, the line would continue, unbroken off the right side of the page.
Hyphenation, per se, is not used. See an interesting discussion about Javanese & Balinese line-breaking on GitHub.
@@ -3019,7 +3020,7 @@In some materials, when a new line begins with ꦺ [U+A9BA JAVANESE VOWEL SIGN TALING], an additional spacing taling is placed at the end of the previous line.
+In some materials, when a new line begins with A9BA, an additional spacing taling is placed at the end of the previous line.