diff --git a/java/jv-examples.js b/java/jv-examples.js index 850ec0443..a06fab2cc 100644 --- a/java/jv-examples.js +++ b/java/jv-examples.js @@ -1,21 +1,38 @@ /* @ Source: -§ 'Yogyakarta' -# 200 -~ ltr -! jv + +native | meaning | IPA① | transcription | other transcriptions② | notes | wiktionary③ | rawIPA + + +① separate each text unit (consonant+combining_mark(s)) with §, separate phonetic syllables with . + use ‹ and › for a sound that spans 2 text units + use – for a text unit that doesn't produce sound + syllable boundary markers go inside previous unit, and emphasis markers begin emphasised unit + +② the previous transcription field is always Latin. + if there is another key orthography (eg. Mongolian cyrillic) it goes here. + vowelled/unvowelled alternatives also go here + +③ x means that no link to Wiktionary should be added + if Wiktionary spells the word differently, add the spelling here */ + +otherTranscription = true // kicks in the latest order of items + + + + autoExpandExamples.jv = ` ꧇꧑꧕꧇ꦎꦏ꧀ꦠꦺꦴꦧꦼꦂ꧇꧒꧐꧑꧒꧇|15 october 2012| -ꦄꦪꦸ|Ayu|ˈa§ju*|ayu|personal name +ꦄꦪꦸ|Ayu|ˈa§ju|ayu|personal name ꦄꦫꦶꦱ꧀ꦠꦺꦴꦠꦼꦭꦼꦱ꧀|Aristotle| ꦄꦭ꧀ꦭꦃ|Allah||allah ꦆꦤ꧀ꦠꦼꦫꦏ꧀ꦱꦶ|interaction||interaksi ꦉꦩꦸꦏ꧀ꦗꦸꦁ|| ꦉꦩꦸꦗꦸꦁ|| -ꦊꦩꦃꦊꦩ꧀ꦧꦸꦠ꧀|gentle|lə§mah §lə§m§bu§t*|lemah lembut +ꦊꦩꦃꦊꦩ꧀ꦧꦸꦠ꧀|gentle|lə§mah §lə§m§bu§t|lemah lembut ꦊꦩ꧀ꦧꦸ|| ꦊꦭꦸꦲꦸꦂ|| ꦏꦁꦩꦱ꧀|| @@ -223,7 +240,7 @@ autoExpandExamples.jv = ` ꦩꦪꦁ|| ꦩꦭꦲꦺꦏꦠ꧀|| ꦩꦱꦾꦫꦏꦠ꧀|| -ꦩꦲꦺꦴꦱ꧀|to read (polite)|ma§o§s*|maos +ꦩꦲꦺꦴꦱ꧀|to read (polite)|ma§o§s|maos ꦩꦶꦠꦿ|| ꦩꦸꦏ|| ꦩꦸꦝ|young||mudha @@ -312,7 +329,7 @@ autoExpandExamples.jv = ` ꦲꦧꦁ|red|ˈa.baŋ/ˈɔ.bɔŋ ꦲꦩ꧀ꦧꦿꦤꦁ|turn fiery red||ambranaŋ ꦲꦪꦤ꧀|| -ꦲꦪꦸ|graceful|ˈa§ju*|ayu +ꦲꦪꦸ|graceful|ˈa§ju|ayu ꦲꦭꦱ꧀|| ꦲꦭꦺꦴꦤ꧀ꦲꦭꦺꦴꦤ꧀|| ꦲꦱꦸ|| @@ -345,6 +362,13 @@ autoExpandExamples.jv = ` +@ other +ꦩꦔꦤ꧀ꦱꦺꦴꦠꦺꦴ||ma§ŋa§n §so§to|magnan soto + + + + + @https://en.wiktionary.org/wiki/Category:Javanese_terms_with_IPA_pronunciation ꦲꦶꦩ꧀ꦧ|brow|i§m.§ba|imba|imba||imba ꦲꦭꦶꦱ꧀|eyebrow|a.§li§s|alis|alis||alis diff --git a/java/jv.html b/java/jv.html index 990e870f9..0967ee56c 100755 --- a/java/jv.html +++ b/java/jv.html @@ -173,13 +173,13 @@

Basic features

Syllable-initial clusters use 3 dedicated combining marks for the second consonant. ❯ onsets

-

Word-final consonant sounds may be represented by 4 dedicated combining marks. Otherwise, if nothing follows, they are ordinary consonants followed by a visible [U+A9C0 JAVANESE PANGKON]. ❯ finals

+

Word-final consonant sounds may be represented by 4 dedicated combining marks. Otherwise, if nothing follows, they are ordinary consonants followed by a visible A9C0. ❯ finals

The Javanese orthography is an abugida with 2 inherent vowels. Other post-consonant vowels are written using 5 vowel signs, all combining marks. It is mandatory for 2 of the consonant+vowel sequences to be written using vocalics, rather than vowel signs. ❯ vowels

There is 1 pre-base glyph and no circumgraphs. Only one multipart vowel is used in modern Javanese orthography, although more exist when writing Sanskrit or Kawi in this script. The multipart vowel used in modern Javanese involves only 2 vowel signs, and places glyphs on either side of the base consonant(s). ❯ compositeV

-

Standalone vowel sounds are written using independent vowels when there is a need to distinguish normal words from proper nouns or foreign words. The normal approach for standalone vowels is to apply vowel signs to [U+A9B2 JAVANESE LETTER HA]. ❯ standalone

+

Standalone vowel sounds are written using independent vowels when there is a need to distinguish normal words from proper nouns or foreign words. The normal approach for standalone vowels is to apply vowel signs to A9B2. ❯ standalone

Javanese has vocalics. ❯ vocalics

@@ -589,12 +589,12 @@

Structure

Syllable-final consonants are usually a single sound.

@@ -713,8 +713,8 @@

Combining marks used for vowels

ꦶ␣ꦸ␣ꦺ␣ꦼ␣ꦴ

Additional information about archaic forms and variants of these characters and others mentioned below can be found in the character notes document.

-

[U+A9B4 JAVANESE VOWEL SIGN TARUNG​] was originally a length mark, but in modern Javanese it is only used in the combination described in the section compositeV.

-

Modern Javanese doesn't write the sounds and using the consonant+vowel combinations *ꦫꦼ [U+A9AB JAVANESE LETTER RA + U+A9BC JAVANESE VOWEL SIGN PEPET] or *ꦭꦼ [U+A9AD JAVANESE LETTER LA + U+A9BC JAVANESE VOWEL SIGN PEPET]. Vocalic letters are used instead,ws,#Swara eg. +

A9B4 was originally a length mark, but in modern Javanese it is only used in the combination described in the section compositeV.

+

Modern Javanese doesn't write the sounds and using the consonant+vowel combinations *ꦫꦼ U+A9AB JAVANESE LETTER RA + U+A9BC JAVANESE VOWEL SIGN PEPET or *ꦭꦼ U+A9AD JAVANESE LETTER LA + U+A9BC JAVANESE VOWEL SIGN PEPET. Vocalic letters are used instead,ws,#Swara eg. ꦊꦩꦃꦊꦩ꧀ꦧꦸꦠ꧀

When a vowel sign follows a subjoined consonant it appears above the stack.

@@ -750,8 +750,7 @@

Multipart vowel

In Javanese, unlike many other scripts, including Balinese, when a vowel is represented by multiple glyphs either side of a base character two separate combining marks need to be added to the base. Javanese has no circumgraphs (ie. a single code point that places glyphs around the base).

-

Only one such multipart vowel is used for modern Javanese, ꦺꦴ -[U+A9BA JAVANESE VOWEL SIGN TALING + U+A9B4 JAVANESE VOWEL SIGN TARUNG], which represents the sound o. Both characters are typed and stored after the base character, and should be in the order shown.

+

Only one such multipart vowel is used for modern Javanese, A9BA A9B4, which represents the sound o. Both characters are typed and stored after the base character, and should be in the order shown.

@@ -768,7 +767,7 @@

Multipart vowel

-

A two-part vowel associated with a consonant cluster appears before and after the whole cluster, whether it is conjoined or stacked. This is a reminder that vowel signs are applied to the orthographic syllable, rather than to a single letter. In fig_vs_conjoined this actually extends across a word boundary, since the last and first letters of the adjoining words form a conjunct cluster. This means that the pre-base part of the vowel sign appears to be within the previous word.

+

A two-part vowel associated with a consonant cluster appears before and after the whole cluster, whether it is conjoined or stacked. This is a reminder that vowel signs are applied to the orthographic syllable, rather than to a single letter. In fig_composite_word_break this actually extends across a word boundary, since the last and first letters of the adjoining words form a conjunct cluster. This means that the pre-base part of the vowel sign appears to be within the previous word.

+

The left-hand column contains basic Javanese consonant letters and consonants using the cecak telu; the right-hand column contains murda and mahaprana letters.

- + @@ -1353,7 +1353,7 @@

Basic (nglegéna) consonants

-

[U+A9B2 JAVANESE LETTER HA] represents either ha or the standalone vowel a

+

A9B2 represents either ha or the standalone vowel a

@@ -1370,9 +1370,9 @@

Murda letters

ꦟ␣ꦘ
-

[U+A996 JAVANESE LETTER CA MURDA] is only attested as a subjoined form, ꧀ꦖ The non-subjoined forms shown in this paragraph and the above list are a modern-day reinvention.ws,#Wyanjana

-

[U+A9AF JAVANESE LETTER SA MURDA] is a rare letter which, when used with other characters, represents the Chinese sound se, ie. ꦯ͜ꦌ̈ Another represents the Chinese syo, ie. ꦯ꦳ꦾꦺꦴ

-

[U+A9AC JAVANESE LETTER RA AGUNG] not used in modern text, and also not widely known, was used historically by some writers to address royal figures..ws,#Wyanjana

+

A996 is only attested as a subjoined form, ꧀ꦖ The non-subjoined forms shown in this paragraph and the above list are a modern-day reinvention.ws,#Wyanjana

+

A9AF is a rare letter which, when used with other characters, represents the Chinese sound se, ie. ꦯ͜ꦌ̈ Another represents the Chinese syo, ie. ꦯ꦳ꦾꦺꦴ

+

A9AC not used in modern text, and also not widely known, was used historically by some writers to address royal figures..ws,#Wyanjana

@@ -1393,11 +1393,11 @@

Mahaprana letters

Repertoire extension

The following combinations, called aksara rékan (ꦲꦏ꧀ꦱꦫꦫꦺꦏꦤ꧀), are used to represent foreign sounds. There may be some variance around which combinations produce which sounds.ws,#R%C3%A9kan

-

Javanese uses [U+A9B3 JAVANESE SIGN CECAK TELU] with a similar consonant to represent most foriegn sounds, initially those from Arabic, but then also those from Dutch, Indonesian, and English.

+

Javanese uses A9B3 with a similar consonant to represent most foriegn sounds, initially those from Arabic, but then also those from Dutch, Indonesian, and English.

ꦐ␣ꦔ꦳␣ꦥ꦳␣ꦢ꦳␣ꦗ꦳␣ꦱ꦳␣ꦏ꦳␣ꦒ꦳␣ꦲ꦳
-

[U+A990 JAVANESE LETTER KA SASAK] is used for writing q in Sasak.

+

A990 is used for writing q in Sasak.

When consonants are subjoined there can be some ambiguity about which consonant the cecak telu applies to. For example, the following look identical:ꦏ꦳꧀ꦗ kˑ͓ʤ kza ꦏ꧀ꦗ꦳ k͓ʤˑ xja

Wikipedia has a set of Chinese sounds that are represented using some combining characters from a non-Javanese block.ws,#Additional_Aksara

@@ -1433,7 +1433,7 @@

Onset consonants

Final consonants

-

Word-final consonant sounds with no following consonant may be represented by ordinary consonant characters, followed by a visible [U+A9C0 JAVANESE PANGKON] character, eg. ꦏꦿꦸꦥꦸꦏ꧀ ꦏꦔꦼꦤ꧀

+

Word-final consonant sounds with no following consonant may be represented by ordinary consonant characters, followed by a visible A9C0 character, eg. ꦏꦿꦸꦥꦸꦏ꧀ ꦏꦔꦼꦤ꧀

If another word or consonant does follow the word-final consonant, the pangkon is still used, but becomes invisible and results in the stacking of the two consonants. (See clusters.)

However, there is also a set of dedicated combining characters (seisigeg) that dispense with the need for the pangkan.

Four syllable-final consonant sounds are represented using combining characters.

@@ -1557,7 +1557,7 @@

Conjoined consonants

mahaprana glyphs
ꦰ꧀ꦰ
-

The conjoined [U+A9B1 JAVANESE LETTER SA] is unusual in that it also adds a glyph (resembling the vowel sign [U+A9B8 JAVANESE VOWEL SIGN SUKU]) below the initial consonant. This helps distinguish it from the conjoined p. See fig_conjoined_p for an example in the word ꦲꦏ꧀ꦱꦫ.

+

The conjoined A9B1 is unusual in that it also adds a glyph (resembling the vowel sign A9B8) below the initial consonant. This helps distinguish it from the conjoined p. See fig_conjoined_p for an example in the word ꦲꦏ꧀ꦱꦫ.

-

Several of the digits are identical to letters of the alphabet, so [U+A9C7 JAVANESE PADA PANGKAT] is often used around numbers, eg. +

Several of the digits are identical to letters of the alphabet, so A9C7 is often used around numbers, eg. ꧇꧑꧕꧇ꦎꦏ꧀ꦠꦺꦴꦧꦼꦂ꧇꧒꧐꧑꧒꧇

@@ -2402,14 +2402,14 @@

Shaping

Glyph shaping is required for Javanese. One principle area is that of subjoined or postfixed consonants, which often interact typographically with the preceding consonant.

Not all fonts show the same shaping behaviours.

-

In fig_k_joins, the three syllables, each containing a k-k stack, show how the font adapts the subjoined [U+A98F JAVANESE LETTER KA] at the bottom right according to what follows it.

+

In fig_k_joins, the three syllables, each containing a k-k stack, show how the font adapts the subjoined A98F at the bottom right according to what follows it.

 ꦏ꧀ꦏꦿ  ꦏ꧀ꦏ ꦏ꧀ꦏꦾ
Adaptations of the lower right of a subjoined k.
-

The following two syllables show how the font changes the shape of ꦿ [U+A9BF JAVANESE CONSONANT SIGN CAKRA​] to match the depth of the syllable.

+

The following two syllables show how the font changes the shape of A9BF to match the depth of the syllable.

ꦏꦿ ꦏ꧀ꦏꦿ @@ -2770,18 +2770,18 @@

Phrase & section boundaries

- + - + - + @@ -2789,16 +2789,17 @@

Phrase & section boundaries

- +
General:
ꦥ␣ꦧ␣ꦠ␣ꦢ␣ꦛ␣ꦝ␣ꦏ␣ꦒ␣ꦐ␣ꦔ꦳
ꦥ␣ꦧ␣ꦠ␣ꦢ␣ꦛ␣ꦝ␣ꦏ␣ꦒ␣ꦔ꦳
ꦦ␣ꦨ␣ꦡ␣ꦑ␣ꦓ␣ ␣ꦣ␣ꦜ␣ꦞ
phrase

U+0020 SPACE

-

[U+A9C8 JAVANESE PADA LINGSA]

-

[U+A9C7 JAVANESE PADA PANGKAT]

0020

+

A9C8

+

A9C7

sentence

[U+A9C9 JAVANESE PADA LUNGSI]

-

[U+A9C8 JAVANESE PADA LINGSA] after ◌꧀ [U+A9C0 JAVANESE PANGKON​]

A9C9

+

A9C8 after A9C0

paragraph [U+A9CB JAVANESE PADA ADEG ADEG]A9CB
section
general divider [U+A9CA JAVANESE PADA ADEG] A9CA 
-

If the final word in a phrase ends with ◌꧀ [U+A9C0 JAVANESE PANGKON​], a space alone is sufficient to indicate a phrase boundary. Otherwise, [U+A9C8 JAVANESE PADA LINGSA] is used.

-

[U+A9C7 JAVANESE PADA PANGKAT] is equivalent to a colon.

-

The sequence ꧀꧈ [U+A9C0 JAVANESE PANGKON + U+A9C8 JAVANESE PADA LINGSA] indicates a sentence boundary. If there is no pangkon, [U+A9C9 JAVANESE PADA LUNGSI] is used.

+

If the final word in a phrase ends with A9C0, a space alone is sufficient to indicate a phrase boundary. Otherwise, A9C8 is used.

+

A9C7 is equivalent to a colon.

+ +

The sequence A9C0 A9C8 indicates a sentence boundary. If there is no pangkon, A9C9 is used.

All of the above punctuation marks are optionally followed by a space, if they occur inside a paragraph.

-

A paragraph and or a section typically begins with [U+A9CB JAVANESE PADA ADEG ADEG]. This punctutation is also used before other short runs of text, such as subtitles, list items, etc.

-

[U+A9CA JAVANESE PADA ADEG] is a general divider. See also quotations.

+

A paragraph and or a section typically begins with A9CB. This punctutation is also used before other short runs of text, such as subtitles, list items, etc.

+

A9CA is a general divider. See also quotations.

@@ -2815,9 +2816,9 @@

Letters

Letters may begin with ꧋꧆꧋ if the writer doesn't want to indicate a distinction regarding age or rank between themselves and the reader. Otherwise, for more formal letters, they can choose one of three alternatives provided as single characters in the Javanese Unicode block.

The end of a letter can be signaled using ꧉꧆꧉ This combination may also involve just ꧆꧉or may be repeated with spaces between to fill the linee, eg.
@@ -2830,7 +2831,7 @@

Letters

Poetry

ꦆ␣ꦕ␣ꦖ␣ꦟ␣ꦢ␣ꦧ␣ꦿ␣꧀␣꧅␣꧉

In poetry ꧅ꦧ꧀ꦖ꧅ or ꧅ꦧ꧀ꦕ꧅ (purwapada) introduces a poem; ꧅ꦟ꧀ꦢꦿ꧅ (madyapada) introduces a new song within a poem; and ꧅ꦆ꧅ (wasanapada) indicates the end of a poem.

-

Optionally, [U+A9C9 JAVANESE PADA LUNGSI] can be added to the above with some space around it. The spaces should be non-breaking, since there should be no line-breaks between the constituent partse, eg.
+

Optionally, A9C9 can be added to the above with some space around it. The spaces should be non-breaking, since there should be no line-breaks between the constituent partse, eg.
꧅ ꧉ ꦧ꧀ꦖ ꧉ ꧅

@@ -2865,18 +2866,18 @@

Bracketed text

standard -

[U+A9CA JAVANESE PADA ADEG]

-

[U+A9CA JAVANESE PADA ADEG]

+

A9CA

+

A9CA

alternative -

[U+A9CC JAVANESE PADA PISELEH]

-

[U+A9CD JAVANESE TURNED PADA PISELEH]

+

A9CC

+

A9CD

-

Typically a pair of [U+A9CA JAVANESE PADA ADEG] characters are used.

-

Alternatively, the pair of characters [U+A9CC JAVANESE PADA PISELEH] and [U+A9CD JAVANESE TURNED PADA PISELEH] may be used.

+

Typically a pair of A9CA characters are used.

+

Alternatively, the pair of characters A9CC and A9CD may be used.

@@ -2887,7 +2888,7 @@

Quotations & citations

꧊␣꧌␣꧍
-

As for parentheses, Javanese text may use [U+A9CA JAVANESE PADA ADEG] for quotation marks.

+

As for parentheses, Javanese text may use A9CA for quotation marks.

@@ -2900,17 +2901,17 @@

Quotations & citations

- - + + - - + +
top level

[U+A9CA JAVANESE PADA ADEG]

[U+A9CA JAVANESE PADA ADEG]

A9CA

A9CA

alternative

[U+A9CC JAVANESE PADA PISELEH]

[U+A9CD JAVANESE TURNED PADA PISELEH]

A9CC

A9CD

-

Alternatively, the pair of characters [U+A9CC JAVANESE PADA PISELEH] and [U+A9CD JAVANESE TURNED PADA PISELEH] may be used.

+

Alternatively, the pair of characters A9CC and A9CD may be used.

@@ -2922,8 +2923,8 @@

Emphasis

꧊␣꧌␣꧍
-

To draw attention to text Javanese may use the same characters as are used for parentheses and quotations, ie. a paired set of [U+A9CA JAVANESE PADA ADEG] characters around the relevant text, or the two characters [U+A9CC JAVANESE PADA PISELEH] and [U+A9CD JAVANESE TURNED PADA PISELEH] can be used similarly.

-

Sometimes just [U+A9CC JAVANESE PADA PISELEH] is repeated.

+

To draw attention to text Javanese may use the same characters as are used for parentheses and quotations, ie. a paired set of A9CA characters around the relevant text, or the two characters A9CC and A9CD can be used similarly.

+

Sometimes just A9CC is repeated.

@@ -2939,7 +2940,7 @@

Abbreviation, ellipsis & repetition

Abbreviation

-

According to Everson [U+A9C8 JAVANESE PADA LINGSA] is used for acronyms, eg. ꧈ꦢꦺ꧈ꦲ꧈ꦌꦭ꧀꧈

+

According to Everson A9C8 is used for acronyms, eg. ꧈ꦢꦺ꧈ꦲ꧈ꦌꦭ꧀꧈

It is also used after initials in a name,e eg. ꦫ꧈ꦩ꧈ꦯꦸꦭꦂꦠ

@@ -2948,9 +2949,9 @@

Abbreviation

Repetition

-

A repeated syllable can be represented by [U+A9CF JAVANESE PANGRANGKEP], which is derived from the arabic-indic digit for 2, eg. ꦧꦸꦏꦸꧏ buku-buku books It can be transcribed as buku2.

+

A repeated syllable can be represented by A9CF, which is derived from the arabic-indic digit for 2, eg. ꦧꦸꦏꦸꧏ buku-buku books It can be transcribed as buku2.

-

For 'ditto' marks in vertical lists, Javanese uses [U+A9C9 JAVANESE PADA LUNGSI] .

+

For 'ditto' marks in vertical lists, Javanese uses A9C9 .

@@ -2965,8 +2966,8 @@

Inline notes & annotations

Correction marks

-

According to Wikipedia [U+A9DF JAVANESE PADA ISEN-ISEN] is used in handwriting to indicate a correction in Yogyakarta, eg. where a scribe wanted to write pada luhur but actually wrote pada wu.. they would use this mark as follows: ꦥꦢꦮꦸ꧟꧟꧟ꦭꦸꦲꦸꦂ

-

In Yogyakarta they would use the character [U+A9DE JAVANESE PADA TIRTA TUMETES] instead.

+

According to Wikipedia A9DF is used in handwriting to indicate a correction in Yogyakarta, eg. where a scribe wanted to write pada luhur but actually wrote pada wu.. they would use this mark as follows: ꦥꦢꦮꦸ꧟꧟꧟ꦭꦸꦲꦸꦂ

+

In Yogyakarta they would use the character A9DE instead.

@@ -3009,7 +3010,7 @@

Line breaking & hyphenation

Because there are no spaces between words, and because the end of one word and the beginning of another often form conjuncts (see fig_pangandika), Javanese doesn't wrap at word boundaries. Instead, it wraps at syllable boundaries where no conjuncts are involved.

-

Unfortunately, modern browsers are often unable to detect appropriate break points for Javanese. In the sample text at the beginning of this page ZWSP [U+200B ZERO WIDTH SPACE] is used at places where the line could be broken. Otherwise, the line would continue, unbroken off the right side of the page.

+

Unfortunately, modern browsers are often unable to detect appropriate break points for Javanese. In the sample text at the beginning of this page 200B is used at places where the line could be broken. Otherwise, the line would continue, unbroken off the right side of the page.

Hyphenation, per se, is not used. See an interesting discussion about Javanese & Balinese line-breaking on GitHub.

@@ -3019,7 +3020,7 @@

Line breaking & hyphenation

Taling duplication

-

In some materials, when a new line begins with [U+A9BA JAVANESE VOWEL SIGN TALING], an additional spacing taling is placed at the end of the previous line.

+

In some materials, when a new line begins with A9BA, an additional spacing taling is placed at the end of the previous line.

ꦲꦸꦠꦮꦶꦲꦶꦏꦁꦏꦊꦉꦱ꧀ꦏꦺ
 ꦮꦺꦴꦤ꧀꧈