zˤ consonant. Sometimes used for writing Berber sounds.lpz
`,
@@ -6162,6 +6178,8 @@
'\u{2018}': `
‘
+
+
Closing quotation mark. Unlike parentheses, the glyph of this character is not mirrored in right-to-left text.
`,
@@ -6171,6 +6189,8 @@
'\u{2019}': `
’
+
+
Opening quotation mark. Unlike parentheses, the glyph of this character is not mirrored in right-to-left text.
`,
@@ -6180,6 +6200,8 @@
'\u{201C}': `
“
+
+
Closing quotation mark. Unlike parentheses, the glyph of this character is not mirrored in right-to-left text.
`,
@@ -6189,6 +6211,8 @@
'\u{201D}': `
”
+
+
Opening quotation mark. Unlike parentheses, the glyph of this character is not mirrored in right-to-left text.
`,
@@ -6198,12 +6222,393 @@
'\u{2014}': `
—
+
+
Em dash.
+`,
+
+
+
+'\u{0021}': `
+
!
+
+
Exclamation mark.
+`,
+
+
+
+
+
+'\u{0025}': `
+
%
+
+
Percentage mark.
+`,
+
+
+
+
+
+'\u{0028}': `
+
(
+
+
Opeining parenthesis.
+
+
The words 'left' and 'right' in the Unicode names for parentheses, brackets, and other paired characters should be ignored. LEFT should be read as if it said START, and RIGHT as END. The direction in which the glyphs point will be automatically determined according to the base direction of the text.
+`,
+
+
+
+
+
+'\u{0029}': `
+
)
+
+
Closing parenthesis.
+
+
The words 'left' and 'right' in the Unicode names for parentheses, brackets, and other paired characters should be ignored. LEFT should be read as if it said START, and RIGHT as END. The direction in which the glyphs point will be automatically determined according to the base direction of the text.
+`,
+
+
+
+
+
+'\u{002E}': `
+
.
+
+
Full stop.
+`,
+
+
+
+
+
+'\u{0030}': `
+
0
+
+
Digit.
+`,
+
+
+
+
+
+'\u{0031}': `
+
1
+
+
Digit.
+`,
+
+
+
+
+
+'\u{0032}': `
+
2
+
+
Digit.
+`,
+
+
+
+
+
+'\u{0033}': `
+
3
+
+
Digit.
+`,
+
+
+
+
+
+'\u{0034}': `
+
4
+
+
Digit.
+`,
+
+
+
+
+
+'\u{0035}': `
+
5
+
+
Digit.
+`,
+
+
+
+
+
+'\u{0036}': `
+
6
+
+
Digit.
+`,
+
+
+
+
+
+'\u{0037}': `
+
7
+
+
Digit.
+`,
+
+
+
+
+
+'\u{0038}': `
+
8
+
+
Digit.
+`,
+
+
+
+
+
+'\u{0039}': `
+
9
+
+
Digit.
`,
+'\u{003A}': `
+
:
+
+
Colon.
+`,
+
+
+
+
+
+'\u{00AB}': `
+
«
+
+
Opening quotation mark.
+
+
The words 'left' and 'right' in the Unicode names for parentheses, brackets, and other paired characters should be ignored. LEFT should be read as if it said START, and RIGHT as END. The direction in which the glyphs point will be automatically determined according to the base direction of the text.
+`,
+
+
+
+
+
+'\u{00BB}': `
+
»
+
+
Closing quotation mark.
+
+
The words 'left' and 'right' in the Unicode names for parentheses, brackets, and other paired characters should be ignored. LEFT should be read as if it said START, and RIGHT as END. The direction in which the glyphs point will be automatically determined according to the base direction of the text.
+`,
+
+
+
+
+
+'\u{034F}': `
+
͏
+
+
Combining grapheme joiner.
+
+
Used to produce special ordering of diacritics. The name is a misnomer, as it is generally used to break the normal sequence of diacritics.
An invisible character, that prevents two adjacent letters forming a visual connection with each other when rendered. Especially useful for educational illustrations, but also has real-world applications.
+
+
It is used to interrupt the shaping of joining glyphs in cursive scripts, and also used to manage the visual interactions of glyphs in other scripts, eg. to prevent the formation of conjuncts, position diacritics, etc.
An invisible character, that permits a letter to form a cursive connection without a visible neighbour. Especially useful for educational illustrations, but also has some real-world applications.
+
+
Also used with complex scripts to manage the visual representation of glyphs that normally interact, eg. to form conjuncts, position diacritics, etc.
An invisible character with strong LTR directional properties that can be used to produce the correct ordering of text, especially where there is a risk of spillover effects while the Unicode Bidirectional Algorithm is at work.
+
+
Generally referred to as LRM.
+`,
+
+
+
+
+// RLM
+'\u{200F}': `
+
An invisible character with strong RTL directional properties that can be used to produce the correct ordering of text, especially where there is a risk of spillover effects while the Unicode Bidirectional Algorithm is at work.
+
+
Generally referred to as RLM.
+`,
+
+
+
+
+// LRE
+'\u{202A}': `
+
Sets the start point for a range of inline text when applying a base direction of left-to-right. The range is terminated by 202C (PDF).
+
+
Use 2066 (LRI) rather than this character.
+`,
+
+// RLE
+'\u{202B}': `
+
Sets the start point for a range of inline text when applying a base direction of right-to-left. The range is terminated by 202C (PDF).
+
+
Use 2067 (RLI) rather than this character.
+`,
+
+// PDF
+'\u{202C}': `
+
Sets the end point for a range of inline text when applying a base direction. The range is started with either 202A (LRE) or 202B (RLE).
+
+
Use 2069 (PDI) and its associated range starters rather than this character.
+`,
+
+
+
+
+
+// LRI
+'\u{2066}': `
+
Sets the start point for a range of inline text when applying a base direction of left-to-right, and isolates the text within that range from text outside it. The isolation prevents unintended spill-over effects when the text is reordered by the Unicode Bidirectional Algorithm. The range is terminated by 2069 (PDI).
+
+
This character should be used rather than 202A (LRE).
+`,
+
+
+// RLI
+'\u{2067}': `
+
Sets the start point for a range of inline text when applying a base direction of right-to-left, and isolates the text within that range from text outside it. The isolation prevents unintended spill-over effects when the text is reordered by the Unicode Bidirectional Algorithm. The range is terminated by 2069 (PDI).
+
+
This character should be used rather than 202B (RLE).
+`,
+
+// FSI
+'\u{2068}': `
+
Sets the start point for a range of inline text when applying a base direction, and isolates the text within that range from text outside it. The base direction set is determined by that of the first strong directional character in the range. The isolation prevents unintended spill-over effects when the text is reordered by the Unicode Bidirectional Algorithm. The range is terminated by 2069 (PDI).
+`,
+
+// PDI
+'\u{2069}': `
+
Sets the end point for a range of inline text when applying a base direction. The range is started with either 2066 (LRI), 2066 (RLI) or 2068 (FSI).
+
+
This character should be used rather than 202C (PDF).
Left-joining glyphs are commonly called initial; dual-joining are called medial; and right-joining are called final. Glyphs that don't join on either side are called isolated. However, these glyph shapes can be found in various places within a single word.
-
Word-initial characters usually have initial glyph shapes (eg. 064A ). However, characters that only join to the right will use an isolated glyph shape (eg. 062F ).
+
Word-initial characters usually have initial glyph shapes (eg. 064A ). However, characters that only join to the right will use an isolated glyph shape (eg. 062F ).
Furthermore, words beginning with a vowel are always preceded by a vowel carrier, which is normally ا
-(eg. 0627 06CC or 0627 064E ).
+(eg. 0627 06CC or 0627 064E ).
Word-medial characters will typically join on both sides
-(eg. 064A ) but those that only join to the right will use a final glyph (eg. 062F ).
-However, if either of those is preceded by another character that only joins to the right, the glyph shapes rendered will be initial (eg. 064A )
-and isolated (eg. 062F ), respectively.
+(eg. 064A ) but those that only join to the right will use a final glyph (eg. 062F ).
+However, if either of those is preceded by another character that only joins to the right, the glyph shapes rendered will be initial (eg. 064A )
+and isolated (eg. 062F ), respectively.
-
Word-final characters will typically use a final glyph shape (eg. 064A and 062F ).
-However, if the previous character joins only to the right, they will use isolated glyph shapes (eg.064A and 062F ).
+
Word-final characters will typically use a final glyph shape (eg. 064A and 062F ).
+However, if the previous character joins only to the right, they will use isolated glyph shapes (eg.064A and 062F ).
In all this contextual glyph shaping the basic shapes used for a character can vary significantly in a script like Arabic. This also includes some characters that only have ijam dots in certain contexts.
@@ -1287,9 +1287,11 @@
Vowel sounds to characters
-
This section maps Modern Standard Arabic vowel sounds to common graphemes in the Arabic orthography. Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc.
+
This section maps Modern Standard Arabic vowel sounds to common graphemes in the Arabic orthography.
-
The columns run right to left and indicate typical word-initial, word-medial, and word-final usage. The joining forms shown are illustrative; alternative shapes may occur (see joining_forms). They are also fully-vowelled, although the examples show normal unvowelled usage as well as vowelled.
+
The entries show typical word-initial, word-medial, and word-final usage. The joining forms shown are illustrative; alternative shapes may occur (see joining_forms). They are also fully-vowelled, although the examples show normal unvowelled usage as well as vowelled.
+
+
Sounds listed as 'infrequent' are allophones, or sounds used for foreign words, etc. Light coloured characters occur infrequently.