-
Notifications
You must be signed in to change notification settings - Fork 385
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Reconsider Shift-JIS-based normalizing
On Windows Mozc has had several nomarization rules to avoid possible compatibility and interoperability reasons. With this CL, we basically remove those special rules except for the following two normalization rules: - 0x301C WAVE DASH -> 0xFF5E FULLWIDTH TILDE - 0x2212 MINUS SIGN -> 0xFF0D FULLWIDTH HYPHEN MINUS Here are some examples of behavior changes. Case A: 1. Type "えん" 2. Hit space key to trigger conversion. 3. Choose "¥ [半] 円記号 <機種依存文字>" Current behavior: U+005C is always committed. New behavior: U+00A5 is committed if it's U+00A5 (Compatible with MS-IME on Windows 10) Case B: 1. Type "U+00A5" 2. Hit space key to trigger conversion. 3. Choose "¥ [半] 円記号 <機種依存文字>" Current behavior: U+005C is always committed. New behavior: U+00A5 is committed. Case C: 1. Type "たてぼう" 2. Hit space key to trigger conversion. 3. Choose "‖ [全] 縦線" Current behavior: U+2225 is always committed. (MS-IME on Windows 10 only shows this) New behavior: Both U+2016 and U+2225 can be committed. Case D: 1. Type "うえ" 2. Hit space key to trigger conversion. 3. Choose "‾ [全] オーバライン" Current behavior: U+007E is always committed. New behavior: U+203E is committed. (Compatible with MS-IME on Windows 10 if you choose "‾ [環境依存]") Note that MS-IME on Windows 10 also converts "おーばーらいん" to "‾". Case E: 1. Select "¢" (U+00A2) 2. Hit 変換 key to trigger reconversion. 3. Choose the top candidate. Current behavior: U+FFE0 is always committed. New behavior: U+00A2 is committed. (Compatible with MS-IME behavior) Case F: 1. Select "£" (U+00A3) 2. Hit 変換 key to trigger reconversion. 3. Choose the top candidate. Current behavior: U+FFE2 is always committed. New behavior: U+00A3 is committed. (Compatible with MS-IME behavior) Case G: 1. Select "¬" (U+00AC) 2. Hit 変換 key to trigger reconversion. 3. Choose the top candidate. Current behavior: U+FFE2 is always committed. New behavior: U+00AC is committed. (Compatible with MS-IME behavior) Case H: 1. Type "vaiorin" in Romaji mode Current behavior: "ヴぁいおりん" (Compatible with MS-IME behavior on Windows 10) New behavior: "ゔぁいおりん" BUG= TEST= REF_BUG=26674144 REF_CL=113128015,113129243,113232446 REF_TIME=2016-01-27T14:50:45+09:00 REF_TIME_RAW=1453873845 +0900
- Loading branch information
1 parent
1a6e539
commit 26241b0
Showing
15 changed files
with
90 additions
and
258 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.