LoginSignup
2
2

More than 5 years have passed since last update.

Sorting Japanese Words in Indexes (??)

Last updated at Posted at 2014-06-02

Hiragana and katakana (1)

The very normal forms and their order

The following table shows the very normal form of hiragana and katakana, which are Japanese syllables and used for indicating pronounce of kanji.

Order Norm. Unicode point(Name)
1 Ux3042 (HIRAGANA LETTER A)
2 Ux3044 (HIRAGANA LETTER I)
3 Ux3046 (HIRAGANA LETTER U)
4 Ux3048 (HIRAGANA LETTER E)
5 Ux304A (HIRAGANA LETTER O)
6 Ux304B (HIRAGANA LETTER KA)
7 Ux304D (HIRAGANA LETTER KI)
8 Ux304F (HIRAGANA LETTER KU)
9 Ux3051 (HIRAGANA LETTER KE)
10 Ux3053 (HIRAGANA LETTER KO)
11 Ux3055 (HIRAGANA LETTER SA)
12 Ux3057 (HIRAGANA LETTER SI)
13 Ux3059 (HIRAGANA LETTER SU)
14 Ux305B (HIRAGANA LETTER SE)
15 Ux305D (HIRAGANA LETTER SO)
16 Ux305F (HIRAGANA LETTER TA)
17 Ux3061 (HIRAGANA LETTER TI)
18 Ux3064 (HIRAGANA LETTER TU)
19 Ux3066 (HIRAGANA LETTER TE)
20 Ux3068 (HIRAGANA LETTER TO)
21 Ux306A (HIRAGANA LETTER NA)
22 Ux306B (HIRAGANA LETTER NI)
23 Ux306C (HIRAGANA LETTER NU)
24 Ux306D (HIRAGANA LETTER NE)
25 Ux306E (HIRAGANA LETTER NO)
26 Ux306F (HIRAGANA LETTER HA)
27 Ux3072 (HIRAGANA LETTER HI)
28 Ux3075 (HIRAGANA LETTER HU)
29 Ux3078 (HIRAGANA LETTER HE)
30 Ux307B (HIRAGANA LETTER HO)
31 Ux307E (HIRAGANA LETTER MA)
32 Ux307F (HIRAGANA LETTER MI)
33 Ux3080 (HIRAGANA LETTER MU)
34 Ux3081 (HIRAGANA LETTER ME)
35 Ux3082 (HIRAGANA LETTER MO)
36 Ux3084 (HIRAGANA LETTER YA)
37 Ux3086 (HIRAGANA LETTER YU)
38 Ux3088 (HIRAGANA LETTER YO)
39 Ux3089 (HIRAGANA LETTER RA)
40 Ux308A (HIRAGANA LETTER RI)
41 Ux308B (HIRAGANA LETTER RU)
42 Ux308C (HIRAGANA LETTER RE)
43 Ux308D (HIRAGANA LETTER RO)
44 Ux308F (HIRAGANA LETTER WA)
45 Ux3090 (HIRAGANA LETTER WI)
46 Ux3091 (HIRAGANA LETTER WE)
47 Ux3092 (HIRAGANA LETTER WO)
48 Ux3093 (HIRAGANA LETTER N)
49 Ux309D (HIRAGANA ITERATION MARK)
50 Ux30FC (KATAKANA-HIRAGANA PROLONGED SOUND MARK)

The above does not cover the all Hiragana and katakana. We will write down the all characters and correspondence to the very normal form.

All hiragana and katakana characters categorized by the very normal form

In this section, we will establish the subsections for each very normal form.

Ux3042 (HIRAGANA LETTER A)

Char. Unicode point (Name)
Ux3041 (HIRAGANA LETTER SMALL A)
Ux30A1 (KATAKANA LETTER SMALL A)
UxFF67 (HALFWIDTH KATAKANA LETTER SMALL A)
Ux3042 (HIRAGANA LETTER A)
Ux30A2 (KATAKANA LETTER A)
UxFF71 (HALFWIDTH KATAKANA LETTER A)

Ux3044 (HIRAGANA LETTER I)

Char. Unicode point (Name)
Ux3043 (HIRAGANA LETTER SMALL I)
Ux30A3 (KATAKANA LETTER SMALL I)
UxFF68 (HALFWIDTH KATAKANA LETTER SMALL I)
Ux3044 (HIRAGANA LETTER I)
Ux30A4 (KATAKANA LETTER I)
UxFF72 (HALFWIDTH KATAKANA LETTER I)

Ux3046 (HIRAGANA LETTER U)

Traditionally, we had not used ゔ (Ux3094, HIRAGANA LETTER VU), but Unicode has defined the letter. Currently, we could prepare the next table.

Char. Unicode point (Name)
Ux3045 (HIRAGANA LETTER SMALL U)
Ux30A5 (KATAKANA LETTER SMALL U)
UxFF69 (HALFWIDTH KATAKANA LETTER SMALL U)
Ux3046 (HIRAGANA LETTER U)
Ux30A6 (KATAKANA LETTER U)
UxFF73 (HALFWIDTH KATAKANA LETTER U)
Ux3094 (HIRAGANA LETTER VU)
ゔ Ux3046Ux3099 (HIRAGANA LETTER UCOMBINING KATAKANA-HIRAGANA VOICED SOUND MARK)
Ux30F4 (KATAKANA LETTER VU)
ヴ Ux30A6Ux3099 (KATAKANA LETTER UCOMBINING KATAKANA-HIRAGANA VOICED SOUND MARK)
ヴ UxFF73UxFF9E (HALFWIDTH KATAKANA LETTER UHALFWIDTH KATAKANA VOICED SOUND MARK)

(to be continued ...)

2
2
0

Register as a new user and use Qiita more conveniently

  1. You get articles that match your needs
  2. You can efficiently read back useful information
  3. You can use dark theme
What you can do with signing up
2
2