List of Unicode properties available in JavaScript
ES2018 edition
Binary
Specify these in the form \p{LoneUnicodePropertyNameOrValue}.
The following table lists the names of the binary properties.
| Short Name | Long Name | Alias | Description |
|---|---|---|---|
| AHex | ASCII_Hex_Digit | ||
| ASCII | |||
| Alpha | Alphabetic | ||
| Any | |||
| Assigned | |||
| Bidi_C | Bidi_Control | ||
| Bidi_M | Bidi_Mirrored | ||
| CE | Composition_Exclusion | | Not implemented |
| CI | Case_Ignorable | ||
| Cased | |||
| CWCF | Changes_When_Casefolded | ||
| CWCM | Changes_When_Casemapped | ||
| CWL | Changes_When_Lowercased | ||
| CWKCF | Changes_When_NFKC_Casefolded | ||
| CWT | Changes_When_Titlecased | ||
| CWU | Changes_When_Uppercased | ||
| Dash | |||
| DI | Default_Ignorable_Code_Point | ||
| Dep | Deprecated | ||
| Dia | Diacritic | | Diacritical marks |
| EComp | Emoji_Component | ||
| EBase | Emoji_Modifier_Base | ||
| EMod | Emoji_Modifier | ||
| EPres | Emoji_Presentation | ||
| Emoji | |||
| Ext | Extender | ||
| Gr_Base | Grapheme_Base | | Grapheme |
| Gr_Ext | Grapheme_Extend | | Grapheme |
| Gr_Link | Grapheme_Link | | Not implemented |
| Hex | Hex_Digit | ||
| Hyphen | | | Not implemented |
| IDC | ID_Continue | ||
| IDSB | IDS_Binary_Operator | ||
| IDST | IDS_Trinary_Operator | ||
| IDS | ID_Start | ||
| Ideo | Ideographic | | Ideographs |
| Join_C | Join_Control | ||
| LOE | Logical_Order_Exception | ||
| Lower | Lowercase | | The set of matching characters differs from gc=Ll |
| Math | |||
| NChar | Noncharacter_Code_Point | ||
| OAlpha | Other_Alphabetic | | Not implemented |
| ODI | Other_Default_Ignorable_Code_Point | | Not implemented |
| OGr_Ext | Other_Grapheme_Extend | | Not implemented |
| OIDC | Other_ID_Continue | | Not implemented |
| OIDS | Other_ID_Start | | Not implemented |
| OLower | Other_Lowercase | | Not implemented |
| OMath | Other_Math | | Not implemented |
| OUpper | Other_Uppercase | | Not implemented |
| PCM | Prepended_Concatenation_Mark | | Not implemented |
| Pat_Syn | Pattern_Syntax | ||
| Pat_WS | Pattern_White_Space | ||
| QMark | Quotation_Mark | ||
| Radical | | | Radicals (CJK radical characters) |
| RI | Regional_Indicator | ||
| STerm | Sentence_Terminal | ||
| SD | Soft_Dotted | ||
| Term | Terminal_Punctuation | ||
| UIdeo | Unified_Ideograph | ||
| Upper | Uppercase | | The set of matching characters differs from gc=Lu |
| VS | Variation_Selector | ||
| WSpace | White_Space | space | |
| XIDC | XID_Continue | ||
| XIDS | XID_Start | ||
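
As a rough illustration of the table above, the following sketch tests a few characters against binary properties. The helper names (`isAlphabetic`, `isAlphaShort`, `ordinal`) are made up for this example, and it assumes an engine that supports property escapes with the `u` flag.

```javascript
// Binary properties are written as \p{Name}; the u flag is required.
// Long and short names are interchangeable.
const isAlphabetic = (ch) => /^\p{Alphabetic}$/u.test(ch);
const isAlphaShort = (ch) => /^\p{Alpha}$/u.test(ch);

console.log(isAlphabetic("a"), isAlphaShort("a")); // true true
console.log(isAlphabetic("あ")); // true
console.log(isAlphabetic("1")); // false

// Lowercase (binary property) is not the same set as gc=Ll.
// U+00AA FEMININE ORDINAL INDICATOR has Lowercase=Yes but gc=Lo.
const ordinal = "\u00AA";
console.log(/^\p{Lowercase}$/u.test(ordinal)); // true
console.log(/^\p{gc=Ll}$/u.test(ordinal)); // false

// White_Space can also be written with its alias "space".
console.log(/^\p{White_Space}$/u.test("\u3000")); // true
console.log(/^\p{space}$/u.test("\u3000")); // true
```

Properties marked "Not implemented" in the table are expected to raise a SyntaxError when used in a pattern, so they are left out of this sketch.
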
General_Category(gc)
Specify these in the form \p{UnicodePropertyName=UnicodePropertyValue}.
That is, write either
\p{General_Category=UnicodePropertyValue}
\p{gc=UnicodePropertyValue}
Because the property name General_Category (together with the =) may be omitted, the value alone is also accepted:
\p{UnicodePropertyValue}
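
A minimal sketch of the equivalent notations, using Uppercase_Letter (Lu) as the value. The names `gcForms` and `matchesAllForms` are hypothetical and exist only for this example.

```javascript
// Four spellings of the same General_Category test.
const gcForms = [
  /^\p{General_Category=Uppercase_Letter}$/u, // long name, long value
  /^\p{gc=Lu}$/u,                             // short name, short value
  /^\p{Uppercase_Letter}$/u,                  // name omitted, long value
  /^\p{Lu}$/u,                                // name omitted, short value
];

// Returns true only when every spelling matches the character.
const matchesAllForms = (ch) => gcForms.every((re) => re.test(ch));

console.log(matchesAllForms("A")); // true
console.log(matchesAllForms("a")); // false
```
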
The following table lists the values that General_Category (gc) can take.
| Short Value | Long Value | Alias | Description |
|---|---|---|---|
| Cc | Control | cntrl | |
| Cf | Format | ||
| Cn | Unassigned | ||
| Co | Private_Use | ||
| Cs | Surrogate | ||
| C | Other | | Cc+Cf+Cn+Co+Cs |
| LC | Cased_Letter | | Ll+Lt+Lu |
| Ll | Lowercase_Letter | ||
| Lm | Modifier_Letter | ||
| Lo | Other_Letter | ||
| Lt | Titlecase_Letter | ||
| Lu | Uppercase_Letter | ||
| L | Letter | | Ll+Lm+Lo+Lt+Lu |
| Mc | Spacing_Mark | ||
| Me | Enclosing_Mark | ||
| Mn | Nonspacing_Mark | ||
| M | Mark | Combining_Mark | Mc+Me+Mn |
| Nd | Decimal_Number | digit | |
| Nl | Letter_Number | ||
| No | Other_Number | ||
| N | Number | | Nd+Nl+No |
| Pc | Connector_Punctuation | ||
| Pd | Dash_Punctuation | ||
| Pe | Close_Punctuation | ||
| Pf | Final_Punctuation | ||
| Pi | Initial_Punctuation | ||
| Po | Other_Punctuation | ||
| Ps | Open_Punctuation | ||
| P | Punctuation | punct | Pc+Pd+Pe+Pf+Pi+Po+Ps |
| Sc | Currency_Symbol | ||
| Sk | Modifier_Symbol | ||
| Sm | Math_Symbol | ||
| So | Other_Symbol | ||
| S | Symbol | | Sc+Sk+Sm+So |
| Zl | Line_Separator | ||
| Zp | Paragraph_Separator | ||
| Zs | Space_Separator | ||
| Z | Separator | | Zl+Zp+Zs |
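
The single-letter values in the table are unions of their two-letter subcategories. The sketch below checks this for L on a handful of sample characters; `letter`, `letterUnion`, `gcSamples`, and `unionAgrees` are names chosen for this illustration, not part of any API.

```javascript
// \p{L} should behave like the union Ll+Lm+Lo+Lt+Lu.
const letter = /^\p{L}$/u;
const letterUnion = /^[\p{Ll}\p{Lm}\p{Lo}\p{Lt}\p{Lu}]$/u;

// Samples: Ll, Lu, Lo, Lo, Lt (U+01C5), Lm (U+02B0), then non-letters.
const gcSamples = ["a", "A", "あ", "漢", "\u01C5", "\u02B0", "1", "!", " "];

const unionAgrees = gcSamples.every(
  (ch) => letter.test(ch) === letterUnion.test(ch)
);
console.log(unionAgrees); // true

// Extracting decimal digits (Nd) from mixed text.
console.log("2018年".match(/\p{Nd}/gu).join("")); // "2018"
```
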
Script(sc) Script_Extensions(scx)
Specify these in the form \p{UnicodePropertyName=UnicodePropertyValue}.
That is, use one of the following:
\p{Script=UnicodePropertyValue}
\p{sc=UnicodePropertyValue}
\p{Script_Extensions=UnicodePropertyValue}
\p{scx=UnicodePropertyValue}
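
The following sketch splits a mixed Japanese string into runs by script and then shows one case where sc and scx differ. The sample string and the variable names (`mixed`, `hiraRuns`, `kanaRuns`, `hanRuns`, `latinRuns`, `longMark`) are assumptions made for this example.

```javascript
const mixed = "カタカナとひらがなと漢字ABC";

// Collect runs of consecutive characters belonging to one script.
const hiraRuns = mixed.match(/\p{Script=Hiragana}+/gu);
const kanaRuns = mixed.match(/\p{sc=Kana}+/gu);
const hanRuns = mixed.match(/\p{sc=Han}+/gu);
const latinRuns = mixed.match(/\p{sc=Latn}+/gu);

console.log(hiraRuns); // ["とひらがなと"]
console.log(kanaRuns); // ["カタカナ"]
console.log(hanRuns); // ["漢字"]
console.log(latinRuns); // ["ABC"]

// U+30FC (prolonged sound mark) has Script=Common, but its
// Script_Extensions include both Hiragana and Katakana.
const longMark = "\u30FC";
console.log(/^\p{sc=Hira}$/u.test(longMark)); // false
console.log(/^\p{sc=Zyyy}$/u.test(longMark)); // true
console.log(/^\p{scx=Hira}$/u.test(longMark)); // true
console.log(/^\p{scx=Kana}$/u.test(longMark)); // true
```

When matching whole words such as "コーヒー", scx is usually the more forgiving choice, since shared marks like U+30FC are not assigned to a single script under sc.
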
The following table lists the values that Script (sc) and Script_Extensions (scx) can take.
| Short Value | Long Value | Alias | Description |
|---|---|---|---|
| Adlm | Adlam | ||
| Aghb | Caucasian_Albanian | ||
| Ahom | |||
| Arab | Arabic | ||
| Armi | Imperial_Aramaic | ||
| Armn | Armenian | ||
| Avst | Avestan | ||
| Bali | Balinese | ||
| Bamu | Bamum | ||
| Bass | Bassa_Vah | ||
| Batk | Batak | ||
| Beng | Bengali | ||
| Bhks | Bhaiksuki | ||
| Bopo | Bopomofo | ||
| Brah | Brahmi | ||
| Brai | Braille | ||
| Bugi | Buginese | ||
| Buhd | Buhid | ||
| Cakm | Chakma | ||
| Cans | Canadian_Aboriginal | ||
| Cari | Carian | ||
| Cham | |||
| Cher | Cherokee | ||
| Copt | Coptic | Qaac | |
| Cprt | Cypriot | ||
| Cyrl | Cyrillic | ||
| Deva | Devanagari | ||
| Dogr | Dogra | ||
| Dsrt | Deseret | ||
| Dupl | Duployan | ||
| Egyp | Egyptian_Hieroglyphs | ||
| Elba | Elbasan | ||
| Elym | Elymaic | | Not implemented |
| Ethi | Ethiopic | ||
| Geor | Georgian | ||
| Glag | Glagolitic | ||
| Gong | Gunjala_Gondi | ||
| Gonm | Masaram_Gondi | ||
| Goth | Gothic | ||
| Gran | Grantha | ||
| Grek | Greek | ||
| Gujr | Gujarati | ||
| Guru | Gurmukhi | ||
| Hang | Hangul | ||
| Hani | Han | | Kanji (漢字) |
| Hano | Hanunoo | ||
| Hatr | Hatran | ||
| Hebr | Hebrew | ||
| Hira | Hiragana | | Hiragana (平仮名) |
| Hluw | Anatolian_Hieroglyphs | ||
| Hmng | Pahawh_Hmong | ||
| Hmnp | Nyiakeng_Puachue_Hmong | | Not implemented |
| Hrkt | Katakana_Or_Hiragana | | Not implemented |
| Hung | Old_Hungarian | ||
| Ital | Old_Italic | ||
| Java | Javanese | ||
| Kali | Kayah_Li | ||
| Kana | Katakana | | Katakana (片仮名) |
| Khar | Kharoshthi | ||
| Khmr | Khmer | ||
| Khoj | Khojki | ||
| Knda | Kannada | ||
| Kthi | Kaithi | ||
| Lana | Tai_Tham | ||
| Laoo | Lao | ||
| Latn | Latin | ||
| Lepc | Lepcha | ||
| Limb | Limbu | ||
| Lina | Linear_A | ||
| Linb | Linear_B | ||
| Lisu | |||
| Lyci | Lycian | ||
| Lydi | Lydian | ||
| Mahj | Mahajani | ||
| Maka | Makasar | ||
| Mand | Mandaic | ||
| Mani | Manichaean | ||
| Marc | Marchen | ||
| Medf | Medefaidrin | ||
| Mend | Mende_Kikakui | ||
| Merc | Meroitic_Cursive | ||
| Mero | Meroitic_Hieroglyphs | ||
| Mlym | Malayalam | ||
| Modi | |||
| Mong | Mongolian | ||
| Mroo | Mro | ||
| Mtei | Meetei_Mayek | ||
| Mult | Multani | ||
| Mymr | Myanmar | ||
| Nand | Nandinagari | | Not implemented |
| Narb | Old_North_Arabian | ||
| Nbat | Nabataean | ||
| Newa | |||
| Nkoo | Nko | ||
| Nshu | Nushu | ||
| Ogam | Ogham | ||
| Olck | Ol_Chiki | ||
| Orkh | Old_Turkic | ||
| Orya | Oriya | ||
| Osge | Osage | ||
| Osma | Osmanya | ||
| Palm | Palmyrene | ||
| Pauc | Pau_Cin_Hau | ||
| Perm | Old_Permic | ||
| Phag | Phags_Pa | ||
| Phli | Inscriptional_Pahlavi | ||
| Phlp | Psalter_Pahlavi | ||
| Phnx | Phoenician | ||
| Plrd | Miao | ||
| Prti | Inscriptional_Parthian | ||
| Rjng | Rejang | ||
| Rohg | Hanifi_Rohingya | ||
| Runr | Runic | ||
| Samr | Samaritan | ||
| Sarb | Old_South_Arabian | ||
| Saur | Saurashtra | ||
| Sgnw | SignWriting | ||
| Shaw | Shavian | ||
| Shrd | Sharada | ||
| Sidd | Siddham | ||
| Sind | Khudawadi | ||
| Sinh | Sinhala | ||
| Sogd | Sogdian | ||
| Sogo | Old_Sogdian | ||
| Sora | Sora_Sompeng | ||
| Soyo | Soyombo | ||
| Sund | Sundanese | ||
| Sylo | Syloti_Nagri | ||
| Syrc | Syriac | ||
| Tagb | Tagbanwa | ||
| Takr | Takri | ||
| Tale | Tai_Le | ||
| Talu | New_Tai_Lue | ||
| Taml | Tamil | ||
| Tang | Tangut | ||
| Tavt | Tai_Viet | ||
| Telu | Telugu | ||
| Tfng | Tifinagh | ||
| Tglg | Tagalog | ||
| Thaa | Thaana | ||
| Thai | |||
| Tibt | Tibetan | ||
| Tirh | Tirhuta | ||
| Ugar | Ugaritic | ||
| Vaii | Vai | ||
| Wara | Warang_Citi | ||
| Wcho | Wancho | | Not implemented |
| Xpeo | Old_Persian | ||
| Xsux | Cuneiform | ||
| Yiii | Yi | ||
| Zanb | Zanabazar_Square | ||
| Zinh | Inherited | Qaai | |
| Zyyy | Common | ||
| Zzzz | Unknown | ||