JavaScriptで扱えるUnicodeプロパティ一覧
ES2018版
Binary
\p{LoneUnicodePropertyNameOrValue}
形式で指定する。
以下は、Binaryプロパティの名前の一覧。
Short Name | Long Name | Alian | Description |
---|---|---|---|
AHex | ASCII_Hex_Digit | ||
ASCII | |||
Alpha | Alphabetic | ||
Any | |||
Assigned | |||
Bidi_C | Bidi_Control | ||
Bidi_M | Bidi_Mirrored | ||
CE | Composition_Exclusion | 未実装 | |
CI | Case_Ignorable | ||
Cased | |||
CWCF | Changes_When_Casefolded | ||
CWCM | Changes_When_Casemapped | ||
CWL | Changes_When_Lowercased | ||
CWKCF | Changes_When_NFKC_Casefolded | ||
CWT | Changes_When_Titlecased | ||
CWU | Changes_When_Uppercased | ||
Dash | |||
DI | Default_Ignorable_Code_Point | ||
Dep | Deprecated | ||
Dia | Diacritic | 発音区別符号 | |
EComp | Emoji_Component | ||
EBase | Emoji_Modifier_Base | ||
EMod | Emoji_Modifier | ||
EPres | Emoji_Presentation | ||
Emoji | |||
Ext | Extender | ||
Gr_Base | Grapheme_Base | 書記素 | |
Gr_Ext | Grapheme_Extend | 書記素 | |
Gr_Link | Grapheme_Link | 未実装 | |
Hex | Hex_Digit | ||
Hyphen | 未実装 | ||
IDC | ID_Continue | ||
IDSB | IDS_Binary_Operator | ||
IDST | IDS_Trinary_Operator | ||
IDS | ID_Start | ||
Ideo | Ideographic | 表意文字 | |
Join_C | Join_Control | ||
LOE | Logical_Order_Exception | ||
Lower | Lowercase | gc=Llとは登録されている文字が異なる | |
Math | |||
NChar | Noncharacter_Code_Point | ||
OAlpha | Other_Alphabetic | 未実装 | |
ODI | Other_Default_Ignorable_Code_Point | 未実装 | |
OGr_Ext | Other_Grapheme_Extend | 未実装 | |
OIDC | Other_ID_Continue | 未実装 | |
OIDS | Other_ID_Start | 未実装 | |
OLower | Other_Lowercase | 未実装 | |
OMath | Other_Math | 未実装 | |
OUpper | Other_Uppercase | 未実装 | |
PCM | Prepended_Concatenation_Mark | 未実装 | |
Pat_Syn | Pattern_Syntax | ||
Pat_WS | Pattern_White_Space | ||
QMark | Quotation_Mark | ||
Radical | 語根 | ||
RI | Regional_Indicator | ||
STerm | Sentence_Terminal | ||
SD | Soft_Dotted | ||
STerm | Sentence_Terminal | ||
Term | Terminal_Punctuation | ||
UIdeo | Unified_Ideograph | ||
Upper | Uppercase | gc=Luとは登録されている文字が異なる | |
VS | Variation_Selector | ||
WSpace | White_Space | space | |
XIDC | XID_Continue | ||
XIDS | XID_Start |
General_Category(gc)
\p{UnicodePropertyName=UnicodePropertyValue}
で指定する。
つまり
\p{General_Category=UnicodePropertyValue}
\p{gc=UnicodePropertyValue}
と指定する。
また、General_Category
はUnicodePropertyName
と省略できるので
\p{UnicodePropertyValue}
でも指定できる。
以下は、General_Category(gc)
が取れうる値の一覧。
Short Value | Long Value | Alian | Description |
---|---|---|---|
Cc | Control | cntrl | |
Cf | Format | ||
Cn | Unassigned | ||
Co | Private_Use | ||
Cs | Surrogate | ||
C | Other | Cc+Cf+Cn+Co+Cs | |
LC | Cased_Letter | Ll+Lt+Lu | |
Ll | Lowercase_Letter | ||
Lm | Modifier_Letter | ||
Lo | Other_Letter | ||
Lt | Titlecase_Letter | ||
Lu | Uppercase_Letter | ||
L | Letter | Ll+Lm+Lo+Lt+Lu | |
Mc | Spacing_Mark | ||
Me | Enclosing_Mark | ||
Mn | Nonspacing_Mark | ||
M | Mark | Combining_Mark | Mc+Me+Mn |
Nd | Decimal_Number | digit | |
Nl | Letter_Number | ||
No | Other_Number | ||
N | Number | Nd+Nl+No | |
Pc | Connector_Punctuation | ||
Pd | Dash_Punctuation | ||
Pe | Close_Punctuation | ||
Pf | Final_Punctuation | ||
Pi | Initial_Punctuation | ||
Po | Other_Punctuation | ||
Ps | Open_Punctuation | ||
P | Punctuation | punct | Pc+Pd+Pe+Pf+Pi+Po+Ps |
Sc | Currency_Symbol | ||
Sk | Modifier_Symbol | ||
Sm | Math_Symbol | ||
So | Other_Symbol | ||
S | Symbol | Sc+Sk+Sm+So | |
Zl | Line_Separator | ||
Zp | Paragraph_Separator | ||
Zs | Space_Separator | ||
Z | Separator | Zl+Zp+Zs |
Script(sc) Script_Extensions(scx)
\p{UnicodePropertyName=UnicodePropertyValue}
で指定する。
つまり
\p{Script=UnicodePropertyValue}
\p{sc=UnicodePropertyValue}
\p{Script_Extensions=UnicodePropertyValue}
\p{scx=UnicodePropertyValue}
で指定する。
以下は、Script(sc)
およびScript_Extensions(scx)
が取れうる値の一覧。
Short Value | Long Value | Alian | Description |
---|---|---|---|
Adlm | Adlam | ||
Aghb | Caucasian_Albanian | ||
Ahom | |||
Arab | Arabic | ||
Armi | Imperial_Aramaic | ||
Armn | Armenian | ||
Avst | Avestan | ||
Bali | Balinese | ||
Bamu | Bamum | ||
Bass | Bassa_Vah | ||
Batk | Batak | ||
Beng | Bengali | ||
Bhks | Bhaiksuki | ||
Bopo | Bopomofo | ||
Brah | Brahmi | ||
Brai | Braille | ||
Bugi | Buginese | ||
Buhd | Buhid | ||
Cakm | Chakma | ||
Cans | Canadian_Aboriginal | ||
Cari | Carian | ||
Cham | |||
Cher | Cherokee | ||
Copt | Coptic | Qaac | |
Cprt | Cypriot | ||
Cyrl | Cyrillic | ||
Deva | Devanagari | ||
Dogr | Dogra | ||
Dsrt | Deseret | ||
Dupl | Duployan | ||
Egyp | Egyptian_Hieroglyphs | ||
Elba | Elbasan | ||
Elym | Elymaic | 未実装 | |
Ethi | Ethiopic | ||
Geor | Georgian | ||
Glag | Glagolitic | ||
Gong | Gunjala_Gondi | ||
Gonm | Masaram_Gondi | ||
Goth | Gothic | ||
Gran | Grantha | ||
Grek | Greek | ||
Gujr | Gujarati | ||
Guru | Gurmukhi | ||
Hang | Hangul | ||
Hani | Han | 漢字 | |
Hano | Hanunoo | ||
Hatr | Hatran | ||
Hebr | Hebrew | ||
Hira | Hiragana | 平仮名 | |
Hluw | Anatolian_Hieroglyphs | ||
Hmng | Pahawh_Hmong | ||
Hmnp | Nyiakeng_Puachue_Hmong | 未実装 | |
Hrkt | Katakana_Or_Hiragana | 未実装 | |
Hung | Old_Hungarian | ||
Ital | Old_Italic | ||
Java | Javanese | ||
Kali | Kayah_Li | ||
Kana | Katakana | 片仮名 | |
Khar | Kharoshthi | ||
Khmr | Khmer | ||
Khoj | Khojki | ||
Knda | Kannada | ||
Kthi | Kaithi | ||
Lana | Tai_Tham | ||
Laoo | Lao | ||
Latn | Latin | ||
Lepc | Lepcha | ||
Limb | Limbu | ||
Lina | Linear_A | ||
Linb | Linear_B | ||
Lisu | |||
Lyci | Lycian | ||
Lydi | Lydian | ||
Mahj | Mahajani | ||
Maka | Makasar | ||
Mand | Mandaic | ||
Mani | Manichaean | ||
Marc | Marchen | ||
Medf | Medefaidrin | ||
Mend | Mende_Kikakui | ||
Merc | Meroitic_Cursive | ||
Mero | Meroitic_Hieroglyphs | ||
Mlym | Malayalam | ||
Modi | |||
Mong | Mongolian | ||
Mroo | Mro | ||
Mtei | Meetei_Mayek | ||
Mult | Multani | ||
Mymr | Myanmar | ||
Nand | Nandinagari | 未実装 | |
Narb | Old_North_Arabian | ||
Nbat | Nabataean | ||
Newa | |||
Nkoo | Nko | ||
Nshu | Nushu | ||
Ogam | Ogham | ||
Olck | Ol_Chiki | ||
Orkh | Old_Turkic | ||
Orya | Oriya | ||
Osge | Osage | ||
Osma | Osmanya | ||
Palm | Palmyrene | ||
Pauc | Pau_Cin_Hau | ||
Perm | Old_Permic | ||
Phag | Phags_Pa | ||
Phli | Inscriptional_Pahlavi | ||
Phlp | Psalter_Pahlavi | ||
Phnx | Phoenician | ||
Plrd | Miao | ||
Prti | Inscriptional_Parthian | ||
Rjng | Rejang | ||
Rohg | Hanifi_Rohingya | ||
Runr | Runic | ||
Samr | Samaritan | ||
Sarb | Old_South_Arabian | ||
Saur | Saurashtra | ||
Sgnw | SignWriting | ||
Shaw | Shavian | ||
Shrd | Sharada | ||
Sidd | Siddham | ||
Sind | Khudawadi | ||
Sinh | Sinhala | ||
Sogd | Sogdian | ||
Sogo | Old_Sogdian | ||
Sora | Sora_Sompeng | ||
Soyo | Soyombo | ||
Sund | Sundanese | ||
Sylo | Syloti_Nagri | ||
Syrc | Syriac | ||
Tagb | Tagbanwa | ||
Takr | Takri | ||
Tale | Tai_Le | ||
Talu | New_Tai_Lue | ||
Taml | Tamil | ||
Tang | Tangut | ||
Tavt | Tai_Viet | ||
Telu | Telugu | ||
Tfng | Tifinagh | ||
Tglg | Tagalog | ||
Thaa | Thaana | ||
Thai | |||
Tibt | Tibetan | ||
Tirh | Tirhuta | ||
Ugar | Ugaritic | ||
Vaii | Vai | ||
Wara | Warang_Citi | ||
Wcho | Wancho | 未実装 | |
Xpeo | Old_Persian | ||
Xsux | Cuneiform | ||
Yiii | Yi | ||
Zanb | Zanabazar_Square | ||
Zinh | Inherited | Qaai | |
Zyyy | Common | ||
Zzzz | Unknown |