LoginSignup
7
8

More than 3 years have passed since last update.

JavaScriptで扱えるUnicodeプロパティ一覧

Last updated at Posted at 2019-04-11

JavaScriptで扱えるUnicodeプロパティ一覧

ES2018版

Binary

\p{LoneUnicodePropertyNameOrValue}形式で指定する。

以下は、Binaryプロパティの名前の一覧。

Short Name Long Name Alian Description
AHex ASCII_Hex_Digit
ASCII
Alpha Alphabetic
Any
Assigned
Bidi_C Bidi_Control
Bidi_M Bidi_Mirrored
CE Composition_Exclusion 未実装
CI Case_Ignorable
Cased
CWCF Changes_When_Casefolded
CWCM Changes_When_Casemapped
CWL Changes_When_Lowercased
CWKCF Changes_When_NFKC_Casefolded
CWT Changes_When_Titlecased
CWU Changes_When_Uppercased
Dash
DI Default_Ignorable_Code_Point
Dep Deprecated
Dia Diacritic 発音区別符号
EComp Emoji_Component
EBase Emoji_Modifier_Base
EMod Emoji_Modifier
EPres Emoji_Presentation
Emoji
Ext Extender
Gr_Base Grapheme_Base 書記素
Gr_Ext Grapheme_Extend 書記素
Gr_Link Grapheme_Link 未実装
Hex Hex_Digit
Hyphen 未実装
IDC ID_Continue
IDSB IDS_Binary_Operator
IDST IDS_Trinary_Operator
IDS ID_Start
Ideo Ideographic 表意文字
Join_C Join_Control
LOE Logical_Order_Exception
Lower Lowercase gc=Llとは登録されている文字が異なる
Math
NChar Noncharacter_Code_Point
OAlpha Other_Alphabetic 未実装
ODI Other_Default_Ignorable_Code_Point 未実装
OGr_Ext Other_Grapheme_Extend 未実装
OIDC Other_ID_Continue 未実装
OIDS Other_ID_Start 未実装
OLower Other_Lowercase 未実装
OMath Other_Math 未実装
OUpper Other_Uppercase 未実装
PCM Prepended_Concatenation_Mark 未実装
Pat_Syn Pattern_Syntax
Pat_WS Pattern_White_Space
QMark Quotation_Mark
Radical 語根
RI Regional_Indicator
STerm Sentence_Terminal
SD Soft_Dotted
STerm Sentence_Terminal
Term Terminal_Punctuation
UIdeo Unified_Ideograph
Upper Uppercase gc=Luとは登録されている文字が異なる
VS Variation_Selector
WSpace White_Space space
XIDC XID_Continue
XIDS XID_Start

General_Category(gc)

\p{UnicodePropertyName=UnicodePropertyValue}で指定する。
つまり
\p{General_Category=UnicodePropertyValue}
\p{gc=UnicodePropertyValue}
と指定する。
また、General_CategoryUnicodePropertyNameと省略できるので
\p{UnicodePropertyValue}
でも指定できる。

以下は、General_Category(gc)が取れうる値の一覧。

Short Value Long Value Alian Description
Cc Control cntrl
Cf Format
Cn Unassigned
Co Private_Use
Cs Surrogate
C Other Cc+Cf+Cn+Co+Cs
LC Cased_Letter Ll+Lt+Lu
Ll Lowercase_Letter
Lm Modifier_Letter
Lo Other_Letter
Lt Titlecase_Letter
Lu Uppercase_Letter
L Letter Ll+Lm+Lo+Lt+Lu
Mc Spacing_Mark
Me Enclosing_Mark
Mn Nonspacing_Mark
M Mark Combining_Mark Mc+Me+Mn
Nd Decimal_Number digit
Nl Letter_Number
No Other_Number
N Number Nd+Nl+No
Pc Connector_Punctuation
Pd Dash_Punctuation
Pe Close_Punctuation
Pf Final_Punctuation
Pi Initial_Punctuation
Po Other_Punctuation
Ps Open_Punctuation
P Punctuation punct Pc+Pd+Pe+Pf+Pi+Po+Ps
Sc Currency_Symbol
Sk Modifier_Symbol
Sm Math_Symbol
So Other_Symbol
S Symbol Sc+Sk+Sm+So
Zl Line_Separator
Zp Paragraph_Separator
Zs Space_Separator
Z Separator Zl+Zp+Zs

Script(sc) Script_Extensions(scx)

\p{UnicodePropertyName=UnicodePropertyValue}で指定する。
つまり
\p{Script=UnicodePropertyValue}
\p{sc=UnicodePropertyValue}
\p{Script_Extensions=UnicodePropertyValue}
\p{scx=UnicodePropertyValue}
で指定する。

以下は、Script(sc)およびScript_Extensions(scx)が取れうる値の一覧。

Short Value Long Value Alian Description
Adlm Adlam
Aghb Caucasian_Albanian
Ahom
Arab Arabic
Armi Imperial_Aramaic
Armn Armenian
Avst Avestan
Bali Balinese
Bamu Bamum
Bass Bassa_Vah
Batk Batak
Beng Bengali
Bhks Bhaiksuki
Bopo Bopomofo
Brah Brahmi
Brai Braille
Bugi Buginese
Buhd Buhid
Cakm Chakma
Cans Canadian_Aboriginal
Cari Carian
Cham
Cher Cherokee
Copt Coptic Qaac
Cprt Cypriot
Cyrl Cyrillic
Deva Devanagari
Dogr Dogra
Dsrt Deseret
Dupl Duployan
Egyp Egyptian_Hieroglyphs
Elba Elbasan
Elym Elymaic 未実装
Ethi Ethiopic
Geor Georgian
Glag Glagolitic
Gong Gunjala_Gondi
Gonm Masaram_Gondi
Goth Gothic
Gran Grantha
Grek Greek
Gujr Gujarati
Guru Gurmukhi
Hang Hangul
Hani Han 漢字
Hano Hanunoo
Hatr Hatran
Hebr Hebrew
Hira Hiragana 平仮名
Hluw Anatolian_Hieroglyphs
Hmng Pahawh_Hmong
Hmnp Nyiakeng_Puachue_Hmong 未実装
Hrkt Katakana_Or_Hiragana 未実装
Hung Old_Hungarian
Ital Old_Italic
Java Javanese
Kali Kayah_Li
Kana Katakana 片仮名
Khar Kharoshthi
Khmr Khmer
Khoj Khojki
Knda Kannada
Kthi Kaithi
Lana Tai_Tham
Laoo Lao
Latn Latin
Lepc Lepcha
Limb Limbu
Lina Linear_A
Linb Linear_B
Lisu
Lyci Lycian
Lydi Lydian
Mahj Mahajani
Maka Makasar
Mand Mandaic
Mani Manichaean
Marc Marchen
Medf Medefaidrin
Mend Mende_Kikakui
Merc Meroitic_Cursive
Mero Meroitic_Hieroglyphs
Mlym Malayalam
Modi
Mong Mongolian
Mroo Mro
Mtei Meetei_Mayek
Mult Multani
Mymr Myanmar
Nand Nandinagari 未実装
Narb Old_North_Arabian
Nbat Nabataean
Newa
Nkoo Nko
Nshu Nushu
Ogam Ogham
Olck Ol_Chiki
Orkh Old_Turkic
Orya Oriya
Osge Osage
Osma Osmanya
Palm Palmyrene
Pauc Pau_Cin_Hau
Perm Old_Permic
Phag Phags_Pa
Phli Inscriptional_Pahlavi
Phlp Psalter_Pahlavi
Phnx Phoenician
Plrd Miao
Prti Inscriptional_Parthian
Rjng Rejang
Rohg Hanifi_Rohingya
Runr Runic
Samr Samaritan
Sarb Old_South_Arabian
Saur Saurashtra
Sgnw SignWriting
Shaw Shavian
Shrd Sharada
Sidd Siddham
Sind Khudawadi
Sinh Sinhala
Sogd Sogdian
Sogo Old_Sogdian
Sora Sora_Sompeng
Soyo Soyombo
Sund Sundanese
Sylo Syloti_Nagri
Syrc Syriac
Tagb Tagbanwa
Takr Takri
Tale Tai_Le
Talu New_Tai_Lue
Taml Tamil
Tang Tangut
Tavt Tai_Viet
Telu Telugu
Tfng Tifinagh
Tglg Tagalog
Thaa Thaana
Thai
Tibt Tibetan
Tirh Tirhuta
Ugar Ugaritic
Vaii Vai
Wara Warang_Citi
Wcho Wancho 未実装
Xpeo Old_Persian
Xsux Cuneiform
Yiii Yi
Zanb Zanabazar_Square
Zinh Inherited Qaai
Zyyy Common
Zzzz Unknown
7
8
0

Register as a new user and use Qiita more conveniently

  1. You get articles that match your needs
  2. You can efficiently read back useful information
  3. You can use dark theme
What you can do with signing up
7
8