UScript
public final class UScript
extends Object
java.lang.Object
   ↳ android.icu.lang.UScript
Constants for ISO 15924 script codes, and related functions.
The current set of script code constants supports at least all scripts that are encoded in the version of Unicode which ICU currently supports. The names of the constants are usually derived from the Unicode script property value aliases. See UAX #24 Unicode Script Property (http://www.unicode.org/reports/tr24/) and http://www.unicode.org/Public/UCD/latest/ucd/PropertyValueAliases.txt .
In addition, constants for many ISO 15924 script codes are included, for use with language tags, CLDR data, and similar. Some of those codes are not used in the Unicode Character Database (UCD). For example, there are no characters that have a UCD script property value of Hans or Hant. All Han ideographs have the Hani script property value in Unicode.
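The point about Hans, Hant, and Hani can be checked on a plain JVM. The sketch below uses the JDK's `java.lang.Character.UnicodeScript` as a stand-in for `UScript`, an assumption made only so the example runs without ICU. The class name `HanScriptDemo` is hypothetical. It looks up one simplified and one traditional ideograph and returns the script each one carries.

```java
// Stand-in sketch: uses the JDK's Character.UnicodeScript, not android.icu.lang.UScript.
final class HanScriptDemo {
    /** Returns the JDK script name for a code point, for example "HAN". */
    static String scriptOf(int codePoint) {
        return Character.UnicodeScript.of(codePoint).name();
    }

    public static void main(String[] args) {
        int simplified = 0x56FD;   // simplified form of the ideograph for "country"
        int traditional = 0x570B;  // traditional form of the same ideograph
        // Both code points carry the same script property value (Han).
        System.out.println(scriptOf(simplified));
        System.out.println(scriptOf(traditional));
    }
}
```

Both lookups report the Han script, which is why Hans and Hant are useful in language tags but never appear as a character's script property.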
Private-use codes Qaaa..Qabx are not included, except as used in the UCD or in CLDR.
Starting with ICU 55, script codes are only added when their scripts have been or will certainly be encoded in Unicode, and have been assigned Unicode script property value aliases, to ensure that their script names are stable and match the names of the constants. Script codes like Latf and Aran that are not subject to separate encoding may be added at any time.
Summary
Constants | |
---|---|
int |
ADLAM
ISO 15924 script code |
int |
AFAKA
ISO 15924 script code |
int |
AHOM
ISO 15924 script code |
int |
ANATOLIAN_HIEROGLYPHS
ISO 15924 script code |
int |
ARABIC
Arabic |
int |
ARMENIAN
Armenian |
int |
AVESTAN
ISO 15924 script code |
int |
BALINESE
ISO 15924 script code |
int |
BAMUM
ISO 15924 script code |
int |
BASSA_VAH
ISO 15924 script code |
int |
BATAK
ISO 15924 script code |
int |
BENGALI
Bengali |
int |
BHAIKSUKI
ISO 15924 script code |
int |
BLISSYMBOLS
ISO 15924 script code |
int |
BOOK_PAHLAVI
ISO 15924 script code |
int |
BOPOMOFO
Bopomofo |
int |
BRAHMI
ISO 15924 script code |
int |
BRAILLE
Braille Script in Unicode 4 |
int |
BUGINESE
Script in Unicode 4.1 |
int |
BUHID
Buhid |
int |
CANADIAN_ABORIGINAL
Unified Canadian Aboriginal Syllabics |
int |
CARIAN
ISO 15924 script code |
int |
CAUCASIAN_ALBANIAN
ISO 15924 script code |
int |
CHAKMA
ISO 15924 script code |
int |
CHAM
ISO 15924 script code |
int |
CHEROKEE
Cherokee |
int |
CHORASMIAN
|
int |
CIRTH
ISO 15924 script code |
int |
COMMON
Common |
int |
COPTIC
Coptic |
int |
CUNEIFORM
ISO 15924 script code |
int |
CYPRIOT
Cypriot Script in Unicode 4 |
int |
CYPRO_MINOAN
|
int |
CYRILLIC
Cyrillic |
int |
DEMOTIC_EGYPTIAN
ISO 15924 script code |
int |
DESERET
Deseret |
int |
DEVANAGARI
Devanagari |
int |
DIVES_AKURU
|
int |
DOGRA
ISO 15924 script code |
int |
DUPLOYAN
ISO 15924 script code |
int |
EASTERN_SYRIAC
ISO 15924 script code |
int |
EGYPTIAN_HIEROGLYPHS
ISO 15924 script code |
int |
ELBASAN
ISO 15924 script code |
int |
ELYMAIC
|
int |
ESTRANGELO_SYRIAC
ISO 15924 script code |
int |
ETHIOPIC
Ethiopic |
int |
GEORGIAN
Georgian |
int |
GLAGOLITIC
Script in Unicode 4.1 |
int |
GOTHIC
Gothic |
int |
GRANTHA
ISO 15924 script code |
int |
GREEK
Greek |
int |
GUJARATI
Gujarati |
int |
GUNJALA_GONDI
|
int |
GURMUKHI
Gurmukhi |
int |
HAN
Han |
int |
HANGUL
Hangul |
int |
HANIFI_ROHINGYA
|
int |
HANUNOO
Hanunoo |
int |
HAN_WITH_BOPOMOFO
ISO 15924 script code |
int |
HARAPPAN_INDUS
ISO 15924 script code |
int |
HATRAN
ISO 15924 script code |
int |
HEBREW
Hebrew |
int |
HIERATIC_EGYPTIAN
ISO 15924 script code |
int |
HIRAGANA
Hiragana |
int |
IMPERIAL_ARAMAIC
ISO 15924 script code |
int |
INHERITED
Inherited |
int |
INSCRIPTIONAL_PAHLAVI
ISO 15924 script code |
int |
INSCRIPTIONAL_PARTHIAN
ISO 15924 script code |
int |
INVALID_CODE
Invalid code |
int |
JAMO
ISO 15924 script code |
int |
JAPANESE
ISO 15924 script code |
int |
JAVANESE
ISO 15924 script code |
int |
JURCHEN
ISO 15924 script code |
int |
KAITHI
ISO 15924 script code |
int |
KANNADA
Kannada |
int |
KATAKANA
Katakana |
int |
KATAKANA_OR_HIRAGANA
Script in Unicode 4.0.1 |
int |
KAWI
|
int |
KAYAH_LI
ISO 15924 script code |
int |
KHAROSHTHI
Script in Unicode 4.1 |
int |
KHITAN_SMALL_SCRIPT
|
int |
KHMER
Khmer |
int |
KHOJKI
ISO 15924 script code |
int |
KHUDAWADI
ISO 15924 script code |
int |
KHUTSURI
ISO 15924 script code |
int |
KOREAN
ISO 15924 script code |
int |
KPELLE
ISO 15924 script code |
int |
LANNA
ISO 15924 script code |
int |
LAO
Lao |
int |
LATIN
Latin |
int |
LATIN_FRAKTUR
ISO 15924 script code |
int |
LATIN_GAELIC
ISO 15924 script code |
int |
LEPCHA
ISO 15924 script code |
int |
LIMBU
Limbu Script in Unicode 4 |
int |
LINEAR_A
ISO 15924 script code |
int |
LINEAR_B
Linear B Script in Unicode 4 |
int |
LISU
ISO 15924 script code |
int |
LOMA
ISO 15924 script code |
int |
LYCIAN
ISO 15924 script code |
int |
LYDIAN
ISO 15924 script code |
int |
MAHAJANI
ISO 15924 script code |
int |
MAKASAR
|
int |
MALAYALAM
Malayalam |
int |
MANDAEAN
ISO 15924 script code |
int |
MANDAIC
ISO 15924 script code |
int |
MANICHAEAN
ISO 15924 script code |
int |
MARCHEN
ISO 15924 script code |
int |
MASARAM_GONDI
ISO 15924 script code |
int |
MATHEMATICAL_NOTATION
ISO 15924 script code |
int |
MAYAN_HIEROGLYPHS
ISO 15924 script code |
int |
MEDEFAIDRIN
|
int |
MEITEI_MAYEK
ISO 15924 script code |
int |
MENDE
Mende Kikakui ISO 15924 script code |
int |
MEROITIC
ISO 15924 script code |
int |
MEROITIC_CURSIVE
ISO 15924 script code |
int |
MEROITIC_HIEROGLYPHS
ISO 15924 script code |
int |
MIAO
ISO 15924 script code |
int |
MODI
ISO 15924 script code |
int |
MONGOLIAN
Mongolian |
int |
MOON
ISO 15924 script code |
int |
MRO
ISO 15924 script code |
int |
MULTANI
ISO 15924 script code |
int |
MYANMAR
Myanmar |
int |
NABATAEAN
ISO 15924 script code |
int |
NAG_MUNDARI
|
int |
NAKHI_GEBA
ISO 15924 script code |
int |
NANDINAGARI
|
int |
NEWA
ISO 15924 script code |
int |
NEW_TAI_LUE
Script in Unicode 4.1 |
int |
NKO
ISO 15924 script code |
int |
NUSHU
ISO 15924 script code |
int |
NYIAKENG_PUACHUE_HMONG
|
int |
OGHAM
Ogham |
int |
OLD_CHURCH_SLAVONIC_CYRILLIC
ISO 15924 script code |
int |
OLD_HUNGARIAN
ISO 15924 script code |
int |
OLD_ITALIC
Old Italic |
int |
OLD_NORTH_ARABIAN
ISO 15924 script code |
int |
OLD_PERMIC
ISO 15924 script code |
int |
OLD_PERSIAN
Script in Unicode 4.1 |
int |
OLD_SOGDIAN
|
int |
OLD_SOUTH_ARABIAN
ISO 15924 script code |
int |
OLD_UYGHUR
|
int |
OL_CHIKI
ISO 15924 script code |
int |
ORIYA
Oriya |
int |
ORKHON
ISO 15924 script code |
int |
OSAGE
ISO 15924 script code |
int |
OSMANYA
Osmanya Script in Unicode 4 |
int |
PAHAWH_HMONG
ISO 15924 script code |
int |
PALMYRENE
ISO 15924 script code |
int |
PAU_CIN_HAU
ISO 15924 script code |
int |
PHAGS_PA
ISO 15924 script code |
int |
PHOENICIAN
ISO 15924 script code |
int |
PHONETIC_POLLARD
ISO 15924 script code |
int |
PSALTER_PAHLAVI
ISO 15924 script code |
int |
REJANG
ISO 15924 script code |
int |
RONGORONGO
ISO 15924 script code |
int |
RUNIC
Runic |
int |
SAMARITAN
ISO 15924 script code |
int |
SARATI
ISO 15924 script code |
int |
SAURASHTRA
ISO 15924 script code |
int |
SHARADA
ISO 15924 script code |
int |
SHAVIAN
Shavian Script in Unicode 4 |
int |
SIDDHAM
ISO 15924 script code |
int |
SIGN_WRITING
ISO 15924 script code for Sutton SignWriting |
int |
SIMPLIFIED_HAN
ISO 15924 script code |
int |
SINDHI
ISO 15924 script code |
int |
SINHALA
Sinhala |
int |
SOGDIAN
|
int |
SORA_SOMPENG
ISO 15924 script code |
int |
SOYOMBO
ISO 15924 script code |
int |
SUNDANESE
ISO 15924 script code |
int |
SYLOTI_NAGRI
Script in Unicode 4.1 |
int |
SYMBOLS
ISO 15924 script code |
int |
SYMBOLS_EMOJI
ISO 15924 script code |
int |
SYRIAC
Syriac |
int |
TAGALOG
Tagalog |
int |
TAGBANWA
Tagbanwa |
int |
TAI_LE
Tai Le Script in Unicode 4 |
int |
TAI_VIET
ISO 15924 script code |
int |
TAKRI
ISO 15924 script code |
int |
TAMIL
Tamil |
int |
TANGSA
|
int |
TANGUT
ISO 15924 script code |
int |
TELUGU
Telugu |
int |
TENGWAR
ISO 15924 script code |
int |
THAANA
Thaana |
int |
THAI
Thai |
int |
TIBETAN
Tibetan |
int |
TIFINAGH
Script in Unicode 4.1 |
int |
TIRHUTA
ISO 15924 script code |
int |
TOTO
|
int |
TRADITIONAL_HAN
ISO 15924 script code |
int |
UCAS
Unified Canadian Aboriginal Syllabics (alias) |
int |
UGARITIC
Ugaritic Script in Unicode 4 |
int |
UNKNOWN
ISO 15924 script code |
int |
UNWRITTEN_LANGUAGES
ISO 15924 script code |
int |
VAI
ISO 15924 script code |
int |
VISIBLE_SPEECH
ISO 15924 script code |
int |
VITHKUQI
|
int |
WANCHO
|
int |
WARANG_CITI
ISO 15924 script code |
int |
WESTERN_SYRIAC
ISO 15924 script code |
int |
WOLEAI
ISO 15924 script code |
int |
YEZIDI
|
int |
YI
Yi syllables |
int |
ZANABAZAR_SQUARE
ISO 15924 script code |
Public methods | |
---|---|
static
boolean
|
breaksBetweenLetters(int script)
Returns true if the script allows line breaks between letters (excluding hyphenation). |
static
int[]
|
getCode(ULocale locale)
Gets the script codes associated with the given locale or ISO 15924 abbreviation or name. |
static
int[]
|
getCode(String nameOrAbbrOrLocale)
Gets the script codes associated with the given locale or ISO 15924 abbreviation or name. |
static
int[]
|
getCode(Locale locale)
Gets the script codes associated with the given locale or ISO 15924 abbreviation or name. |
static
int
|
getCodeFromName(String nameOrAbbr)
Returns the script code associated with the given Unicode script property alias (name or abbreviation). |
static
String
|
getName(int scriptCode)
Returns the long Unicode script name, if there is one. |
static
String
|
getSampleString(int script)
Returns the script sample character string. |
static
int
|
getScript(int codepoint)
Gets the script code associated with the given codepoint. |
static
int
|
getScriptExtensions(int c, BitSet set)
Sets code point c's Script_Extensions as script code integers into the output BitSet. |
static
String
|
getShortName(int scriptCode)
Returns the 4-letter ISO 15924 script code, which is the same as the short Unicode script name if Unicode has names for the script. |
static
UScript.ScriptUsage
|
getUsage(int script)
Returns the script usage according to UAX #31 Unicode Identifier and Pattern Syntax. |
static
boolean
|
hasScript(int c, int sc)
Do the Script_Extensions of code point c contain script sc? If c does not have explicit Script_Extensions, then this tests whether c has the Script property value sc. |
static
boolean
|
isCased(int script)
Returns true if case distinctions are customary in modern (or most recent) usage of the script. |
static
boolean
|
isRightToLeft(int script)
Returns true if the script is written right-to-left. |
Constants
ADLAM
public static final int ADLAM
ISO 15924 script code
Constant Value: 167 (0x000000a7)
AFAKA
public static final int AFAKA
ISO 15924 script code
Constant Value: 147 (0x00000093)
AHOM
public static final int AHOM
ISO 15924 script code
Constant Value: 161 (0x000000a1)
ANATOLIAN_HIEROGLYPHS
public static final int ANATOLIAN_HIEROGLYPHS
ISO 15924 script code
Constant Value: 156 (0x0000009c)
ARMENIAN
public static final int ARMENIAN
Armenian
Constant Value: 3 (0x00000003)
AVESTAN
public static final int AVESTAN
ISO 15924 script code
Constant Value: 117 (0x00000075)
BALINESE
public static final int BALINESE
ISO 15924 script code
Constant Value: 62 (0x0000003e)
BAMUM
public static final int BAMUM
ISO 15924 script code
Constant Value: 130 (0x00000082)
BASSA_VAH
public static final int BASSA_VAH
ISO 15924 script code
Constant Value: 134 (0x00000086)
BATAK
public static final int BATAK
ISO 15924 script code
Constant Value: 63 (0x0000003f)
BENGALI
public static final int BENGALI
Bengali
Constant Value: 4 (0x00000004)
BHAIKSUKI
public static final int BHAIKSUKI
ISO 15924 script code
Constant Value: 168 (0x000000a8)
BLISSYMBOLS
public static final int BLISSYMBOLS
ISO 15924 script code
Constant Value: 64 (0x00000040)
BOOK_PAHLAVI
public static final int BOOK_PAHLAVI
ISO 15924 script code
Constant Value: 124 (0x0000007c)
BOPOMOFO
public static final int BOPOMOFO
Bopomofo
Constant Value: 5 (0x00000005)
BRAHMI
public static final int BRAHMI
ISO 15924 script code
Constant Value: 65 (0x00000041)
BRAILLE
public static final int BRAILLE
Braille Script in Unicode 4
Constant Value: 46 (0x0000002e)
BUGINESE
public static final int BUGINESE
Script in Unicode 4.1
Constant Value: 55 (0x00000037)
CANADIAN_ABORIGINAL
public static final int CANADIAN_ABORIGINAL
Unified Canadian Aboriginal Syllabics
Constant Value: 40 (0x00000028)
CARIAN
public static final int CARIAN
ISO 15924 script code
Constant Value: 104 (0x00000068)
CAUCASIAN_ALBANIAN
public static final int CAUCASIAN_ALBANIAN
ISO 15924 script code
Constant Value: 159 (0x0000009f)
CHAKMA
public static final int CHAKMA
ISO 15924 script code
Constant Value: 118 (0x00000076)
CHAM
public static final int CHAM
ISO 15924 script code
Constant Value: 66 (0x00000042)
CHEROKEE
public static final int CHEROKEE
Cherokee
Constant Value: 6 (0x00000006)
CHORASMIAN
public static final int CHORASMIAN
Constant Value: 189 (0x000000bd)
CIRTH
public static final int CIRTH
ISO 15924 script code
Constant Value: 67 (0x00000043)
CUNEIFORM
public static final int CUNEIFORM
ISO 15924 script code
Constant Value: 101 (0x00000065)
CYPRIOT
public static final int CYPRIOT
Cypriot Script in Unicode 4
Constant Value: 47 (0x0000002f)
CYPRO_MINOAN
public static final int CYPRO_MINOAN
Constant Value: 193 (0x000000c1)
CYRILLIC
public static final int CYRILLIC
Cyrillic
Constant Value: 8 (0x00000008)
DEMOTIC_EGYPTIAN
public static final int DEMOTIC_EGYPTIAN
ISO 15924 script code
Constant Value: 69 (0x00000045)
DESERET
public static final int DESERET
Deseret
Constant Value: 9 (0x00000009)
DEVANAGARI
public static final int DEVANAGARI
Devanagari
Constant Value: 10 (0x0000000a)
DIVES_AKURU
public static final int DIVES_AKURU
Constant Value: 190 (0x000000be)
DOGRA
public static final int DOGRA
ISO 15924 script code
Constant Value: 178 (0x000000b2)
DUPLOYAN
public static final int DUPLOYAN
ISO 15924 script code
Constant Value: 135 (0x00000087)
EASTERN_SYRIAC
public static final int EASTERN_SYRIAC
ISO 15924 script code
Constant Value: 97 (0x00000061)
EGYPTIAN_HIEROGLYPHS
public static final int EGYPTIAN_HIEROGLYPHS
ISO 15924 script code
Constant Value: 71 (0x00000047)
ELBASAN
public static final int ELBASAN
ISO 15924 script code
Constant Value: 136 (0x00000088)
ESTRANGELO_SYRIAC
public static final int ESTRANGELO_SYRIAC
ISO 15924 script code
Constant Value: 95 (0x0000005f)
ETHIOPIC
public static final int ETHIOPIC
Ethiopic
Constant Value: 11 (0x0000000b)
GEORGIAN
public static final int GEORGIAN
Georgian
Constant Value: 12 (0x0000000c)
GLAGOLITIC
public static final int GLAGOLITIC
Script in Unicode 4.1
Constant Value: 56 (0x00000038)
GRANTHA
public static final int GRANTHA
ISO 15924 script code
Constant Value: 137 (0x00000089)
GUJARATI
public static final int GUJARATI
Gujarati
Constant Value: 15 (0x0000000f)
GUNJALA_GONDI
public static final int GUNJALA_GONDI
Constant Value: 179 (0x000000b3)
GURMUKHI
public static final int GURMUKHI
Gurmukhi
Constant Value: 16 (0x00000010)
HANIFI_ROHINGYA
public static final int HANIFI_ROHINGYA
Constant Value: 182 (0x000000b6)
HANUNOO
public static final int HANUNOO
Hanunoo
Constant Value: 43 (0x0000002b)
HAN_WITH_BOPOMOFO
public static final int HAN_WITH_BOPOMOFO
ISO 15924 script code
Constant Value: 172 (0x000000ac)
HARAPPAN_INDUS
public static final int HARAPPAN_INDUS
ISO 15924 script code
Constant Value: 77 (0x0000004d)
HATRAN
public static final int HATRAN
ISO 15924 script code
Constant Value: 162 (0x000000a2)
HIERATIC_EGYPTIAN
public static final int HIERATIC_EGYPTIAN
ISO 15924 script code
Constant Value: 70 (0x00000046)
HIRAGANA
public static final int HIRAGANA
Hiragana
Constant Value: 20 (0x00000014)
IMPERIAL_ARAMAIC
public static final int IMPERIAL_ARAMAIC
ISO 15924 script code
Constant Value: 116 (0x00000074)
INHERITED
public static final int INHERITED
Inherited
Constant Value: 1 (0x00000001)
INSCRIPTIONAL_PAHLAVI
public static final int INSCRIPTIONAL_PAHLAVI
ISO 15924 script code
Constant Value: 122 (0x0000007a)
INSCRIPTIONAL_PARTHIAN
public static final int INSCRIPTIONAL_PARTHIAN
ISO 15924 script code
Constant Value: 125 (0x0000007d)
INVALID_CODE
public static final int INVALID_CODE
Invalid code
Constant Value: -1 (0xffffffff)
JAMO
public static final int JAMO
ISO 15924 script code
Constant Value: 173 (0x000000ad)
JAPANESE
public static final int JAPANESE
ISO 15924 script code
Constant Value: 105 (0x00000069)
JAVANESE
public static final int JAVANESE
ISO 15924 script code
Constant Value: 78 (0x0000004e)
JURCHEN
public static final int JURCHEN
ISO 15924 script code
Constant Value: 148 (0x00000094)
KAITHI
public static final int KAITHI
ISO 15924 script code
Constant Value: 120 (0x00000078)
KANNADA
public static final int KANNADA
Kannada
Constant Value: 21 (0x00000015)
KATAKANA
public static final int KATAKANA
Katakana
Constant Value: 22 (0x00000016)
KATAKANA_OR_HIRAGANA
public static final int KATAKANA_OR_HIRAGANA
Script in Unicode 4.0.1
Constant Value: 54 (0x00000036)
KAYAH_LI
public static final int KAYAH_LI
ISO 15924 script code
Constant Value: 79 (0x0000004f)
KHAROSHTHI
public static final int KHAROSHTHI
Script in Unicode 4.1
Constant Value: 57 (0x00000039)
KHITAN_SMALL_SCRIPT
public static final int KHITAN_SMALL_SCRIPT
Constant Value: 191 (0x000000bf)
KHOJKI
public static final int KHOJKI
ISO 15924 script code
Constant Value: 157 (0x0000009d)
KHUDAWADI
public static final int KHUDAWADI
ISO 15924 script code
Constant Value: 145 (0x00000091)
KHUTSURI
public static final int KHUTSURI
ISO 15924 script code
Constant Value: 72 (0x00000048)
KOREAN
public static final int KOREAN
ISO 15924 script code
Constant Value: 119 (0x00000077)
KPELLE
public static final int KPELLE
ISO 15924 script code
Constant Value: 138 (0x0000008a)
LANNA
public static final int LANNA
ISO 15924 script code
Constant Value: 106 (0x0000006a)
LATIN_FRAKTUR
public static final int LATIN_FRAKTUR
ISO 15924 script code
Constant Value: 80 (0x00000050)
LATIN_GAELIC
public static final int LATIN_GAELIC
ISO 15924 script code
Constant Value: 81 (0x00000051)
LEPCHA
public static final int LEPCHA
ISO 15924 script code
Constant Value: 82 (0x00000052)
LIMBU
public static final int LIMBU
Limbu Script in Unicode 4
Constant Value: 48 (0x00000030)
LINEAR_A
public static final int LINEAR_A
ISO 15924 script code
Constant Value: 83 (0x00000053)
LINEAR_B
public static final int LINEAR_B
Linear B Script in Unicode 4
Constant Value: 49 (0x00000031)
LISU
public static final int LISU
ISO 15924 script code
Constant Value: 131 (0x00000083)
LOMA
public static final int LOMA
ISO 15924 script code
Constant Value: 139 (0x0000008b)
LYCIAN
public static final int LYCIAN
ISO 15924 script code
Constant Value: 107 (0x0000006b)
LYDIAN
public static final int LYDIAN
ISO 15924 script code
Constant Value: 108 (0x0000006c)
MAHAJANI
public static final int MAHAJANI
ISO 15924 script code
Constant Value: 160 (0x000000a0)
MALAYALAM
public static final int MALAYALAM
Malayalam
Constant Value: 26 (0x0000001a)
MANDAEAN
public static final int MANDAEAN
ISO 15924 script code
Constant Value: 84 (0x00000054)
MANDAIC
public static final int MANDAIC
ISO 15924 script code
Constant Value: 84 (0x00000054)
MANICHAEAN
public static final int MANICHAEAN
ISO 15924 script code
Constant Value: 121 (0x00000079)
MARCHEN
public static final int MARCHEN
ISO 15924 script code
Constant Value: 169 (0x000000a9)
MASARAM_GONDI
public static final int MASARAM_GONDI
ISO 15924 script code
Constant Value: 175 (0x000000af)
MATHEMATICAL_NOTATION
public static final int MATHEMATICAL_NOTATION
ISO 15924 script code
Constant Value: 128 (0x00000080)
MAYAN_HIEROGLYPHS
public static final int MAYAN_HIEROGLYPHS
ISO 15924 script code
Constant Value: 85 (0x00000055)
MEDEFAIDRIN
public static final int MEDEFAIDRIN
Constant Value: 181 (0x000000b5)
MEITEI_MAYEK
public static final int MEITEI_MAYEK
ISO 15924 script code
Constant Value: 115 (0x00000073)
MENDE
public static final int MENDE
Mende Kikakui ISO 15924 script code
Constant Value: 140 (0x0000008c)
MEROITIC
public static final int MEROITIC
ISO 15924 script code
Constant Value: 86 (0x00000056)
MEROITIC_CURSIVE
public static final int MEROITIC_CURSIVE
ISO 15924 script code
Constant Value: 141 (0x0000008d)
MEROITIC_HIEROGLYPHS
public static final int MEROITIC_HIEROGLYPHS
ISO 15924 script code
Constant Value: 86 (0x00000056)
MIAO
public static final int MIAO
ISO 15924 script code
Constant Value: 92 (0x0000005c)
MODI
public static final int MODI
ISO 15924 script code
Constant Value: 163 (0x000000a3)
MONGOLIAN
public static final int MONGOLIAN
Mongolian
Constant Value: 27 (0x0000001b)
MOON
public static final int MOON
ISO 15924 script code
Constant Value: 114 (0x00000072)
MRO
public static final int MRO
ISO 15924 script code
Constant Value: 149 (0x00000095)
MULTANI
public static final int MULTANI
ISO 15924 script code
Constant Value: 164 (0x000000a4)
MYANMAR
public static final int MYANMAR
Myanmar
Constant Value: 28 (0x0000001c)
NABATAEAN
public static final int NABATAEAN
ISO 15924 script code
Constant Value: 143 (0x0000008f)
NAG_MUNDARI
public static final int NAG_MUNDARI
Constant Value: 199 (0x000000c7)
NAKHI_GEBA
public static final int NAKHI_GEBA
ISO 15924 script code
Constant Value: 132 (0x00000084)
NANDINAGARI
public static final int NANDINAGARI
Constant Value: 187 (0x000000bb)
NEWA
public static final int NEWA
ISO 15924 script code
Constant Value: 170 (0x000000aa)
NEW_TAI_LUE
public static final int NEW_TAI_LUE
Script in Unicode 4.1
Constant Value: 59 (0x0000003b)
NKO
public static final int NKO
ISO 15924 script code
Constant Value: 87 (0x00000057)
NUSHU
public static final int NUSHU
ISO 15924 script code
Constant Value: 150 (0x00000096)
NYIAKENG_PUACHUE_HMONG
public static final int NYIAKENG_PUACHUE_HMONG
Constant Value: 186 (0x000000ba)
OLD_CHURCH_SLAVONIC_CYRILLIC
public static final int OLD_CHURCH_SLAVONIC_CYRILLIC
ISO 15924 script code
Constant Value: 68 (0x00000044)
OLD_HUNGARIAN
public static final int OLD_HUNGARIAN
ISO 15924 script code
Constant Value: 76 (0x0000004c)
OLD_ITALIC
public static final int OLD_ITALIC
Old Italic
Constant Value: 30 (0x0000001e)
OLD_NORTH_ARABIAN
public static final int OLD_NORTH_ARABIAN
ISO 15924 script code
Constant Value: 142 (0x0000008e)
OLD_PERMIC
public static final int OLD_PERMIC
ISO 15924 script code
Constant Value: 89 (0x00000059)
OLD_PERSIAN
public static final int OLD_PERSIAN
Script in Unicode 4.1
Constant Value: 61 (0x0000003d)
OLD_SOGDIAN
public static final int OLD_SOGDIAN
Constant Value: 184 (0x000000b8)
OLD_SOUTH_ARABIAN
public static final int OLD_SOUTH_ARABIAN
ISO 15924 script code
Constant Value: 133 (0x00000085)
OLD_UYGHUR
public static final int OLD_UYGHUR
Constant Value: 194 (0x000000c2)
OL_CHIKI
public static final int OL_CHIKI
ISO 15924 script code
Constant Value: 109 (0x0000006d)
ORKHON
public static final int ORKHON
ISO 15924 script code
Constant Value: 88 (0x00000058)
OSAGE
public static final int OSAGE
ISO 15924 script code
Constant Value: 171 (0x000000ab)
OSMANYA
public static final int OSMANYA
Osmanya Script in Unicode 4
Constant Value: 50 (0x00000032)
PAHAWH_HMONG
public static final int PAHAWH_HMONG
ISO 15924 script code
Constant Value: 75 (0x0000004b)
PALMYRENE
public static final int PALMYRENE
ISO 15924 script code
Constant Value: 144 (0x00000090)
PAU_CIN_HAU
public static final int PAU_CIN_HAU
ISO 15924 script code
Constant Value: 165 (0x000000a5)
PHAGS_PA
public static final int PHAGS_PA
ISO 15924 script code
Constant Value: 90 (0x0000005a)
PHOENICIAN
public static final int PHOENICIAN
ISO 15924 script code
Constant Value: 91 (0x0000005b)
PHONETIC_POLLARD
public static final int PHONETIC_POLLARD
ISO 15924 script code
Constant Value: 92 (0x0000005c)
PSALTER_PAHLAVI
public static final int PSALTER_PAHLAVI
ISO 15924 script code
Constant Value: 123 (0x0000007b)
REJANG
public static final int REJANG
ISO 15924 script code
Constant Value: 110 (0x0000006e)
RONGORONGO
public static final int RONGORONGO
ISO 15924 script code
Constant Value: 93 (0x0000005d)
SAMARITAN
public static final int SAMARITAN
ISO 15924 script code
Constant Value: 126 (0x0000007e)
SARATI
public static final int SARATI
ISO 15924 script code
Constant Value: 94 (0x0000005e)
SAURASHTRA
public static final int SAURASHTRA
ISO 15924 script code
Constant Value: 111 (0x0000006f)
SHARADA
public static final int SHARADA
ISO 15924 script code
Constant Value: 151 (0x00000097)
SHAVIAN
public static final int SHAVIAN
Shavian Script in Unicode 4
Constant Value: 51 (0x00000033)
SIDDHAM
public static final int SIDDHAM
ISO 15924 script code
Constant Value: 166 (0x000000a6)
SIGN_WRITING
public static final int SIGN_WRITING
ISO 15924 script code for Sutton SignWriting
Constant Value: 112 (0x00000070)
SIMPLIFIED_HAN
public static final int SIMPLIFIED_HAN
ISO 15924 script code
Constant Value: 73 (0x00000049)
SINDHI
public static final int SINDHI
ISO 15924 script code
Constant Value: 145 (0x00000091)
SINHALA
public static final int SINHALA
Sinhala
Constant Value: 33 (0x00000021)
SORA_SOMPENG
public static final int SORA_SOMPENG
ISO 15924 script code
Constant Value: 152 (0x00000098)
SOYOMBO
public static final int SOYOMBO
ISO 15924 script code
Constant Value: 176 (0x000000b0)
SUNDANESE
public static final int SUNDANESE
ISO 15924 script code
Constant Value: 113 (0x00000071)
SYLOTI_NAGRI
public static final int SYLOTI_NAGRI
Script in Unicode 4.1
Constant Value: 58 (0x0000003a)
SYMBOLS
public static final int SYMBOLS
ISO 15924 script code
Constant Value: 129 (0x00000081)
SYMBOLS_EMOJI
public static final int SYMBOLS_EMOJI
ISO 15924 script code
Constant Value: 174 (0x000000ae)
TAGALOG
public static final int TAGALOG
Tagalog
Constant Value: 42 (0x0000002a)
TAGBANWA
public static final int TAGBANWA
Tagbanwa
Constant Value: 45 (0x0000002d)
TAI_LE
public static final int TAI_LE
Tai Le Script in Unicode 4
Constant Value: 52 (0x00000034)
TAI_VIET
public static final int TAI_VIET
ISO 15924 script code
Constant Value: 127 (0x0000007f)
TAKRI
public static final int TAKRI
ISO 15924 script code
Constant Value: 153 (0x00000099)
TANGUT
public static final int TANGUT
ISO 15924 script code
Constant Value: 154 (0x0000009a)
TENGWAR
public static final int TENGWAR
ISO 15924 script code
Constant Value: 98 (0x00000062)
TIBETAN
public static final int TIBETAN
Tibetan
Constant Value: 39 (0x00000027)
TIFINAGH
public static final int TIFINAGH
Script in Unicode 4.1
Constant Value: 60 (0x0000003c)
TIRHUTA
public static final int TIRHUTA
ISO 15924 script code
Constant Value: 158 (0x0000009e)
TRADITIONAL_HAN
public static final int TRADITIONAL_HAN
ISO 15924 script code
Constant Value: 74 (0x0000004a)
UCAS
public static final int UCAS
Unified Canadian Aboriginal Syllabics (alias)
Constant Value: 40 (0x00000028)
UGARITIC
public static final int UGARITIC
Ugaritic Script in Unicode 4
Constant Value: 53 (0x00000035)
UNKNOWN
public static final int UNKNOWN
ISO 15924 script code
Constant Value: 103 (0x00000067)
UNWRITTEN_LANGUAGES
public static final int UNWRITTEN_LANGUAGES
ISO 15924 script code
Constant Value: 102 (0x00000066)
VAI
public static final int VAI
ISO 15924 script code
Constant Value: 99 (0x00000063)
VISIBLE_SPEECH
public static final int VISIBLE_SPEECH
ISO 15924 script code
Constant Value: 100 (0x00000064)
WARANG_CITI
public static final int WARANG_CITI
ISO 15924 script code
Constant Value: 146 (0x00000092)
WESTERN_SYRIAC
public static final int WESTERN_SYRIAC
ISO 15924 script code
Constant Value: 96 (0x00000060)
WOLEAI
public static final int WOLEAI
ISO 15924 script code
Constant Value: 155 (0x0000009b)
ZANABAZAR_SQUARE
public static final int ZANABAZAR_SQUARE
ISO 15924 script code
Constant Value: 177 (0x000000b1)
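Several constants listed above share a value. For example, `MANDAEAN` and `MANDAIC` are both 84, and `UCAS` and `CANADIAN_ABORIGINAL` are both 40. The sketch below groups a handful of name/value pairs copied from this page so the aliases become visible. The class `ScriptAliasDemo` and its local table are illustrative only and are not part of the API.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

final class ScriptAliasDemo {
    /** Groups constant names by value and keeps only values that have more than one name. */
    static Map<Integer, List<String>> aliases(Map<String, Integer> constants) {
        Map<Integer, List<String>> byValue = new TreeMap<>();
        for (Map.Entry<String, Integer> e : constants.entrySet()) {
            byValue.computeIfAbsent(e.getValue(), k -> new ArrayList<>()).add(e.getKey());
        }
        byValue.values().removeIf(names -> names.size() < 2);
        return byValue;
    }

    /** Name/value pairs copied from the constant list on this page (a small sample). */
    static Map<String, Integer> sample() {
        Map<String, Integer> m = new LinkedHashMap<>();
        m.put("CANADIAN_ABORIGINAL", 40);
        m.put("UCAS", 40);
        m.put("MANDAEAN", 84);
        m.put("MANDAIC", 84);
        m.put("MEROITIC", 86);
        m.put("MEROITIC_HIEROGLYPHS", 86);
        m.put("MIAO", 92);
        m.put("PHONETIC_POLLARD", 92);
        m.put("KHUDAWADI", 145);
        m.put("SINDHI", 145);
        m.put("LATIN_GAELIC", 81);  // no alias in this sample
        return m;
    }

    public static void main(String[] args) {
        System.out.println(aliases(sample()));
    }
}
```

One practical consequence: because aliased constants are compile-time constants with the same value, using both as `case` labels in a single `switch` is rejected by the Java compiler as a duplicate label.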
Public methods
breaksBetweenLetters
public static boolean breaksBetweenLetters (int script)
Returns true if the script allows line breaks between letters (excluding hyphenation). Such a script typically requires dictionary-based line breaking. For example, Hani and Thai.
Parameters | |
---|---|
script |
int : script code |
Returns | |
---|---|
boolean |
true if the script allows line breaks between letters |
getCode
public static int[] getCode (ULocale locale)
Gets the script codes associated with the given locale or ISO 15924 abbreviation or name. Returns MALAYALAM given "Malayalam" or "Mlym". Returns LATIN given "en" or "en_US".
Parameters | |
---|---|
locale |
ULocale : ULocale |
Returns | |
---|---|
int[] |
The script codes array, or null if the code cannot be found. |
getCode
public static int[] getCode (String nameOrAbbrOrLocale)
Gets the script codes associated with the given locale or ISO 15924 abbreviation or name. Returns MALAYALAM given "Malayalam" or "Mlym". Returns LATIN given "en" or "en_US".
Note: To search by short or long script alias only, use getCodeFromName(java.lang.String) instead. That does a fast lookup without accessing the locale data.
Parameters | |
---|---|
nameOrAbbrOrLocale |
String : name of the script or ISO 15924 code or locale |
Returns | |
---|---|
int[] |
The script codes array, or null if the code cannot be found. |
getCode
public static int[] getCode (Locale locale)
Gets the script codes associated with the given locale or ISO 15924 abbreviation or name. Returns MALAYALAM given "Malayalam" or "Mlym". Returns LATIN given "en" or "en_US".
Parameters | |
---|---|
locale |
Locale : Locale |
Returns | |
---|---|
int[] |
The script codes array, or null if the code cannot be found. |
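As documented, all three `getCode` overloads return `null` rather than an empty array when nothing matches, so callers need a null check before indexing the result. A minimal sketch of such a guard follows. `firstCodeOrInvalid` is a hypothetical helper, and the local `INVALID_CODE` simply mirrors the value -1 that this page documents for `UScript.INVALID_CODE`.

```java
final class ScriptCodes {
    // Mirrors the documented value of UScript.INVALID_CODE.
    static final int INVALID_CODE = -1;

    /** Returns the first script code, or INVALID_CODE when the lookup result is null or empty. */
    static int firstCodeOrInvalid(int[] codes) {
        return (codes == null || codes.length == 0) ? INVALID_CODE : codes[0];
    }

    public static void main(String[] args) {
        // In real use the array would come from a call such as UScript.getCode("en_US").
        System.out.println(firstCodeOrInvalid(null));
        // 26 is the documented value of MALAYALAM.
        System.out.println(firstCodeOrInvalid(new int[] {26}));
    }
}
```

Taking only the first element is a simplification: a locale can be associated with more than one script, in which case the caller may want the whole array.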
getCodeFromName
public static int getCodeFromName (String nameOrAbbr)
Returns the script code associated with the given Unicode script property alias (name or abbreviation). Short aliases are ISO 15924 script codes. Returns MALAYALAM given "Malayalam" or "Mlym".
Parameters | |
---|---|
nameOrAbbr |
String : name of the script or ISO 15924 code |
Returns | |
---|---|
int |
The script code value, or INVALID_CODE if the code cannot be found. |
getName
public static String getName (int scriptCode)
Returns the long Unicode script name, if there is one. Otherwise returns the 4-letter ISO 15924 script code. Returns "Malayalam" given MALAYALAM.
Parameters | |
---|---|
scriptCode |
int : int script code |
Returns | |
---|---|
String |
long script name as given in PropertyValueAliases.txt, or the 4-letter code |
Throws | |
---|---|
IllegalArgumentException |
if the script code is not valid |
getSampleString
public static String getSampleString (int script)
Returns the script sample character string. This string normally consists of one code point but might be longer. The string is empty if the script is not encoded.
Parameters | |
---|---|
script |
int : script code |
Returns | |
---|---|
String |
the sample character string |
getScript
public static int getScript (int codepoint)
Gets the script code associated with the given codepoint. Returns UScript.MALAYALAM given 0x0D02.
Parameters | |
---|---|
codepoint |
int : UChar32 codepoint |
Returns | |
---|---|
int |
The script code |
getScriptExtensions
public static int getScriptExtensions (int c, BitSet set)
Sets code point c's Script_Extensions as script code integers into the output BitSet.
- If c does have Script_Extensions, then the return value is the negative number of Script_Extensions codes (= -set.cardinality()); in this case, the Script property value (normally Common or Inherited) is not included in the set.
- If c does not have Script_Extensions, then the one Script code is put into the set and also returned.
- If c is not a valid code point, then the one UNKNOWN code is put into the set and also returned.
Some characters are commonly used in multiple scripts. For more information, see UAX #24: http://www.unicode.org/reports/tr24/.
Parameters | |
---|---|
c |
int : code point |
set |
BitSet : set of script code integers; will be cleared, then bits are set corresponding to c's Script_Extensions |
Returns | |
---|---|
int |
negative number of script codes in c's Script_Extensions, or the non-negative single Script value |
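The sign convention of the return value is easy to misread, so here is a small sketch of how a caller might interpret it. The helpers operate on an already-obtained result and `BitSet`; the class `ScriptExtensionsResult` and its method names are hypothetical, and the inputs in `main` are hand-built stand-ins for what `getScriptExtensions(c, set)` would produce.

```java
import java.util.BitSet;

final class ScriptExtensionsResult {
    /** True when the return value signals explicit Script_Extensions (a negative result). */
    static boolean hasExplicitExtensions(int result) {
        return result < 0;
    }

    /** Number of script codes placed in the set, following the documented contract. */
    static int scriptCount(int result) {
        // Negative: the magnitude is the number of extension codes.
        // Non-negative: the value is the single Script code, so the set holds one entry.
        return result < 0 ? -result : 1;
    }

    public static void main(String[] args) {
        // Hand-built inputs standing in for UScript.getScriptExtensions(c, set).
        BitSet set = new BitSet();
        set.set(10);  // DEVANAGARI (documented value 10)
        set.set(4);   // BENGALI (documented value 4)
        int result = -set.cardinality();
        System.out.println(hasExplicitExtensions(result) + " " + scriptCount(result));
    }
}
```

Testing `result < 0` rather than `result <= 0` matters here, since a non-negative return is a valid single script code.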
getShortName
public static String getShortName (int scriptCode)
Returns the 4-letter ISO 15924 script code, which is the same as the short Unicode script name if Unicode has names for the script. Returns "Mlym" given MALAYALAM.
Parameters | |
---|---|
scriptCode |
int : int script code |
Returns | |
---|---|
String |
short script name (4-letter code) |
Throws | |
---|---|
IllegalArgumentException |
if the script code is not valid |
getUsage
public static UScript.ScriptUsage getUsage (int script)
Returns the script usage according to UAX #31 Unicode Identifier and Pattern Syntax.
Returns ScriptUsage#NOT_ENCODED if the script is not encoded in Unicode.
Parameters | |
---|---|
script |
int : script code |
Returns | |
---|---|
UScript.ScriptUsage |
script usage |
hasScript
public static boolean hasScript (int c, int sc)
Do the Script_Extensions of code point c contain script sc? If c does not have explicit Script_Extensions, then this tests whether c has the Script property value sc.
Some characters are commonly used in multiple scripts. For more information, see UAX #24: http://www.unicode.org/reports/tr24/.
Parameters | |
---|---|
c |
int : code point |
sc |
int : script code |
Returns | |
---|---|
boolean |
true if sc is in Script_Extensions(c) |
isCased
public static boolean isCased (int script)
Returns true if case distinctions are customary in modern (or most recent) usage of the script. For example, Latn and Cyrl.
Parameters | |
---|---|
script |
int : script code |
Returns | |
---|---|
boolean |
true if the script is cased |
isRightToLeft
public static boolean isRightToLeft (int script)
Returns true if the script is written right-to-left. For example, Arab and Hebr.
Parameters | |
---|---|
script |
int : script code |
Returns | |
---|---|
boolean |
true if the script is right-to-left |