public final class UScript extends Object
The current set of script code constants supports at least all scripts that are encoded in the version of Unicode which ICU currently supports. The names of the constants are usually derived from the Unicode script property value aliases. See UAX #24 Unicode Script Property (http://www.unicode.org/reports/tr24/) and http://www.unicode.org/Public/UCD/latest/ucd/PropertyValueAliases.txt .
In addition, constants for many ISO 15924 script codes are included, for use with language tags, CLDR data, and similar. Some of those codes are not used in the Unicode Character Database (UCD). For example, there are no characters that have a UCD script property value of Hans or Hant. All Han ideographs have the Hani script property value in Unicode.
Private-use codes Qaaa..Qabx are not included, except as used in the UCD or in CLDR.
Starting with ICU 55, script codes are only added when their scripts have been or will certainly be encoded in Unicode, and have been assigned Unicode script property value aliases, to ensure that their script names are stable and match the names of the constants. Script codes like Latf and Aran that are not subject to separate encoding may be added at any time.
Modifier and Type | Class | Description |
---|---|---|
static class |
UScript.ScriptUsage |
Script usage constants.
|
Modifier and Type | Field | Description |
---|---|---|
static int |
ADLAM |
ISO 15924 script code
|
static int |
AFAKA |
ISO 15924 script code
|
static int |
AHOM |
ISO 15924 script code
|
static int |
ANATOLIAN_HIEROGLYPHS |
ISO 15924 script code
|
static int |
ARABIC |
Arabic
|
static int |
ARMENIAN |
Armenian
|
static int |
AVESTAN |
ISO 15924 script code
|
static int |
BALINESE |
ISO 15924 script code
|
static int |
BAMUM |
ISO 15924 script code
|
static int |
BASSA_VAH |
ISO 15924 script code
|
static int |
BATAK |
ISO 15924 script code
|
static int |
BENGALI |
Bengali
|
static int |
BHAIKSUKI |
ISO 15924 script code
|
static int |
BLISSYMBOLS |
ISO 15924 script code
|
static int |
BOOK_PAHLAVI |
ISO 15924 script code
|
static int |
BOPOMOFO |
Bopomofo
|
static int |
BRAHMI |
ISO 15924 script code
|
static int |
BRAILLE |
Braille
Script in Unicode 4
|
static int |
BUGINESE |
Script in Unicode 4.1
|
static int |
BUHID |
Buhid
|
static int |
CANADIAN_ABORIGINAL |
Unified Canadian Aboriginal Symbols
|
static int |
CARIAN |
ISO 15924 script code
|
static int |
CAUCASIAN_ALBANIAN |
ISO 15924 script code
|
static int |
CHAKMA |
ISO 15924 script code
|
static int |
CHAM |
ISO 15924 script code
|
static int |
CHEROKEE |
Cherokee
|
static int |
CIRTH |
ISO 15924 script code
|
static int |
CODE_LIMIT |
Deprecated.
ICU 58 The numeric value may change over time, see ICU ticket #12420.
|
static int |
COMMON |
Common
|
static int |
COPTIC |
Coptic
|
static int |
CUNEIFORM |
ISO 15924 script code
|
static int |
CYPRIOT |
Cypriot
Script in Unicode 4
|
static int |
CYRILLIC |
Cyrillic
|
static int |
DEMOTIC_EGYPTIAN |
ISO 15924 script code
|
static int |
DESERET |
Deseret
|
static int |
DEVANAGARI |
Devanagari
|
static int |
DUPLOYAN |
ISO 15924 script code
|
static int |
DUPLOYAN_SHORTAND |
Deprecated.
ICU 54
|
static int |
EASTERN_SYRIAC |
ISO 15924 script code
|
static int |
EGYPTIAN_HIEROGLYPHS |
ISO 15924 script code
|
static int |
ELBASAN |
ISO 15924 script code
|
static int |
ESTRANGELO_SYRIAC |
ISO 15924 script code
|
static int |
ETHIOPIC |
Ethiopic
|
static int |
GEORGIAN |
Georgian
|
static int |
GLAGOLITIC |
Script in Unicode 4.1
|
static int |
GOTHIC |
Gothic
|
static int |
GRANTHA |
ISO 15924 script code
|
static int |
GREEK |
Greek
|
static int |
GUJARATI |
Gujarati
|
static int |
GURMUKHI |
Gurmukhi
|
static int |
HAN |
Han
|
static int |
HAN_WITH_BOPOMOFO |
ISO 15924 script code
|
static int |
HANGUL |
Hangul
|
static int |
HANUNOO |
Hanunooo
|
static int |
HARAPPAN_INDUS |
ISO 15924 script code
|
static int |
HATRAN |
ISO 15924 script code
|
static int |
HEBREW |
Hebrew
|
static int |
HIERATIC_EGYPTIAN |
ISO 15924 script code
|
static int |
HIRAGANA |
Hiragana
|
static int |
IMPERIAL_ARAMAIC |
ISO 15924 script code
|
static int |
INHERITED |
Inherited
|
static int |
INSCRIPTIONAL_PAHLAVI |
ISO 15924 script code
|
static int |
INSCRIPTIONAL_PARTHIAN |
ISO 15924 script code
|
static int |
INVALID_CODE |
Invalid code
|
static int |
JAMO |
ISO 15924 script code
|
static int |
JAPANESE |
ISO 15924 script code
|
static int |
JAVANESE |
ISO 15924 script code
|
static int |
JURCHEN |
ISO 15924 script code
|
static int |
KAITHI |
ISO 15924 script code
|
static int |
KANNADA |
Kannada
|
static int |
KATAKANA |
Katakana
|
static int |
KATAKANA_OR_HIRAGANA |
Script in Unicode 4.0.1
|
static int |
KAYAH_LI |
ISO 15924 script code
|
static int |
KHAROSHTHI |
Script in Unicode 4.1
|
static int |
KHMER |
Khmer
|
static int |
KHOJKI |
ISO 15924 script code
|
static int |
KHUDAWADI |
ISO 15924 script code
|
static int |
KHUTSURI |
ISO 15924 script code
|
static int |
KOREAN |
ISO 15924 script code
|
static int |
KPELLE |
ISO 15924 script code
|
static int |
LANNA |
ISO 15924 script code
|
static int |
LAO |
Lao
|
static int |
LATIN |
Latin
|
static int |
LATIN_FRAKTUR |
ISO 15924 script code
|
static int |
LATIN_GAELIC |
ISO 15924 script code
|
static int |
LEPCHA |
ISO 15924 script code
|
static int |
LIMBU |
Limbu
Script in Unicode 4
|
static int |
LINEAR_A |
ISO 15924 script code
|
static int |
LINEAR_B |
Linear B
Script in Unicode 4
|
static int |
LISU |
ISO 15924 script code
|
static int |
LOMA |
ISO 15924 script code
|
static int |
LYCIAN |
ISO 15924 script code
|
static int |
LYDIAN |
ISO 15924 script code
|
static int |
MAHAJANI |
ISO 15924 script code
|
static int |
MALAYALAM |
Malayalam
|
static int |
MANDAEAN |
ISO 15924 script code
|
static int |
MANDAIC |
ISO 15924 script code
|
static int |
MANICHAEAN |
ISO 15924 script code
|
static int |
MARCHEN |
ISO 15924 script code
|
static int |
MASARAM_GONDI |
ISO 15924 script code
|
static int |
MATHEMATICAL_NOTATION |
ISO 15924 script code
|
static int |
MAYAN_HIEROGLYPHS |
ISO 15924 script code
|
static int |
MEITEI_MAYEK |
ISO 15924 script code
|
static int |
MENDE |
Mende Kikakui
ISO 15924 script code
|
static int |
MEROITIC |
ISO 15924 script code
|
static int |
MEROITIC_CURSIVE |
ISO 15924 script code
|
static int |
MEROITIC_HIEROGLYPHS |
ISO 15924 script code
|
static int |
MIAO |
ISO 15924 script code
|
static int |
MODI |
ISO 15924 script code
|
static int |
MONGOLIAN |
Mangolian
|
static int |
MOON |
ISO 15924 script code
|
static int |
MRO |
ISO 15924 script code
|
static int |
MULTANI |
ISO 15924 script code
|
static int |
MYANMAR |
Myammar
|
static int |
NABATAEAN |
ISO 15924 script code
|
static int |
NAKHI_GEBA |
ISO 15924 script code
|
static int |
NEW_TAI_LUE |
Script in Unicode 4.1
|
static int |
NEWA |
ISO 15924 script code
|
static int |
NKO |
ISO 15924 script code
|
static int |
NUSHU |
ISO 15924 script code
|
static int |
OGHAM |
Ogham
|
static int |
OL_CHIKI |
ISO 15924 script code
|
static int |
OLD_CHURCH_SLAVONIC_CYRILLIC |
ISO 15924 script code
|
static int |
OLD_HUNGARIAN |
ISO 15924 script code
|
static int |
OLD_ITALIC |
Old Itallic
|
static int |
OLD_NORTH_ARABIAN |
ISO 15924 script code
|
static int |
OLD_PERMIC |
ISO 15924 script code
|
static int |
OLD_PERSIAN |
Script in Unicode 4.1
|
static int |
OLD_SOUTH_ARABIAN |
ISO 15924 script code
|
static int |
ORIYA |
Oriya
|
static int |
ORKHON |
ISO 15924 script code
|
static int |
OSAGE |
ISO 15924 script code
|
static int |
OSMANYA |
Osmanya
Script in Unicode 4
|
static int |
PAHAWH_HMONG |
ISO 15924 script code
|
static int |
PALMYRENE |
ISO 15924 script code
|
static int |
PAU_CIN_HAU |
ISO 15924 script code
|
static int |
PHAGS_PA |
ISO 15924 script code
|
static int |
PHOENICIAN |
ISO 15924 script code
|
static int |
PHONETIC_POLLARD |
ISO 15924 script code
|
static int |
PSALTER_PAHLAVI |
ISO 15924 script code
|
static int |
REJANG |
ISO 15924 script code
|
static int |
RONGORONGO |
ISO 15924 script code
|
static int |
RUNIC |
Runic
|
static int |
SAMARITAN |
ISO 15924 script code
|
static int |
SARATI |
ISO 15924 script code
|
static int |
SAURASHTRA |
ISO 15924 script code
|
static int |
SHARADA |
ISO 15924 script code
|
static int |
SHAVIAN |
Shavian
Script in Unicode 4
|
static int |
SIDDHAM |
ISO 15924 script code
|
static int |
SIGN_WRITING |
ISO 15924 script code for Sutton SignWriting
|
static int |
SIMPLIFIED_HAN |
ISO 15924 script code
|
static int |
SINDHI |
ISO 15924 script code
|
static int |
SINHALA |
Sinhala
|
static int |
SORA_SOMPENG |
ISO 15924 script code
|
static int |
SOYOMBO |
ISO 15924 script code
|
static int |
SUNDANESE |
ISO 15924 script code
|
static int |
SYLOTI_NAGRI |
Script in Unicode 4.1
|
static int |
SYMBOLS |
ISO 15924 script code
|
static int |
SYMBOLS_EMOJI |
ISO 15924 script code
|
static int |
SYRIAC |
Syriac
|
static int |
TAGALOG |
Tagalog
|
static int |
TAGBANWA |
Tagbanwa
|
static int |
TAI_LE |
Tai Le
Script in Unicode 4
|
static int |
TAI_VIET |
ISO 15924 script code
|
static int |
TAKRI |
ISO 15924 script code
|
static int |
TAMIL |
Tamil
|
static int |
TANGUT |
ISO 15924 script code
|
static int |
TELUGU |
Telugu
|
static int |
TENGWAR |
ISO 15924 script code
|
static int |
THAANA |
Thana
|
static int |
THAI |
Thai
|
static int |
TIBETAN |
Tibetan
|
static int |
TIFINAGH |
Script in Unicode 4.1
|
static int |
TIRHUTA |
ISO 15924 script code
|
static int |
TRADITIONAL_HAN |
ISO 15924 script code
|
static int |
UCAS |
Unified Canadian Aboriginal Symbols (alias)
|
static int |
UGARITIC |
Ugaritic
Script in Unicode 4
|
static int |
UNKNOWN |
ISO 15924 script code
|
static int |
UNWRITTEN_LANGUAGES |
ISO 15924 script code
|
static int |
VAI |
ISO 15924 script code
|
static int |
VISIBLE_SPEECH |
ISO 15924 script code
|
static int |
WARANG_CITI |
ISO 15924 script code
|
static int |
WESTERN_SYRIAC |
ISO 15924 script code
|
static int |
WOLEAI |
ISO 15924 script code
|
static int |
YI |
Yi syllables
|
static int |
ZANABAZAR_SQUARE |
ISO 15924 script code
|
Modifier and Type | Method | Description |
---|---|---|
static boolean |
breaksBetweenLetters(int script) |
Returns true if the script allows line breaks between letters (excluding hyphenation).
|
static int[] |
getCode(ULocale locale) |
Gets a script codes associated with the given locale or ISO 15924 abbreviation or name.
|
static int[] |
getCode(String nameOrAbbrOrLocale) |
Gets the script codes associated with the given locale or ISO 15924 abbreviation or name.
|
static int[] |
getCode(Locale locale) |
Gets a script codes associated with the given locale or ISO 15924 abbreviation or name.
|
static int |
getCodeFromName(String nameOrAbbr) |
Returns the script code associated with the given Unicode script property alias
(name or abbreviation).
|
static String |
getName(int scriptCode) |
Returns the long Unicode script name, if there is one.
|
static String |
getSampleString(int script) |
Returns the script sample character string.
|
static int |
getScript(int codepoint) |
Gets the script code associated with the given codepoint.
|
static int |
getScriptExtensions(int c,
BitSet set) |
Sets code point c's Script_Extensions as script code integers into the output BitSet.
|
static String |
getShortName(int scriptCode) |
Returns the 4-letter ISO 15924 script code,
which is the same as the short Unicode script name if Unicode has names for the script.
|
static UScript.ScriptUsage |
getUsage(int script) |
Returns the script usage according to UAX #31 Unicode Identifier and Pattern Syntax.
|
static boolean |
hasScript(int c,
int sc) |
Do the Script_Extensions of code point c contain script sc?
|
static boolean |
isCased(int script) |
Returns true if in modern (or most recent) usage of the script case distinctions are customary.
|
static boolean |
isRightToLeft(int script) |
Returns true if the script is written right-to-left.
|
public static final int INVALID_CODE
public static final int COMMON
public static final int INHERITED
public static final int ARABIC
public static final int ARMENIAN
public static final int BENGALI
public static final int BOPOMOFO
public static final int CHEROKEE
public static final int COPTIC
public static final int CYRILLIC
public static final int DESERET
public static final int DEVANAGARI
public static final int ETHIOPIC
public static final int GEORGIAN
public static final int GOTHIC
public static final int GREEK
public static final int GUJARATI
public static final int GURMUKHI
public static final int HAN
public static final int HANGUL
public static final int HEBREW
public static final int HIRAGANA
public static final int KANNADA
public static final int KATAKANA
public static final int KHMER
public static final int LAO
public static final int LATIN
public static final int MALAYALAM
public static final int MONGOLIAN
public static final int MYANMAR
public static final int OGHAM
public static final int OLD_ITALIC
public static final int ORIYA
public static final int RUNIC
public static final int SINHALA
public static final int SYRIAC
public static final int TAMIL
public static final int TELUGU
public static final int THAANA
public static final int THAI
public static final int TIBETAN
public static final int CANADIAN_ABORIGINAL
public static final int UCAS
public static final int YI
public static final int TAGALOG
public static final int HANUNOO
public static final int BUHID
public static final int TAGBANWA
public static final int BRAILLE
public static final int CYPRIOT
public static final int LIMBU
public static final int LINEAR_B
public static final int OSMANYA
public static final int SHAVIAN
public static final int TAI_LE
public static final int UGARITIC
public static final int KATAKANA_OR_HIRAGANA
public static final int BUGINESE
public static final int GLAGOLITIC
public static final int KHAROSHTHI
public static final int SYLOTI_NAGRI
public static final int NEW_TAI_LUE
public static final int TIFINAGH
public static final int OLD_PERSIAN
public static final int BALINESE
public static final int BATAK
public static final int BLISSYMBOLS
public static final int BRAHMI
public static final int CHAM
public static final int CIRTH
public static final int OLD_CHURCH_SLAVONIC_CYRILLIC
public static final int DEMOTIC_EGYPTIAN
public static final int HIERATIC_EGYPTIAN
public static final int EGYPTIAN_HIEROGLYPHS
public static final int KHUTSURI
public static final int SIMPLIFIED_HAN
public static final int TRADITIONAL_HAN
public static final int PAHAWH_HMONG
public static final int OLD_HUNGARIAN
public static final int HARAPPAN_INDUS
public static final int JAVANESE
public static final int KAYAH_LI
public static final int LATIN_FRAKTUR
public static final int LATIN_GAELIC
public static final int LEPCHA
public static final int LINEAR_A
public static final int MANDAIC
public static final int MANDAEAN
public static final int MAYAN_HIEROGLYPHS
public static final int MEROITIC_HIEROGLYPHS
public static final int MEROITIC
public static final int NKO
public static final int ORKHON
public static final int OLD_PERMIC
public static final int PHAGS_PA
public static final int PHOENICIAN
public static final int MIAO
public static final int PHONETIC_POLLARD
public static final int RONGORONGO
public static final int SARATI
public static final int ESTRANGELO_SYRIAC
public static final int WESTERN_SYRIAC
public static final int EASTERN_SYRIAC
public static final int TENGWAR
public static final int VAI
public static final int VISIBLE_SPEECH
public static final int CUNEIFORM
public static final int UNWRITTEN_LANGUAGES
public static final int UNKNOWN
public static final int CARIAN
public static final int JAPANESE
public static final int LANNA
public static final int LYCIAN
public static final int LYDIAN
public static final int OL_CHIKI
public static final int REJANG
public static final int SAURASHTRA
public static final int SIGN_WRITING
public static final int SUNDANESE
public static final int MOON
public static final int MEITEI_MAYEK
public static final int IMPERIAL_ARAMAIC
public static final int AVESTAN
public static final int CHAKMA
public static final int KOREAN
public static final int KAITHI
public static final int MANICHAEAN
public static final int INSCRIPTIONAL_PAHLAVI
public static final int PSALTER_PAHLAVI
public static final int BOOK_PAHLAVI
public static final int INSCRIPTIONAL_PARTHIAN
public static final int SAMARITAN
public static final int TAI_VIET
public static final int MATHEMATICAL_NOTATION
public static final int SYMBOLS
public static final int BAMUM
public static final int LISU
public static final int NAKHI_GEBA
public static final int OLD_SOUTH_ARABIAN
public static final int BASSA_VAH
public static final int DUPLOYAN
@Deprecated public static final int DUPLOYAN_SHORTAND
public static final int ELBASAN
public static final int GRANTHA
public static final int KPELLE
public static final int LOMA
public static final int MENDE
public static final int MEROITIC_CURSIVE
public static final int OLD_NORTH_ARABIAN
public static final int NABATAEAN
public static final int PALMYRENE
public static final int KHUDAWADI
public static final int SINDHI
public static final int WARANG_CITI
public static final int AFAKA
public static final int JURCHEN
public static final int MRO
public static final int NUSHU
public static final int SHARADA
public static final int SORA_SOMPENG
public static final int TAKRI
public static final int TANGUT
public static final int WOLEAI
public static final int ANATOLIAN_HIEROGLYPHS
public static final int KHOJKI
public static final int TIRHUTA
public static final int CAUCASIAN_ALBANIAN
public static final int MAHAJANI
public static final int AHOM
public static final int HATRAN
public static final int MODI
public static final int MULTANI
public static final int PAU_CIN_HAU
public static final int SIDDHAM
public static final int ADLAM
public static final int BHAIKSUKI
public static final int MARCHEN
public static final int NEWA
public static final int OSAGE
public static final int HAN_WITH_BOPOMOFO
public static final int JAMO
public static final int SYMBOLS_EMOJI
public static final int MASARAM_GONDI
public static final int SOYOMBO
public static final int ZANABAZAR_SQUARE
@Deprecated public static final int CODE_LIMIT
public static final int[] getCode(Locale locale)
locale
- Localepublic static final int[] getCode(ULocale locale)
locale
- ULocalepublic static final int[] getCode(String nameOrAbbrOrLocale)
Note: To search by short or long script alias only, use
getCodeFromName(String)
instead.
That does a fast lookup with no access of the locale data.
nameOrAbbrOrLocale
- name of the script or ISO 15924 code or localepublic static final int getCodeFromName(String nameOrAbbr)
nameOrAbbr
- name of the script or ISO 15924 codepublic static final int getScript(int codepoint)
codepoint
- UChar32 codepointpublic static final boolean hasScript(int c, int sc)
Some characters are commonly used in multiple scripts. For more information, see UAX #24: http://www.unicode.org/reports/tr24/.
c
- code pointsc
- script codepublic static final int getScriptExtensions(int c, BitSet set)
UNKNOWN
code is put into the set
and also returned.
Some characters are commonly used in multiple scripts. For more information, see UAX #24: http://www.unicode.org/reports/tr24/.
c
- code pointset
- set of script code integers; will be cleared, then bits are set
corresponding to c's Script_Extensionspublic static final String getName(int scriptCode)
scriptCode
- int script codeIllegalArgumentException
- if the script code is not validpublic static final String getShortName(int scriptCode)
scriptCode
- int script codeIllegalArgumentException
- if the script code is not validpublic static final String getSampleString(int script)
script
- script codepublic static final UScript.ScriptUsage getUsage(int script)
UScript.ScriptUsage.NOT_ENCODED
if the script is not encoded in Unicode.script
- script codeUScript.ScriptUsage
public static final boolean isRightToLeft(int script)
script
- script codepublic static final boolean breaksBetweenLetters(int script)
script
- script codepublic static final boolean isCased(int script)
script
- script codeCopyright © 2016 Unicode, Inc. and others.