Namespace: go.std.unicode
v1.0
Summary
Provides a low-level interface to Go's unicode package.
Package unicode provides data and functions to test some properties of
Unicode code points.
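As a rough orientation, the sketch below uses Go's own unicode package (the package this namespace wraps) to tally a few properties of the code points in a string. The tally type and the classify function are hypothetical names introduced only for illustration; calling the same predicates from Joker is assumed to follow the Joker signatures listed further down this page.

```go
package main

import (
	"fmt"
	"unicode"
)

// tally counts runes by a few coarse Unicode properties.
// It is an illustrative type, not part of the unicode package.
type tally struct {
	letters, digits, spaces, puncts int
}

// classify walks s rune by rune and counts each code point under the
// first matching predicate; runes matching none are ignored.
func classify(s string) tally {
	var t tally
	for _, r := range s {
		switch {
		case unicode.IsLetter(r):
			t.letters++
		case unicode.IsDigit(r):
			t.digits++
		case unicode.IsSpace(r):
			t.spaces++
		case unicode.IsPunct(r):
			t.puncts++
		}
	}
	return t
}

func main() {
	fmt.Printf("%+v\n", classify("Joker 1.0, héllo!"))
}
```

The predicates work on single code points, so the loop ranges over runes rather than bytes.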
Index
- *CaseRange
- *Range16
- *Range32
- *RangeTable
- *SpecialCase
- ASCII_Hex_Digit
- Adlam
- Ahom
- Anatolian_Hieroglyphs
- Arabic
- Armenian
- Avestan
- AzeriCase
- Balinese
- Bamum
- Bassa_Vah
- Batak
- Bengali
- Bhaiksuki
- Bidi_Control
- Bopomofo
- Brahmi
- Braille
- Buginese
- Buhid
- C
- Canadian_Aboriginal
- Carian
- CaseRange
- CaseRanges
- Categories
- Caucasian_Albanian
- Cc
- Cf
- Chakma
- Cham
- Cherokee
- Chorasmian
- Co
- Common
- Coptic
- Cs
- Cuneiform
- Cypriot
- Cyrillic
- Dash
- Deprecated
- Deseret
- Devanagari
- Diacritic
- Digit
- Dives_Akuru
- Dogra
- Duployan
- Egyptian_Hieroglyphs
- Elbasan
- Elymaic
- Ethiopic
- Extender
- FoldCategory
- FoldScript
- Georgian
- Glagolitic
- Gothic
- Grantha
- GraphicRanges
- Greek
- Gujarati
- Gunjala_Gondi
- Gurmukhi
- Han
- Hangul
- Hanifi_Rohingya
- Hanunoo
- Hatran
- Hebrew
- Hex_Digit
- Hiragana
- Hyphen
- IDS_Binary_Operator
- IDS_Trinary_Operator
- Ideographic
- Imperial_Aramaic
- In
- Inherited
- Inscriptional_Pahlavi
- Inscriptional_Parthian
- Is
- IsControl
- IsDigit
- IsGraphic
- IsLetter
- IsLower
- IsMark
- IsNumber
- IsOneOf
- IsPrint
- IsPunct
- IsSpace
- IsSymbol
- IsTitle
- IsUpper
- Javanese
- Join_Control
- Kaithi
- Kannada
- Katakana
- Kayah_Li
- Kharoshthi
- Khitan_Small_Script
- Khmer
- Khojki
- Khudawadi
- L
- Lao
- Latin
- Lepcha
- Letter
- Limbu
- Linear_A
- Linear_B
- Lisu
- Ll
- Lm
- Lo
- Logical_Order_Exception
- Lower
- LowerCase
- Lt
- Lu
- Lycian
- Lydian
- M
- Mahajani
- Makasar
- Malayalam
- Mandaic
- Manichaean
- Marchen
- Mark
- Masaram_Gondi
- MaxASCII
- MaxCase
- MaxLatin1
- MaxRune
- Mc
- Me
- Medefaidrin
- Meetei_Mayek
- Mende_Kikakui
- Meroitic_Cursive
- Meroitic_Hieroglyphs
- Miao
- Mn
- Modi
- Mongolian
- Mro
- Multani
- Myanmar
- N
- Nabataean
- Nandinagari
- Nd
- New_Tai_Lue
- Newa
- Nko
- Nl
- No
- Noncharacter_Code_Point
- Number
- Nushu
- Nyiakeng_Puachue_Hmong
- Ogham
- Ol_Chiki
- Old_Hungarian
- Old_Italic
- Old_North_Arabian
- Old_Permic
- Old_Persian
- Old_Sogdian
- Old_South_Arabian
- Old_Turkic
- Oriya
- Osage
- Osmanya
- Other
- Other_Alphabetic
- Other_Default_Ignorable_Code_Point
- Other_Grapheme_Extend
- Other_ID_Continue
- Other_ID_Start
- Other_Lowercase
- Other_Math
- Other_Uppercase
- P
- Pahawh_Hmong
- Palmyrene
- Pattern_Syntax
- Pattern_White_Space
- Pau_Cin_Hau
- Pc
- Pd
- Pe
- Pf
- Phags_Pa
- Phoenician
- Pi
- Po
- Prepended_Concatenation_Mark
- PrintRanges
- Properties
- Ps
- Psalter_Pahlavi
- Punct
- Quotation_Mark
- Radical
- Range16
- Range32
- RangeTable
- Regional_Indicator
- Rejang
- ReplacementChar
- Runic
- S
- STerm
- Samaritan
- Saurashtra
- Sc
- Scripts
- Sentence_Terminal
- Sharada
- Shavian
- Siddham
- SignWriting
- SimpleFold
- Sinhala
- Sk
- Sm
- So
- Soft_Dotted
- Sogdian
- Sora_Sompeng
- Soyombo
- Space
- SpecialCase
- Sundanese
- Syloti_Nagri
- Symbol
- Syriac
- Tagalog
- Tagbanwa
- Tai_Le
- Tai_Tham
- Tai_Viet
- Takri
- Tamil
- Tangut
- Telugu
- Terminal_Punctuation
- Thaana
- Thai
- Tibetan
- Tifinagh
- Tirhuta
- Title
- TitleCase
- To
- ToLower
- ToTitle
- ToUpper
- TurkishCase
- Ugaritic
- Unified_Ideograph
- Upper
- UpperCase
- UpperLower
- Vai
- Variation_Selector
- Version
- Wancho
- Warang_Citi
- White_Space
- Yezidi
- Yi
- Z
- Zanabazar_Square
- Zl
- Zp
- Zs
- arrayOfCaseRange
- arrayOfRange16
- arrayOfRange32
- arrayOfRangeTable
- arrayOfSpecialCase
Constants
Constants are variables with :const true in their metadata. Joker currently does not treat them as special, so it allows redefining them or changing their values.
-
LowerCase
Int v1.0 Index of the lower-case entry in the Delta arrays inside CaseRanges; pass it to To to request lower case.
-
MaxASCII
Char v1.0 Maximum ASCII value (U+007F).
-
MaxCase
Int v1.0 Number of case indices, and so the length of each Delta array inside CaseRanges.
-
MaxLatin1
Char v1.0 Maximum Latin-1 value (U+00FF).
-
MaxRune
Char v1.0 Maximum valid Unicode code point (U+10FFFF).
-
ReplacementChar
Char v1.0 Represents invalid code points (U+FFFD).
-
TitleCase
Int v1.0 Index of the title-case entry in the Delta arrays inside CaseRanges; pass it to To to request title case.
-
UpperCase
Int v1.0 Index of the upper-case entry in the Delta arrays inside CaseRanges; pass it to To to request upper case.
-
UpperLower
Char v1.0 Marks a CaseRange whose code points alternate Upper, Lower, Upper, Lower. Its value, MaxRune + 1, cannot be a valid delta.
-
Version
String v1.0 Version is the Unicode edition from which the tables are derived.
Variables
-
ASCII_Hex_Digit
Var v1.0ASCII_Hex_Digit is the set of Unicode characters with property ASCII_Hex_Digit.
-
Adlam
Var v1.0Adlam is the set of Unicode characters in script Adlam.
-
Ahom
Var v1.0Ahom is the set of Unicode characters in script Ahom.
-
Anatolian_Hieroglyphs
Var v1.0Anatolian_Hieroglyphs is the set of Unicode characters in script Anatolian_Hieroglyphs.
-
Arabic
Var v1.0Arabic is the set of Unicode characters in script Arabic.
-
Armenian
Var v1.0Armenian is the set of Unicode characters in script Armenian.
-
Avestan
Var v1.0Avestan is the set of Unicode characters in script Avestan.
-
AzeriCase
Var v1.0 AzeriCase is the SpecialCase for Azeri; it uses the same mappings as TurkishCase.
-
Balinese
Var v1.0Balinese is the set of Unicode characters in script Balinese.
-
Bamum
Var v1.0Bamum is the set of Unicode characters in script Bamum.
-
Bassa_Vah
Var v1.0Bassa_Vah is the set of Unicode characters in script Bassa_Vah.
-
Batak
Var v1.0Batak is the set of Unicode characters in script Batak.
-
Bengali
Var v1.0Bengali is the set of Unicode characters in script Bengali.
-
Bhaiksuki
Var v1.0Bhaiksuki is the set of Unicode characters in script Bhaiksuki.
-
Bidi_Control
Var v1.0Bidi_Control is the set of Unicode characters with property Bidi_Control.
-
Bopomofo
Var v1.0Bopomofo is the set of Unicode characters in script Bopomofo.
-
Brahmi
Var v1.0Brahmi is the set of Unicode characters in script Brahmi.
-
Braille
Var v1.0Braille is the set of Unicode characters in script Braille.
-
Buginese
Var v1.0Buginese is the set of Unicode characters in script Buginese.
-
Buhid
Var v1.0Buhid is the set of Unicode characters in script Buhid.
-
C
Var v1.0 C is the set of Unicode control and special characters, category C; it is the same table as Other and has type *RangeTable.
-
Canadian_Aboriginal
Var v1.0Canadian_Aboriginal is the set of Unicode characters in script Canadian_Aboriginal.
-
Carian
Var v1.0Carian is the set of Unicode characters in script Carian.
-
CaseRanges
Var v1.0CaseRanges is the table describing case mappings for all letters with
non-self mappings.
-
Categories
Var v1.0Categories is the set of Unicode category tables.
-
Caucasian_Albanian
Var v1.0Caucasian_Albanian is the set of Unicode characters in script Caucasian_Albanian.
-
Cc
Var v1.0Cc is the set of Unicode characters in category Cc (Other, control).
-
Cf
Var v1.0Cf is the set of Unicode characters in category Cf (Other, format).
-
Chakma
Var v1.0Chakma is the set of Unicode characters in script Chakma.
-
Cham
Var v1.0Cham is the set of Unicode characters in script Cham.
-
Cherokee
Var v1.0Cherokee is the set of Unicode characters in script Cherokee.
-
Chorasmian
Var v1.0Chorasmian is the set of Unicode characters in script Chorasmian.
-
Co
Var v1.0Co is the set of Unicode characters in category Co (Other, private use).
-
Common
Var v1.0Common is the set of Unicode characters in script Common.
-
Coptic
Var v1.0Coptic is the set of Unicode characters in script Coptic.
-
Cs
Var v1.0Cs is the set of Unicode characters in category Cs (Other, surrogate).
-
Cuneiform
Var v1.0Cuneiform is the set of Unicode characters in script Cuneiform.
-
Cypriot
Var v1.0Cypriot is the set of Unicode characters in script Cypriot.
-
Cyrillic
Var v1.0Cyrillic is the set of Unicode characters in script Cyrillic.
-
Dash
Var v1.0Dash is the set of Unicode characters with property Dash.
-
Deprecated
Var v1.0Deprecated is the set of Unicode characters with property Deprecated.
-
Deseret
Var v1.0Deseret is the set of Unicode characters in script Deseret.
-
Devanagari
Var v1.0Devanagari is the set of Unicode characters in script Devanagari.
-
Diacritic
Var v1.0Diacritic is the set of Unicode characters with property Diacritic.
-
Digit
Var v1.0Digit is the set of Unicode characters with the "decimal digit" property.
-
Dives_Akuru
Var v1.0Dives_Akuru is the set of Unicode characters in script Dives_Akuru.
-
Dogra
Var v1.0Dogra is the set of Unicode characters in script Dogra.
-
Duployan
Var v1.0Duployan is the set of Unicode characters in script Duployan.
-
Egyptian_Hieroglyphs
Var v1.0Egyptian_Hieroglyphs is the set of Unicode characters in script Egyptian_Hieroglyphs.
-
Elbasan
Var v1.0Elbasan is the set of Unicode characters in script Elbasan.
-
Elymaic
Var v1.0Elymaic is the set of Unicode characters in script Elymaic.
-
Ethiopic
Var v1.0Ethiopic is the set of Unicode characters in script Ethiopic.
-
Extender
Var v1.0Extender is the set of Unicode characters with property Extender.
-
FoldCategory
Var v1.0FoldCategory maps a category name to a table of
code points outside the category that are equivalent under
simple case folding to code points inside the category.
If there is no entry for a category name, there are no such points.
-
FoldScript
Var v1.0FoldScript maps a script name to a table of
code points outside the script that are equivalent under
simple case folding to code points inside the script.
If there is no entry for a script name, there are no such points.
-
Georgian
Var v1.0Georgian is the set of Unicode characters in script Georgian.
-
Glagolitic
Var v1.0Glagolitic is the set of Unicode characters in script Glagolitic.
-
Gothic
Var v1.0Gothic is the set of Unicode characters in script Gothic.
-
Grantha
Var v1.0Grantha is the set of Unicode characters in script Grantha.
-
GraphicRanges
Var v1.0GraphicRanges defines the set of graphic characters according to Unicode.
-
Greek
Var v1.0Greek is the set of Unicode characters in script Greek.
-
Gujarati
Var v1.0Gujarati is the set of Unicode characters in script Gujarati.
-
Gunjala_Gondi
Var v1.0Gunjala_Gondi is the set of Unicode characters in script Gunjala_Gondi.
-
Gurmukhi
Var v1.0Gurmukhi is the set of Unicode characters in script Gurmukhi.
-
Han
Var v1.0Han is the set of Unicode characters in script Han.
-
Hangul
Var v1.0Hangul is the set of Unicode characters in script Hangul.
-
Hanifi_Rohingya
Var v1.0Hanifi_Rohingya is the set of Unicode characters in script Hanifi_Rohingya.
-
Hanunoo
Var v1.0Hanunoo is the set of Unicode characters in script Hanunoo.
-
Hatran
Var v1.0Hatran is the set of Unicode characters in script Hatran.
-
Hebrew
Var v1.0Hebrew is the set of Unicode characters in script Hebrew.
-
Hex_Digit
Var v1.0Hex_Digit is the set of Unicode characters with property Hex_Digit.
-
Hiragana
Var v1.0Hiragana is the set of Unicode characters in script Hiragana.
-
Hyphen
Var v1.0Hyphen is the set of Unicode characters with property Hyphen.
-
IDS_Binary_Operator
Var v1.0IDS_Binary_Operator is the set of Unicode characters with property IDS_Binary_Operator.
-
IDS_Trinary_Operator
Var v1.0IDS_Trinary_Operator is the set of Unicode characters with property IDS_Trinary_Operator.
-
Ideographic
Var v1.0Ideographic is the set of Unicode characters with property Ideographic.
-
Imperial_Aramaic
Var v1.0Imperial_Aramaic is the set of Unicode characters in script Imperial_Aramaic.
-
Inherited
Var v1.0Inherited is the set of Unicode characters in script Inherited.
-
Inscriptional_Pahlavi
Var v1.0Inscriptional_Pahlavi is the set of Unicode characters in script Inscriptional_Pahlavi.
-
Inscriptional_Parthian
Var v1.0Inscriptional_Parthian is the set of Unicode characters in script Inscriptional_Parthian.
-
Javanese
Var v1.0Javanese is the set of Unicode characters in script Javanese.
-
Join_Control
Var v1.0Join_Control is the set of Unicode characters with property Join_Control.
-
Kaithi
Var v1.0Kaithi is the set of Unicode characters in script Kaithi.
-
Kannada
Var v1.0Kannada is the set of Unicode characters in script Kannada.
-
Katakana
Var v1.0Katakana is the set of Unicode characters in script Katakana.
-
Kayah_Li
Var v1.0Kayah_Li is the set of Unicode characters in script Kayah_Li.
-
Kharoshthi
Var v1.0Kharoshthi is the set of Unicode characters in script Kharoshthi.
-
Khitan_Small_Script
Var v1.0Khitan_Small_Script is the set of Unicode characters in script Khitan_Small_Script.
-
Khmer
Var v1.0Khmer is the set of Unicode characters in script Khmer.
-
Khojki
Var v1.0Khojki is the set of Unicode characters in script Khojki.
-
Khudawadi
Var v1.0Khudawadi is the set of Unicode characters in script Khudawadi.
-
L
Var v1.0 L is the set of Unicode letters, category L; it is the same table as Letter and has type *RangeTable.
-
Lao
Var v1.0Lao is the set of Unicode characters in script Lao.
-
Latin
Var v1.0Latin is the set of Unicode characters in script Latin.
-
Lepcha
Var v1.0Lepcha is the set of Unicode characters in script Lepcha.
-
Letter
Var v1.0Letter/L is the set of Unicode letters, category L.
-
Limbu
Var v1.0Limbu is the set of Unicode characters in script Limbu.
-
Linear_A
Var v1.0Linear_A is the set of Unicode characters in script Linear_A.
-
Linear_B
Var v1.0Linear_B is the set of Unicode characters in script Linear_B.
-
Lisu
Var v1.0Lisu is the set of Unicode characters in script Lisu.
-
Ll
Var v1.0Ll is the set of Unicode characters in category Ll (Letter, lowercase).
-
Lm
Var v1.0Lm is the set of Unicode characters in category Lm (Letter, modifier).
-
Lo
Var v1.0Lo is the set of Unicode characters in category Lo (Letter, other).
-
Logical_Order_Exception
Var v1.0Logical_Order_Exception is the set of Unicode characters with property Logical_Order_Exception.
-
Lower
Var v1.0Lower is the set of Unicode lower case letters.
-
Lt
Var v1.0Lt is the set of Unicode characters in category Lt (Letter, titlecase).
-
Lu
Var v1.0Lu is the set of Unicode characters in category Lu (Letter, uppercase).
-
Lycian
Var v1.0Lycian is the set of Unicode characters in script Lycian.
-
Lydian
Var v1.0Lydian is the set of Unicode characters in script Lydian.
-
M
Var v1.0 M is the set of Unicode mark characters, category M; it is the same table as Mark and has type *RangeTable.
-
Mahajani
Var v1.0Mahajani is the set of Unicode characters in script Mahajani.
-
Makasar
Var v1.0Makasar is the set of Unicode characters in script Makasar.
-
Malayalam
Var v1.0Malayalam is the set of Unicode characters in script Malayalam.
-
Mandaic
Var v1.0Mandaic is the set of Unicode characters in script Mandaic.
-
Manichaean
Var v1.0Manichaean is the set of Unicode characters in script Manichaean.
-
Marchen
Var v1.0Marchen is the set of Unicode characters in script Marchen.
-
Mark
Var v1.0Mark/M is the set of Unicode mark characters, category M.
-
Masaram_Gondi
Var v1.0Masaram_Gondi is the set of Unicode characters in script Masaram_Gondi.
-
Mc
Var v1.0Mc is the set of Unicode characters in category Mc (Mark, spacing combining).
-
Me
Var v1.0Me is the set of Unicode characters in category Me (Mark, enclosing).
-
Medefaidrin
Var v1.0Medefaidrin is the set of Unicode characters in script Medefaidrin.
-
Meetei_Mayek
Var v1.0Meetei_Mayek is the set of Unicode characters in script Meetei_Mayek.
-
Mende_Kikakui
Var v1.0Mende_Kikakui is the set of Unicode characters in script Mende_Kikakui.
-
Meroitic_Cursive
Var v1.0Meroitic_Cursive is the set of Unicode characters in script Meroitic_Cursive.
-
Meroitic_Hieroglyphs
Var v1.0Meroitic_Hieroglyphs is the set of Unicode characters in script Meroitic_Hieroglyphs.
-
Miao
Var v1.0Miao is the set of Unicode characters in script Miao.
-
Mn
Var v1.0Mn is the set of Unicode characters in category Mn (Mark, nonspacing).
-
Modi
Var v1.0Modi is the set of Unicode characters in script Modi.
-
Mongolian
Var v1.0Mongolian is the set of Unicode characters in script Mongolian.
-
Mro
Var v1.0Mro is the set of Unicode characters in script Mro.
-
Multani
Var v1.0Multani is the set of Unicode characters in script Multani.
-
Myanmar
Var v1.0Myanmar is the set of Unicode characters in script Myanmar.
-
N
Var v1.0 N is the set of Unicode number characters, category N; it is the same table as Number and has type *RangeTable.
-
Nabataean
Var v1.0Nabataean is the set of Unicode characters in script Nabataean.
-
Nandinagari
Var v1.0Nandinagari is the set of Unicode characters in script Nandinagari.
-
Nd
Var v1.0Nd is the set of Unicode characters in category Nd (Number, decimal digit).
-
New_Tai_Lue
Var v1.0New_Tai_Lue is the set of Unicode characters in script New_Tai_Lue.
-
Newa
Var v1.0Newa is the set of Unicode characters in script Newa.
-
Nko
Var v1.0Nko is the set of Unicode characters in script Nko.
-
Nl
Var v1.0Nl is the set of Unicode characters in category Nl (Number, letter).
-
No
Var v1.0No is the set of Unicode characters in category No (Number, other).
-
Noncharacter_Code_Point
Var v1.0Noncharacter_Code_Point is the set of Unicode characters with property Noncharacter_Code_Point.
-
Number
Var v1.0Number/N is the set of Unicode number characters, category N.
-
Nushu
Var v1.0Nushu is the set of Unicode characters in script Nushu.
-
Nyiakeng_Puachue_Hmong
Var v1.0Nyiakeng_Puachue_Hmong is the set of Unicode characters in script Nyiakeng_Puachue_Hmong.
-
Ogham
Var v1.0Ogham is the set of Unicode characters in script Ogham.
-
Ol_Chiki
Var v1.0Ol_Chiki is the set of Unicode characters in script Ol_Chiki.
-
Old_Hungarian
Var v1.0Old_Hungarian is the set of Unicode characters in script Old_Hungarian.
-
Old_Italic
Var v1.0Old_Italic is the set of Unicode characters in script Old_Italic.
-
Old_North_Arabian
Var v1.0Old_North_Arabian is the set of Unicode characters in script Old_North_Arabian.
-
Old_Permic
Var v1.0Old_Permic is the set of Unicode characters in script Old_Permic.
-
Old_Persian
Var v1.0Old_Persian is the set of Unicode characters in script Old_Persian.
-
Old_Sogdian
Var v1.0Old_Sogdian is the set of Unicode characters in script Old_Sogdian.
-
Old_South_Arabian
Var v1.0Old_South_Arabian is the set of Unicode characters in script Old_South_Arabian.
-
Old_Turkic
Var v1.0Old_Turkic is the set of Unicode characters in script Old_Turkic.
-
Oriya
Var v1.0Oriya is the set of Unicode characters in script Oriya.
-
Osage
Var v1.0Osage is the set of Unicode characters in script Osage.
-
Osmanya
Var v1.0Osmanya is the set of Unicode characters in script Osmanya.
-
Other
Var v1.0Other/C is the set of Unicode control and special characters, category C.
-
Other_Alphabetic
Var v1.0Other_Alphabetic is the set of Unicode characters with property Other_Alphabetic.
-
Other_Default_Ignorable_Code_Point
Var v1.0Other_Default_Ignorable_Code_Point is the set of Unicode characters with property Other_Default_Ignorable_Code_Point.
-
Other_Grapheme_Extend
Var v1.0Other_Grapheme_Extend is the set of Unicode characters with property Other_Grapheme_Extend.
-
Other_ID_Continue
Var v1.0Other_ID_Continue is the set of Unicode characters with property Other_ID_Continue.
-
Other_ID_Start
Var v1.0Other_ID_Start is the set of Unicode characters with property Other_ID_Start.
-
Other_Lowercase
Var v1.0Other_Lowercase is the set of Unicode characters with property Other_Lowercase.
-
Other_Math
Var v1.0Other_Math is the set of Unicode characters with property Other_Math.
-
Other_Uppercase
Var v1.0Other_Uppercase is the set of Unicode characters with property Other_Uppercase.
-
P
Var v1.0 P is the set of Unicode punctuation characters, category P; it is the same table as Punct and has type *RangeTable.
-
Pahawh_Hmong
Var v1.0Pahawh_Hmong is the set of Unicode characters in script Pahawh_Hmong.
-
Palmyrene
Var v1.0Palmyrene is the set of Unicode characters in script Palmyrene.
-
Pattern_Syntax
Var v1.0Pattern_Syntax is the set of Unicode characters with property Pattern_Syntax.
-
Pattern_White_Space
Var v1.0Pattern_White_Space is the set of Unicode characters with property Pattern_White_Space.
-
Pau_Cin_Hau
Var v1.0Pau_Cin_Hau is the set of Unicode characters in script Pau_Cin_Hau.
-
Pc
Var v1.0Pc is the set of Unicode characters in category Pc (Punctuation, connector).
-
Pd
Var v1.0Pd is the set of Unicode characters in category Pd (Punctuation, dash).
-
Pe
Var v1.0Pe is the set of Unicode characters in category Pe (Punctuation, close).
-
Pf
Var v1.0Pf is the set of Unicode characters in category Pf (Punctuation, final quote).
-
Phags_Pa
Var v1.0Phags_Pa is the set of Unicode characters in script Phags_Pa.
-
Phoenician
Var v1.0Phoenician is the set of Unicode characters in script Phoenician.
-
Pi
Var v1.0Pi is the set of Unicode characters in category Pi (Punctuation, initial quote).
-
Po
Var v1.0Po is the set of Unicode characters in category Po (Punctuation, other).
-
Prepended_Concatenation_Mark
Var v1.0Prepended_Concatenation_Mark is the set of Unicode characters with property Prepended_Concatenation_Mark.
-
PrintRanges
Var v1.0PrintRanges defines the set of printable characters according to Go.
ASCII space, U+0020, is handled separately.
-
Properties
Var v1.0Properties is the set of Unicode property tables.
-
Ps
Var v1.0Ps is the set of Unicode characters in category Ps (Punctuation, open).
-
Psalter_Pahlavi
Var v1.0Psalter_Pahlavi is the set of Unicode characters in script Psalter_Pahlavi.
-
Punct
Var v1.0Punct/P is the set of Unicode punctuation characters, category P.
-
Quotation_Mark
Var v1.0Quotation_Mark is the set of Unicode characters with property Quotation_Mark.
-
Radical
Var v1.0Radical is the set of Unicode characters with property Radical.
-
Regional_Indicator
Var v1.0Regional_Indicator is the set of Unicode characters with property Regional_Indicator.
-
Rejang
Var v1.0Rejang is the set of Unicode characters in script Rejang.
-
Runic
Var v1.0Runic is the set of Unicode characters in script Runic.
-
S
Var v1.0 S is the set of Unicode symbol characters, category S; it is the same table as Symbol and has type *RangeTable.
-
STerm
Var v1.0STerm is an alias for Sentence_Terminal.
-
Samaritan
Var v1.0Samaritan is the set of Unicode characters in script Samaritan.
-
Saurashtra
Var v1.0Saurashtra is the set of Unicode characters in script Saurashtra.
-
Sc
Var v1.0Sc is the set of Unicode characters in category Sc (Symbol, currency).
-
Scripts
Var v1.0Scripts is the set of Unicode script tables.
-
Sentence_Terminal
Var v1.0Sentence_Terminal is the set of Unicode characters with property Sentence_Terminal.
-
Sharada
Var v1.0Sharada is the set of Unicode characters in script Sharada.
-
Shavian
Var v1.0Shavian is the set of Unicode characters in script Shavian.
-
Siddham
Var v1.0Siddham is the set of Unicode characters in script Siddham.
-
SignWriting
Var v1.0SignWriting is the set of Unicode characters in script SignWriting.
-
Sinhala
Var v1.0Sinhala is the set of Unicode characters in script Sinhala.
-
Sk
Var v1.0Sk is the set of Unicode characters in category Sk (Symbol, modifier).
-
Sm
Var v1.0Sm is the set of Unicode characters in category Sm (Symbol, math).
-
So
Var v1.0So is the set of Unicode characters in category So (Symbol, other).
-
Soft_Dotted
Var v1.0Soft_Dotted is the set of Unicode characters with property Soft_Dotted.
-
Sogdian
Var v1.0Sogdian is the set of Unicode characters in script Sogdian.
-
Sora_Sompeng
Var v1.0Sora_Sompeng is the set of Unicode characters in script Sora_Sompeng.
-
Soyombo
Var v1.0Soyombo is the set of Unicode characters in script Soyombo.
-
Space
Var v1.0Space/Z is the set of Unicode space characters, category Z.
-
Sundanese
Var v1.0Sundanese is the set of Unicode characters in script Sundanese.
-
Syloti_Nagri
Var v1.0Syloti_Nagri is the set of Unicode characters in script Syloti_Nagri.
-
Symbol
Var v1.0Symbol/S is the set of Unicode symbol characters, category S.
-
Syriac
Var v1.0Syriac is the set of Unicode characters in script Syriac.
-
Tagalog
Var v1.0Tagalog is the set of Unicode characters in script Tagalog.
-
Tagbanwa
Var v1.0Tagbanwa is the set of Unicode characters in script Tagbanwa.
-
Tai_Le
Var v1.0Tai_Le is the set of Unicode characters in script Tai_Le.
-
Tai_Tham
Var v1.0Tai_Tham is the set of Unicode characters in script Tai_Tham.
-
Tai_Viet
Var v1.0Tai_Viet is the set of Unicode characters in script Tai_Viet.
-
Takri
Var v1.0Takri is the set of Unicode characters in script Takri.
-
Tamil
Var v1.0Tamil is the set of Unicode characters in script Tamil.
-
Tangut
Var v1.0Tangut is the set of Unicode characters in script Tangut.
-
Telugu
Var v1.0Telugu is the set of Unicode characters in script Telugu.
-
Terminal_Punctuation
Var v1.0Terminal_Punctuation is the set of Unicode characters with property Terminal_Punctuation.
-
Thaana
Var v1.0Thaana is the set of Unicode characters in script Thaana.
-
Thai
Var v1.0Thai is the set of Unicode characters in script Thai.
-
Tibetan
Var v1.0Tibetan is the set of Unicode characters in script Tibetan.
-
Tifinagh
Var v1.0Tifinagh is the set of Unicode characters in script Tifinagh.
-
Tirhuta
Var v1.0Tirhuta is the set of Unicode characters in script Tirhuta.
-
Title
Var v1.0Title is the set of Unicode title case letters.
-
TurkishCase
Var v1.0 TurkishCase is the SpecialCase for Turkish, covering the dotted and dotless forms of the letter I.
-
Ugaritic
Var v1.0Ugaritic is the set of Unicode characters in script Ugaritic.
-
Unified_Ideograph
Var v1.0Unified_Ideograph is the set of Unicode characters with property Unified_Ideograph.
-
Upper
Var v1.0Upper is the set of Unicode upper case letters.
-
Vai
Var v1.0Vai is the set of Unicode characters in script Vai.
-
Variation_Selector
Var v1.0Variation_Selector is the set of Unicode characters with property Variation_Selector.
-
Wancho
Var v1.0Wancho is the set of Unicode characters in script Wancho.
-
Warang_Citi
Var v1.0Warang_Citi is the set of Unicode characters in script Warang_Citi.
-
White_Space
Var v1.0White_Space is the set of Unicode characters with property White_Space.
-
Yezidi
Var v1.0Yezidi is the set of Unicode characters in script Yezidi.
-
Yi
Var v1.0Yi is the set of Unicode characters in script Yi.
-
Z
Var v1.0 Z is the set of Unicode space characters, category Z; it is the same table as Space and has type *RangeTable.
-
Zanabazar_Square
Var v1.0Zanabazar_Square is the set of Unicode characters in script Zanabazar_Square.
-
Zl
Var v1.0Zl is the set of Unicode characters in category Zl (Separator, line).
-
Zp
Var v1.0Zp is the set of Unicode characters in category Zp (Separator, paragraph).
-
Zs
Var v1.0Zs is the set of Unicode characters in category Zs (Separator, space).
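The script, category, and property variables above are range tables meant to be passed to Is or In. The following Go sketch shows one way to do that, both by naming tables directly and by looking them up by name in Scripts. The helpers scriptOf and isCJK, and the particular scripts they check, are illustrative assumptions rather than anything this namespace prescribes.

```go
package main

import (
	"fmt"
	"unicode"
)

// scriptOf returns the name of the first candidate script whose table
// contains r, or "" if none matches. Names are looked up in
// unicode.Scripts; unknown names are skipped.
func scriptOf(r rune, candidates ...string) string {
	for _, name := range candidates {
		if table, ok := unicode.Scripts[name]; ok && unicode.Is(table, r) {
			return name
		}
	}
	return ""
}

// isCJK reports whether r belongs to any of a few East Asian scripts.
// The choice of scripts is an example, not an official definition.
func isCJK(r rune) bool {
	return unicode.In(r, unicode.Han, unicode.Hiragana, unicode.Katakana, unicode.Hangul)
}

func main() {
	for _, r := range "aДか漢" {
		name := scriptOf(r, "Latin", "Cyrillic", "Hiragana", "Han")
		fmt.Printf("%q script=%s cjk=%v\n", r, name, isCJK(r))
	}
}
```

Digits and most punctuation belong to the Common script, so scriptOf returns "" for them unless "Common" is among the candidates.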
Functions, Macros, and Special Forms
-
In
Function v1.0(In r & ranges)
In reports whether the rune is a member of one of the ranges.
Go input arguments: (r rune, ranges ...*RangeTable)
Go returns: bool
Joker input arguments: [^Char r, & ^*RangeTable ranges]
Joker returns: ^Boolean
-
Is
Function v1.0(Is rangeTab r)
Is reports whether the rune is in the specified table of ranges.
Go input arguments: (rangeTab *RangeTable, r rune)
Go returns: bool
Joker input arguments: [^*RangeTable rangeTab, ^Char r]
Joker returns: ^Boolean
-
IsControl
Function v1.0(IsControl r)
IsControl reports whether the rune is a control character.
The C (Other) Unicode category includes more code points
such as surrogates; use Is(C, r) to test for them.
Go input arguments: (r rune)
Go returns: bool
Joker input arguments: [^Char r]
Joker returns: ^Boolean
-
IsDigit
Function v1.0(IsDigit r)
IsDigit reports whether the rune is a decimal digit.
Go input arguments: (r rune)
Go returns: bool
Joker input arguments: [^Char r]
Joker returns: ^Boolean
-
IsGraphic
Function v1.0(IsGraphic r)
IsGraphic reports whether the rune is defined as a Graphic by Unicode.
Such characters include letters, marks, numbers, punctuation, symbols, and
spaces, from categories L, M, N, P, S, Zs.
Go input arguments: (r rune)
Go returns: bool
Joker input arguments: [^Char r]
Joker returns: ^Boolean
-
IsLetter
Function v1.0(IsLetter r)
IsLetter reports whether the rune is a letter (category L).
Go input arguments: (r rune)
Go returns: bool
Joker input arguments: [^Char r]
Joker returns: ^Boolean
-
IsLower
Function v1.0(IsLower r)
IsLower reports whether the rune is a lower case letter.
Go input arguments: (r rune)
Go returns: bool
Joker input arguments: [^Char r]
Joker returns: ^Boolean
-
IsMark
Function v1.0(IsMark r)
IsMark reports whether the rune is a mark character (category M).
Go input arguments: (r rune)
Go returns: bool
Joker input arguments: [^Char r]
Joker returns: ^Boolean
-
IsNumber
Function v1.0(IsNumber r)
IsNumber reports whether the rune is a number (category N).
Go input arguments: (r rune)
Go returns: bool
Joker input arguments: [^Char r]
Joker returns: ^Boolean
-
IsOneOf
Function v1.0(IsOneOf ranges r)
IsOneOf reports whether the rune is a member of one of the ranges.
The function "In" provides a nicer signature and should be used in preference to IsOneOf.
Go input arguments: (ranges []*RangeTable, r rune)
Go returns: bool
Joker input arguments: [^arrayOf*RangeTable ranges, ^Char r]
Joker returns: ^Boolean
-
IsPrint
Function v1.0(IsPrint r)
IsPrint reports whether the rune is defined as printable by Go. Such
characters include letters, marks, numbers, punctuation, and symbols
(categories L, M, N, P, S) plus the ASCII space character. This
categorization is the same as IsGraphic except that the only spacing
character is ASCII space, U+0020.
Go input arguments: (r rune)
Go returns: bool
Joker input arguments: [^Char r]
Joker returns: ^Boolean
-
IsPunct
Function v1.0(IsPunct r)
IsPunct reports whether the rune is a Unicode punctuation character
(category P).
Go input arguments: (r rune)
Go returns: bool
Joker input arguments: [^Char r]
Joker returns: ^Boolean
-
IsSpace
Function v1.0(IsSpace r)
IsSpace reports whether the rune is a space character as defined
by Unicode's White Space property; in the Latin-1 space this is
'\t', '\n', '\v', '\f', '\r', ' ', U+0085 (NEL), U+00A0 (NBSP).
Other definitions of spacing characters are set by category
Z and property Pattern_White_Space.
Go input arguments: (r rune)
Go returns: bool
Joker input arguments: [^Char r]
Joker returns: ^Boolean
-
IsSymbol
Function v1.0(IsSymbol r)
IsSymbol reports whether the rune is a symbolic character (category S).
Go input arguments: (r rune)
Go returns: bool
Joker input arguments: [^Char r]
Joker returns: ^Boolean
-
IsTitle
Function v1.0(IsTitle r)
IsTitle reports whether the rune is a title case letter.
Go input arguments: (r rune)
Go returns: bool
Joker input arguments: [^Char r]
Joker returns: ^Boolean
-
IsUpper
Function v1.0(IsUpper r)
IsUpper reports whether the rune is an upper case letter.
Go input arguments: (r rune)
Go returns: bool
Joker input arguments: [^Char r]
Joker returns: ^Boolean
-
SimpleFold
Function v1.0(SimpleFold r)
SimpleFold iterates over Unicode code points equivalent under
the Unicode-defined simple case folding. Among the code points
equivalent to rune (including rune itself), SimpleFold returns the
smallest rune > r if one exists, or else the smallest rune >= 0.
If r is not a valid Unicode code point, SimpleFold(r) returns r.
For example:
SimpleFold('A') = 'a'
SimpleFold('a') = 'A'
SimpleFold('K') = 'k'
SimpleFold('k') = '\u212A' (Kelvin symbol, K)
SimpleFold('\u212A') = 'K'
SimpleFold('1') = '1'
SimpleFold(-2) = -2
Go input arguments: (r rune)
Go returns: rune
Joker input arguments: [^Char r]
Joker returns: ^Char
-
To
Function v1.0(To _case r)
To maps the rune to the specified case: UpperCase, LowerCase, or TitleCase.
Go input arguments: (_case int, r rune)
Go returns: rune
Joker input arguments: [^Int _case, ^Char r]
Joker returns: ^Char
-
ToLower
Function v1.0(ToLower r)
ToLower maps the rune to lower case.
Go input arguments: (r rune)
Go returns: rune
Joker input arguments: [^Char r]
Joker returns: ^Char
-
ToTitle
Function v1.0(ToTitle r)
ToTitle maps the rune to title case.
Go input arguments: (r rune)
Go returns: rune
Joker input arguments: [^Char r]
Joker returns: ^Char
-
ToUpper
Function v1.0(ToUpper r)
ToUpper maps the rune to upper case.
Go input arguments: (r rune)
Go returns: rune
Joker input arguments: [^Char r]
Joker returns: ^Char
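SimpleFold is described above as an iterator over equivalent code points, which is easiest to see in a loop. The Go sketch below collects the full folding orbit of a rune by calling SimpleFold until it returns to the starting point, then uses that orbit for a case-insensitive comparison of single runes. foldOrbit and equalFold are hypothetical helpers written for this illustration, and this is only one of several ways to compare runes case-insensitively.

```go
package main

import (
	"fmt"
	"unicode"
)

// foldOrbit returns every code point equivalent to r under simple case
// folding, starting with r itself. SimpleFold cycles through the
// equivalents, so the loop stops when it arrives back at r.
func foldOrbit(r rune) []rune {
	orbit := []rune{r}
	for next := unicode.SimpleFold(r); next != r; next = unicode.SimpleFold(next) {
		orbit = append(orbit, next)
	}
	return orbit
}

// equalFold reports whether b appears in the folding orbit of a.
func equalFold(a, b rune) bool {
	for _, r := range foldOrbit(a) {
		if r == b {
			return true
		}
	}
	return false
}

func main() {
	fmt.Printf("%q\n", foldOrbit('k'))
	fmt.Println(equalFold('K', '\u212A'), equalFold('1', 'l'))
}
```

For a rune with no case variants, or an invalid code point, SimpleFold returns its argument, so the orbit has a single element.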
Types
-
*CaseRange
Concrete Type v1.0CaseRange represents a range of Unicode code points for simple (one
code point to one code point) case conversion.
The range runs from Lo to Hi inclusive, with a fixed stride of 1. Deltas
are the number to add to the code point to reach the code point for a
different case for that character. They may be negative. If zero, it
means the character is in the corresponding case. There is a special
case representing sequences of alternating corresponding Upper and Lower
pairs. It appears with a fixed Delta of
{UpperLower, UpperLower, UpperLower}
The constant UpperLower has an otherwise impossible delta value.
-
*Range16
Concrete Type v1.0
Range16 represents a range of 16-bit Unicode code points. The range runs from Lo to Hi
inclusive and has the specified stride.
-
*Range32
Concrete Type v1.0
Range32 represents a range of Unicode code points and is used when one or
more of the values will not fit in 16 bits. The range runs from Lo to Hi
inclusive and has the specified stride. Lo and Hi must always be >= 1<<16.
-
*RangeTable
Concrete Type v1.0
RangeTable defines a set of Unicode code points by listing the ranges of
code points within the set. The ranges are listed in two slices
to save space: a slice of 16-bit ranges and a slice of 32-bit ranges.
The two slices must be in sorted order and non-overlapping.
Also, R32 should contain only values >= 0x10000 (1<<16).
-
*SpecialCase
Concrete Type v1.0
SpecialCase represents language-specific case mappings such as Turkish.
Methods of SpecialCase customize (by overriding) the standard mappings.
-
CaseRange
Concrete Type v1.0
CaseRange represents a range of Unicode code points for simple (one
code point to one code point) case conversion.
The range runs from Lo to Hi inclusive, with a fixed stride of 1. Each delta
is the number to add to the code point to reach the code point for a
different case of that character. Deltas may be negative. A zero delta
means the character is already in the corresponding case. There is a special
case representing sequences of alternating corresponding Upper and Lower
pairs. It appears with a fixed Delta of
{UpperLower, UpperLower, UpperLower}
The constant UpperLower has an otherwise impossible delta value.
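
To make the delta arithmetic concrete, here is a minimal Go sketch that applies a single CaseRange to a rune, including the UpperLower special case. The function applyCaseRange and the two sample ranges are illustrative assumptions written for this page; the library performs the equivalent lookup internally over its own tables. The sketch assumes the Delta array is indexed by the case constants UpperCase, LowerCase, and TitleCase.

```go
package main

import (
	"fmt"
	"unicode"
)

// asciiLower is an illustrative range covering 'a'..'z': adding -32 reaches
// the upper and title case, and the zero lower-case delta marks these code
// points as already lower case.
var asciiLower = unicode.CaseRange{
	Lo:    'a',
	Hi:    'z',
	Delta: [unicode.MaxCase]rune{-32, 0, -32},
}

// latinPairs is an illustrative range covering U+0100..U+012F, where upper
// and lower case letters alternate, so it uses the UpperLower marker.
var latinPairs = unicode.CaseRange{
	Lo:    0x0100,
	Hi:    0x012F,
	Delta: [unicode.MaxCase]rune{unicode.UpperLower, unicode.UpperLower, unicode.UpperLower},
}

// applyCaseRange is a hypothetical helper that maps r to the requested case
// using only the given range. Runes outside the range are returned unchanged.
func applyCaseRange(cr unicode.CaseRange, _case int, r rune) rune {
	if r < rune(cr.Lo) || r > rune(cr.Hi) {
		return r
	}
	delta := cr.Delta[_case]
	if delta > unicode.MaxRune {
		// UpperLower: even offsets from Lo are upper case and odd offsets are
		// lower case, so the target is chosen by the low bit of the offset.
		offset := r - rune(cr.Lo)
		if _case == unicode.LowerCase {
			return rune(cr.Lo) + (offset | 1)
		}
		return rune(cr.Lo) + (offset &^ 1)
	}
	return r + delta
}

func main() {
	fmt.Printf("%c\n", applyCaseRange(asciiLower, unicode.UpperCase, 'q'))
	fmt.Printf("%U\n", applyCaseRange(latinPairs, unicode.LowerCase, 0x0100))
}
```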
-
Range16
Concrete Type v1.0
Range16 represents a range of 16-bit Unicode code points. The range runs from Lo to Hi
inclusive and has the specified stride.
-
Range32
Concrete Type v1.0
Range32 represents a range of Unicode code points and is used when one or
more of the values will not fit in 16 bits. The range runs from Lo to Hi
inclusive and has the specified stride. Lo and Hi must always be >= 1<<16.
-
RangeTable
Concrete Type v1.0
RangeTable defines a set of Unicode code points by listing the ranges of
code points within the set. The ranges are listed in two slices
to save space: a slice of 16-bit ranges and a slice of 32-bit ranges.
The two slices must be in sorted order and non-overlapping.
Also, R32 should contain only values >= 0x10000 (1<<16).
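
A custom table can be assembled by hand and queried with Is, as in the following Go sketch. The table sampleDigits and the helper inSample are hypothetical. The LatinOffset field, which the description above does not cover, is set here on the understanding that it counts the R16 entries whose Hi is at most MaxLatin1.

```go
package main

import (
	"fmt"
	"unicode"
)

// sampleDigits is an illustrative table: two sorted, non-overlapping 16-bit
// ranges, plus one 32-bit range whose values are all >= 0x10000.
var sampleDigits = &unicode.RangeTable{
	R16: []unicode.Range16{
		{Lo: 0x0030, Hi: 0x0039, Stride: 1}, // ASCII digits
		{Lo: 0x0660, Hi: 0x0669, Stride: 1}, // Arabic-Indic digits
	},
	R32: []unicode.Range32{
		{Lo: 0x1D7CE, Hi: 0x1D7FF, Stride: 1}, // mathematical digits
	},
	LatinOffset: 1, // assumption: one R16 entry has Hi <= unicode.MaxLatin1
}

// inSample is a hypothetical helper reporting whether r is in sampleDigits.
func inSample(r rune) bool {
	return unicode.Is(sampleDigits, r)
}

func main() {
	for _, r := range []rune{'7', 0x0665, 0x1D7D0, 'x'} {
		fmt.Printf("%U in table: %v\n", r, inSample(r))
	}
}
```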
-
SpecialCase
Concrete Type v1.0
SpecialCase represents language-specific case mappings such as Turkish.
Methods of SpecialCase customize (by overriding) the standard mappings.
-
ToLower
Receiver for SpecialCase v1.0([r])
ToLower maps the rune to lower case giving priority to the special mapping.
-
ToTitle
Receiver for SpecialCase v1.0([r])
ToTitle maps the rune to title case giving priority to the special mapping.
-
ToUpper
Receiver for SpecialCase v1.0([r])
ToUpper maps the rune to upper case giving priority to the special mapping.
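
The Go sketch below shows the effect of the special mapping taking priority, using the package's predefined TurkishCase value. The wrapper names turkishUpper and turkishLower are hypothetical.

```go
package main

import (
	"fmt"
	"unicode"
)

// turkishUpper and turkishLower are hypothetical wrappers around the
// SpecialCase methods of the predefined unicode.TurkishCase value.
func turkishUpper(r rune) rune { return unicode.TurkishCase.ToUpper(r) }
func turkishLower(r rune) rune { return unicode.TurkishCase.ToLower(r) }

func main() {
	// Standard mapping: 'i' upper-cases to 'I'.
	fmt.Printf("standard upper of i: %U\n", unicode.ToUpper('i'))
	// Turkish mapping: 'i' upper-cases to dotted capital I (U+0130).
	fmt.Printf("turkish upper of i:  %U\n", turkishUpper('i'))
	// Turkish mapping: 'I' lower-cases to dotless i (U+0131).
	fmt.Printf("turkish lower of I:  %U\n", turkishLower('I'))
	// Runes without a special mapping fall back to the standard one.
	fmt.Printf("turkish upper of a:  %U\n", turkishUpper('a'))
}
```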
-
arrayOfCaseRange
Concrete Type v1.0
CaseRange represents a range of Unicode code points for simple (one
code point to one code point) case conversion.
The range runs from Lo to Hi inclusive, with a fixed stride of 1. Each delta
is the number to add to the code point to reach the code point for a
different case of that character. Deltas may be negative. A zero delta
means the character is already in the corresponding case. There is a special
case representing sequences of alternating corresponding Upper and Lower
pairs. It appears with a fixed Delta of
{UpperLower, UpperLower, UpperLower}
The constant UpperLower has an otherwise impossible delta value.
-
arrayOfRange16
Concrete Type v1.0
Range16 represents a range of 16-bit Unicode code points. The range runs from Lo to Hi
inclusive and has the specified stride.
-
arrayOfRange32
Concrete Type v1.0
Range32 represents a range of Unicode code points and is used when one or
more of the values will not fit in 16 bits. The range runs from Lo to Hi
inclusive and has the specified stride. Lo and Hi must always be >= 1<<16.
-
arrayOfRangeTable
Concrete Type v1.0
RangeTable defines a set of Unicode code points by listing the ranges of
code points within the set. The ranges are listed in two slices
to save space: a slice of 16-bit ranges and a slice of 32-bit ranges.
The two slices must be in sorted order and non-overlapping.
Also, R32 should contain only values >= 0x10000 (1<<16).
-
arrayOfSpecialCase
Concrete Type v1.0
SpecialCase represents language-specific case mappings such as Turkish.
Methods of SpecialCase customize (by overriding) the standard mappings.