Namespace: go.std.unicode

v1.0

Summary
Index
Constants
Variables
Functions, Macros, and Special Forms
Types

Summary

Provides a low-level interface to the unicode package.

Package unicode provides data and functions to test some properties of
Unicode code points.

Index

*CaseRange
*Range16
*Range32
*RangeTable
*SpecialCase
ASCII_Hex_Digit
Adlam
Ahom
Anatolian_Hieroglyphs
Arabic
Armenian
Avestan
AzeriCase
Balinese
Bamum
Bassa_Vah
Batak
Bengali
Bhaiksuki
Bidi_Control
Bopomofo
Brahmi
Braille
Buginese
Buhid
C
Canadian_Aboriginal
Carian
CaseRange
CaseRanges
Categories
Caucasian_Albanian
Cc
Cf
Chakma
Cham
Cherokee
Chorasmian
Co
Common
Coptic
Cs
Cuneiform
Cypriot
Cyrillic
Dash
Deprecated
Deseret
Devanagari
Diacritic
Digit
Dives_Akuru
Dogra
Duployan
Egyptian_Hieroglyphs
Elbasan
Elymaic
Ethiopic
Extender
FoldCategory
FoldScript
Georgian
Glagolitic
Gothic
Grantha
GraphicRanges
Greek
Gujarati
Gunjala_Gondi
Gurmukhi
Han
Hangul
Hanifi_Rohingya
Hanunoo
Hatran
Hebrew
Hex_Digit
Hiragana
Hyphen
IDS_Binary_Operator
IDS_Trinary_Operator
Ideographic
Imperial_Aramaic
In
Inherited
Inscriptional_Pahlavi
Inscriptional_Parthian
Is
IsControl
IsDigit
IsGraphic
IsLetter
IsLower
IsMark
IsNumber
IsOneOf
IsPrint
IsPunct
IsSpace
IsSymbol
IsTitle
IsUpper
Javanese
Join_Control
Kaithi
Kannada
Katakana
Kayah_Li
Kharoshthi
Khitan_Small_Script
Khmer
Khojki
Khudawadi
L
Lao
Latin
Lepcha
Letter
Limbu
Linear_A
Linear_B
Lisu
Ll
Lm
Lo
Logical_Order_Exception
Lower
LowerCase
Lt
Lu
Lycian
Lydian
M
Mahajani
Makasar
Malayalam
Mandaic
Manichaean
Marchen
Mark
Masaram_Gondi
MaxASCII
MaxCase
MaxLatin1
MaxRune
Mc
Me
Medefaidrin
Meetei_Mayek
Mende_Kikakui
Meroitic_Cursive
Meroitic_Hieroglyphs
Miao
Mn
Modi
Mongolian
Mro
Multani
Myanmar
N
Nabataean
Nandinagari
Nd
New_Tai_Lue
Newa
Nko
Nl
No
Noncharacter_Code_Point
Number
Nushu
Nyiakeng_Puachue_Hmong
Ogham
Ol_Chiki
Old_Hungarian
Old_Italic
Old_North_Arabian
Old_Permic
Old_Persian
Old_Sogdian
Old_South_Arabian
Old_Turkic
Oriya
Osage
Osmanya
Other
Other_Alphabetic
Other_Default_Ignorable_Code_Point
Other_Grapheme_Extend
Other_ID_Continue
Other_ID_Start
Other_Lowercase
Other_Math
Other_Uppercase
P
Pahawh_Hmong
Palmyrene
Pattern_Syntax
Pattern_White_Space
Pau_Cin_Hau
Pc
Pd
Pe
Pf
Phags_Pa
Phoenician
Pi
Po
Prepended_Concatenation_Mark
PrintRanges
Properties
Ps
Psalter_Pahlavi
Punct
Quotation_Mark
Radical
Range16
Range32
RangeTable
Regional_Indicator
Rejang
ReplacementChar
Runic
S
STerm
Samaritan
Saurashtra
Sc
Scripts
Sentence_Terminal
Sharada
Shavian
Siddham
SignWriting
SimpleFold
Sinhala
Sk
Sm
So
Soft_Dotted
Sogdian
Sora_Sompeng
Soyombo
Space
SpecialCase
Sundanese
Syloti_Nagri
Symbol
Syriac
Tagalog
Tagbanwa
Tai_Le
Tai_Tham
Tai_Viet
Takri
Tamil
Tangut
Telugu
Terminal_Punctuation
Thaana
Thai
Tibetan
Tifinagh
Tirhuta
Title
TitleCase
To
ToLower
ToTitle
ToUpper
TurkishCase
Ugaritic
Unified_Ideograph
Upper
UpperCase
UpperLower
Vai
Variation_Selector
Version
Wancho
Warang_Citi
White_Space
Yezidi
Yi
Z
Zanabazar_Square
Zl
Zp
Zs
arrayOfCaseRange
arrayOfRange16
arrayOfRange32
arrayOfRangeTable
arrayOfSpecialCase

Legend

Constant

Variable

Function

Macro

Special form

Type

GoVar

Receiver/Method

Constants

Constants are variables with :const true in their metadata. Joker currently does not recognize them as special; as such, it allows redefining them or their values.

LowerCase
Int v1.0
Indices into the Delta arrays inside CaseRanges for case mapping.
MaxASCII
Char v1.0
maximum ASCII value.
MaxCase
Int v1.0
Indices into the Delta arrays inside CaseRanges for case mapping.
MaxLatin1
Char v1.0
maximum Latin-1 value.
MaxRune
Char v1.0
Maximum valid Unicode code point.
ReplacementChar
Char v1.0
Represents invalid code points.
TitleCase
Int v1.0
Indices into the Delta arrays inside CaseRanges for case mapping.
UpperCase
Int v1.0
Indices into the Delta arrays inside CaseRanges for case mapping.
UpperLower
Char v1.0
(Cannot be a valid delta.)
Version
String v1.0
Version is the Unicode edition from which the tables are derived.

Variables

ASCII_Hex_Digit
Var v1.0
ASCII_Hex_Digit is the set of Unicode characters with property ASCII_Hex_Digit.
Adlam
Var v1.0
Adlam is the set of Unicode characters in script Adlam.
Ahom
Var v1.0
Ahom is the set of Unicode characters in script Ahom.
Anatolian_Hieroglyphs
Var v1.0
Anatolian_Hieroglyphs is the set of Unicode characters in script Anatolian_Hieroglyphs.
Arabic
Var v1.0
Arabic is the set of Unicode characters in script Arabic.
Armenian
Var v1.0
Armenian is the set of Unicode characters in script Armenian.
Avestan
Var v1.0
Avestan is the set of Unicode characters in script Avestan.
AzeriCase
Var v1.0
Balinese
Var v1.0
Balinese is the set of Unicode characters in script Balinese.
Bamum
Var v1.0
Bamum is the set of Unicode characters in script Bamum.
Bassa_Vah
Var v1.0
Bassa_Vah is the set of Unicode characters in script Bassa_Vah.
Batak
Var v1.0
Batak is the set of Unicode characters in script Batak.
Bengali
Var v1.0
Bengali is the set of Unicode characters in script Bengali.
Bhaiksuki
Var v1.0
Bhaiksuki is the set of Unicode characters in script Bhaiksuki.
Bidi_Control
Var v1.0
Bidi_Control is the set of Unicode characters with property Bidi_Control.
Bopomofo
Var v1.0
Bopomofo is the set of Unicode characters in script Bopomofo.
Brahmi
Var v1.0
Brahmi is the set of Unicode characters in script Brahmi.
Braille
Var v1.0
Braille is the set of Unicode characters in script Braille.
Buginese
Var v1.0
Buginese is the set of Unicode characters in script Buginese.
Buhid
Var v1.0
Buhid is the set of Unicode characters in script Buhid.
C
Var v1.0
These variables have type *RangeTable.
Canadian_Aboriginal
Var v1.0
Canadian_Aboriginal is the set of Unicode characters in script Canadian_Aboriginal.
Carian
Var v1.0
Carian is the set of Unicode characters in script Carian.
CaseRanges
Var v1.0
CaseRanges is the table describing case mappings for all letters with
non-self mappings.
Categories
Var v1.0
Categories is the set of Unicode category tables.
Caucasian_Albanian
Var v1.0
Caucasian_Albanian is the set of Unicode characters in script Caucasian_Albanian.
Cc
Var v1.0
Cc is the set of Unicode characters in category Cc (Other, control).
Cf
Var v1.0
Cf is the set of Unicode characters in category Cf (Other, format).
Chakma
Var v1.0
Chakma is the set of Unicode characters in script Chakma.
Cham
Var v1.0
Cham is the set of Unicode characters in script Cham.
Cherokee
Var v1.0
Cherokee is the set of Unicode characters in script Cherokee.
Chorasmian
Var v1.0
Chorasmian is the set of Unicode characters in script Chorasmian.
Co
Var v1.0
Co is the set of Unicode characters in category Co (Other, private use).
Common
Var v1.0
Common is the set of Unicode characters in script Common.
Coptic
Var v1.0
Coptic is the set of Unicode characters in script Coptic.
Cs
Var v1.0
Cs is the set of Unicode characters in category Cs (Other, surrogate).
Cuneiform
Var v1.0
Cuneiform is the set of Unicode characters in script Cuneiform.
Cypriot
Var v1.0
Cypriot is the set of Unicode characters in script Cypriot.
Cyrillic
Var v1.0
Cyrillic is the set of Unicode characters in script Cyrillic.
Dash
Var v1.0
Dash is the set of Unicode characters with property Dash.
Deprecated
Var v1.0
Deprecated is the set of Unicode characters with property Deprecated.
Deseret
Var v1.0
Deseret is the set of Unicode characters in script Deseret.
Devanagari
Var v1.0
Devanagari is the set of Unicode characters in script Devanagari.
Diacritic
Var v1.0
Diacritic is the set of Unicode characters with property Diacritic.
Digit
Var v1.0
Digit is the set of Unicode characters with the "decimal digit" property.
Dives_Akuru
Var v1.0
Dives_Akuru is the set of Unicode characters in script Dives_Akuru.
Dogra
Var v1.0
Dogra is the set of Unicode characters in script Dogra.
Duployan
Var v1.0
Duployan is the set of Unicode characters in script Duployan.
Egyptian_Hieroglyphs
Var v1.0
Egyptian_Hieroglyphs is the set of Unicode characters in script Egyptian_Hieroglyphs.
Elbasan
Var v1.0
Elbasan is the set of Unicode characters in script Elbasan.
Elymaic
Var v1.0
Elymaic is the set of Unicode characters in script Elymaic.
Ethiopic
Var v1.0
Ethiopic is the set of Unicode characters in script Ethiopic.
Extender
Var v1.0
Extender is the set of Unicode characters with property Extender.
FoldCategory
Var v1.0
FoldCategory maps a category name to a table of
code points outside the category that are equivalent under
simple case folding to code points inside the category.
If there is no entry for a category name, there are no such points.
FoldScript
Var v1.0
FoldScript maps a script name to a table of
code points outside the script that are equivalent under
simple case folding to code points inside the script.
If there is no entry for a script name, there are no such points.
Georgian
Var v1.0
Georgian is the set of Unicode characters in script Georgian.
Glagolitic
Var v1.0
Glagolitic is the set of Unicode characters in script Glagolitic.
Gothic
Var v1.0
Gothic is the set of Unicode characters in script Gothic.
Grantha
Var v1.0
Grantha is the set of Unicode characters in script Grantha.
GraphicRanges
Var v1.0
GraphicRanges defines the set of graphic characters according to Unicode.
Greek
Var v1.0
Greek is the set of Unicode characters in script Greek.
Gujarati
Var v1.0
Gujarati is the set of Unicode characters in script Gujarati.
Gunjala_Gondi
Var v1.0
Gunjala_Gondi is the set of Unicode characters in script Gunjala_Gondi.
Gurmukhi
Var v1.0
Gurmukhi is the set of Unicode characters in script Gurmukhi.
Han
Var v1.0
Han is the set of Unicode characters in script Han.
Hangul
Var v1.0
Hangul is the set of Unicode characters in script Hangul.
Hanifi_Rohingya
Var v1.0
Hanifi_Rohingya is the set of Unicode characters in script Hanifi_Rohingya.
Hanunoo
Var v1.0
Hanunoo is the set of Unicode characters in script Hanunoo.
Hatran
Var v1.0
Hatran is the set of Unicode characters in script Hatran.
Hebrew
Var v1.0
Hebrew is the set of Unicode characters in script Hebrew.
Hex_Digit
Var v1.0
Hex_Digit is the set of Unicode characters with property Hex_Digit.
Hiragana
Var v1.0
Hiragana is the set of Unicode characters in script Hiragana.
Hyphen
Var v1.0
Hyphen is the set of Unicode characters with property Hyphen.
IDS_Binary_Operator
Var v1.0
IDS_Binary_Operator is the set of Unicode characters with property IDS_Binary_Operator.
IDS_Trinary_Operator
Var v1.0
IDS_Trinary_Operator is the set of Unicode characters with property IDS_Trinary_Operator.
Ideographic
Var v1.0
Ideographic is the set of Unicode characters with property Ideographic.
Imperial_Aramaic
Var v1.0
Imperial_Aramaic is the set of Unicode characters in script Imperial_Aramaic.
Inherited
Var v1.0
Inherited is the set of Unicode characters in script Inherited.
Inscriptional_Pahlavi
Var v1.0
Inscriptional_Pahlavi is the set of Unicode characters in script Inscriptional_Pahlavi.
Inscriptional_Parthian
Var v1.0
Inscriptional_Parthian is the set of Unicode characters in script Inscriptional_Parthian.
Javanese
Var v1.0
Javanese is the set of Unicode characters in script Javanese.
Join_Control
Var v1.0
Join_Control is the set of Unicode characters with property Join_Control.
Kaithi
Var v1.0
Kaithi is the set of Unicode characters in script Kaithi.
Kannada
Var v1.0
Kannada is the set of Unicode characters in script Kannada.
Katakana
Var v1.0
Katakana is the set of Unicode characters in script Katakana.
Kayah_Li
Var v1.0
Kayah_Li is the set of Unicode characters in script Kayah_Li.
Kharoshthi
Var v1.0
Kharoshthi is the set of Unicode characters in script Kharoshthi.
Khitan_Small_Script
Var v1.0
Khitan_Small_Script is the set of Unicode characters in script Khitan_Small_Script.
Khmer
Var v1.0
Khmer is the set of Unicode characters in script Khmer.
Khojki
Var v1.0
Khojki is the set of Unicode characters in script Khojki.
Khudawadi
Var v1.0
Khudawadi is the set of Unicode characters in script Khudawadi.
L
Var v1.0
These variables have type *RangeTable.
Lao
Var v1.0
Lao is the set of Unicode characters in script Lao.
Latin
Var v1.0
Latin is the set of Unicode characters in script Latin.
Lepcha
Var v1.0
Lepcha is the set of Unicode characters in script Lepcha.
Letter
Var v1.0
Letter/L is the set of Unicode letters, category L.
Limbu
Var v1.0
Limbu is the set of Unicode characters in script Limbu.
Linear_A
Var v1.0
Linear_A is the set of Unicode characters in script Linear_A.
Linear_B
Var v1.0
Linear_B is the set of Unicode characters in script Linear_B.
Lisu
Var v1.0
Lisu is the set of Unicode characters in script Lisu.
Ll
Var v1.0
Ll is the set of Unicode characters in category Ll (Letter, lowercase).
Lm
Var v1.0
Lm is the set of Unicode characters in category Lm (Letter, modifier).
Lo
Var v1.0
Lo is the set of Unicode characters in category Lo (Letter, other).
Logical_Order_Exception
Var v1.0
Logical_Order_Exception is the set of Unicode characters with property Logical_Order_Exception.
Lower
Var v1.0
Lower is the set of Unicode lower case letters.
Lt
Var v1.0
Lt is the set of Unicode characters in category Lt (Letter, titlecase).
Lu
Var v1.0
Lu is the set of Unicode characters in category Lu (Letter, uppercase).
Lycian
Var v1.0
Lycian is the set of Unicode characters in script Lycian.
Lydian
Var v1.0
Lydian is the set of Unicode characters in script Lydian.
M
Var v1.0
These variables have type *RangeTable.
Mahajani
Var v1.0
Mahajani is the set of Unicode characters in script Mahajani.
Makasar
Var v1.0
Makasar is the set of Unicode characters in script Makasar.
Malayalam
Var v1.0
Malayalam is the set of Unicode characters in script Malayalam.
Mandaic
Var v1.0
Mandaic is the set of Unicode characters in script Mandaic.
Manichaean
Var v1.0
Manichaean is the set of Unicode characters in script Manichaean.
Marchen
Var v1.0
Marchen is the set of Unicode characters in script Marchen.
Mark
Var v1.0
Mark/M is the set of Unicode mark characters, category M.
Masaram_Gondi
Var v1.0
Masaram_Gondi is the set of Unicode characters in script Masaram_Gondi.
Mc
Var v1.0
Mc is the set of Unicode characters in category Mc (Mark, spacing combining).
Me
Var v1.0
Me is the set of Unicode characters in category Me (Mark, enclosing).
Medefaidrin
Var v1.0
Medefaidrin is the set of Unicode characters in script Medefaidrin.
Meetei_Mayek
Var v1.0
Meetei_Mayek is the set of Unicode characters in script Meetei_Mayek.
Mende_Kikakui
Var v1.0
Mende_Kikakui is the set of Unicode characters in script Mende_Kikakui.
Meroitic_Cursive
Var v1.0
Meroitic_Cursive is the set of Unicode characters in script Meroitic_Cursive.
Meroitic_Hieroglyphs
Var v1.0
Meroitic_Hieroglyphs is the set of Unicode characters in script Meroitic_Hieroglyphs.
Miao
Var v1.0
Miao is the set of Unicode characters in script Miao.
Mn
Var v1.0
Mn is the set of Unicode characters in category Mn (Mark, nonspacing).
Modi
Var v1.0
Modi is the set of Unicode characters in script Modi.
Mongolian
Var v1.0
Mongolian is the set of Unicode characters in script Mongolian.
Mro
Var v1.0
Mro is the set of Unicode characters in script Mro.
Multani
Var v1.0
Multani is the set of Unicode characters in script Multani.
Myanmar
Var v1.0
Myanmar is the set of Unicode characters in script Myanmar.
N
Var v1.0
These variables have type *RangeTable.
Nabataean
Var v1.0
Nabataean is the set of Unicode characters in script Nabataean.
Nandinagari
Var v1.0
Nandinagari is the set of Unicode characters in script Nandinagari.
Nd
Var v1.0
Nd is the set of Unicode characters in category Nd (Number, decimal digit).
New_Tai_Lue
Var v1.0
New_Tai_Lue is the set of Unicode characters in script New_Tai_Lue.
Newa
Var v1.0
Newa is the set of Unicode characters in script Newa.
Nko
Var v1.0
Nko is the set of Unicode characters in script Nko.
Nl
Var v1.0
Nl is the set of Unicode characters in category Nl (Number, letter).
No
Var v1.0
No is the set of Unicode characters in category No (Number, other).
Noncharacter_Code_Point
Var v1.0
Noncharacter_Code_Point is the set of Unicode characters with property Noncharacter_Code_Point.
Number
Var v1.0
Number/N is the set of Unicode number characters, category N.
Nushu
Var v1.0
Nushu is the set of Unicode characters in script Nushu.
Nyiakeng_Puachue_Hmong
Var v1.0
Nyiakeng_Puachue_Hmong is the set of Unicode characters in script Nyiakeng_Puachue_Hmong.
Ogham
Var v1.0
Ogham is the set of Unicode characters in script Ogham.
Ol_Chiki
Var v1.0
Ol_Chiki is the set of Unicode characters in script Ol_Chiki.
Old_Hungarian
Var v1.0
Old_Hungarian is the set of Unicode characters in script Old_Hungarian.
Old_Italic
Var v1.0
Old_Italic is the set of Unicode characters in script Old_Italic.
Old_North_Arabian
Var v1.0
Old_North_Arabian is the set of Unicode characters in script Old_North_Arabian.
Old_Permic
Var v1.0
Old_Permic is the set of Unicode characters in script Old_Permic.
Old_Persian
Var v1.0
Old_Persian is the set of Unicode characters in script Old_Persian.
Old_Sogdian
Var v1.0
Old_Sogdian is the set of Unicode characters in script Old_Sogdian.
Old_South_Arabian
Var v1.0
Old_South_Arabian is the set of Unicode characters in script Old_South_Arabian.
Old_Turkic
Var v1.0
Old_Turkic is the set of Unicode characters in script Old_Turkic.
Oriya
Var v1.0
Oriya is the set of Unicode characters in script Oriya.
Osage
Var v1.0
Osage is the set of Unicode characters in script Osage.
Osmanya
Var v1.0
Osmanya is the set of Unicode characters in script Osmanya.
Other
Var v1.0
Other/C is the set of Unicode control and special characters, category C.
Other_Alphabetic
Var v1.0
Other_Alphabetic is the set of Unicode characters with property Other_Alphabetic.
Other_Default_Ignorable_Code_Point
Var v1.0
Other_Default_Ignorable_Code_Point is the set of Unicode characters with property Other_Default_Ignorable_Code_Point.
Other_Grapheme_Extend
Var v1.0
Other_Grapheme_Extend is the set of Unicode characters with property Other_Grapheme_Extend.
Other_ID_Continue
Var v1.0
Other_ID_Continue is the set of Unicode characters with property Other_ID_Continue.
Other_ID_Start
Var v1.0
Other_ID_Start is the set of Unicode characters with property Other_ID_Start.
Other_Lowercase
Var v1.0
Other_Lowercase is the set of Unicode characters with property Other_Lowercase.
Other_Math
Var v1.0
Other_Math is the set of Unicode characters with property Other_Math.
Other_Uppercase
Var v1.0
Other_Uppercase is the set of Unicode characters with property Other_Uppercase.
P
Var v1.0
These variables have type *RangeTable.
Pahawh_Hmong
Var v1.0
Pahawh_Hmong is the set of Unicode characters in script Pahawh_Hmong.
Palmyrene
Var v1.0
Palmyrene is the set of Unicode characters in script Palmyrene.
Pattern_Syntax
Var v1.0
Pattern_Syntax is the set of Unicode characters with property Pattern_Syntax.
Pattern_White_Space
Var v1.0
Pattern_White_Space is the set of Unicode characters with property Pattern_White_Space.
Pau_Cin_Hau
Var v1.0
Pau_Cin_Hau is the set of Unicode characters in script Pau_Cin_Hau.
Pc
Var v1.0
Pc is the set of Unicode characters in category Pc (Punctuation, connector).
Pd
Var v1.0
Pd is the set of Unicode characters in category Pd (Punctuation, dash).
Pe
Var v1.0
Pe is the set of Unicode characters in category Pe (Punctuation, close).
Pf
Var v1.0
Pf is the set of Unicode characters in category Pf (Punctuation, final quote).
Phags_Pa
Var v1.0
Phags_Pa is the set of Unicode characters in script Phags_Pa.
Phoenician
Var v1.0
Phoenician is the set of Unicode characters in script Phoenician.
Pi
Var v1.0
Pi is the set of Unicode characters in category Pi (Punctuation, initial quote).
Po
Var v1.0
Po is the set of Unicode characters in category Po (Punctuation, other).
Prepended_Concatenation_Mark
Var v1.0
Prepended_Concatenation_Mark is the set of Unicode characters with property Prepended_Concatenation_Mark.
PrintRanges
Var v1.0
PrintRanges defines the set of printable characters according to Go.
ASCII space, U+0020, is handled separately.
Properties
Var v1.0
Properties is the set of Unicode property tables.
Ps
Var v1.0
Ps is the set of Unicode characters in category Ps (Punctuation, open).
Psalter_Pahlavi
Var v1.0
Psalter_Pahlavi is the set of Unicode characters in script Psalter_Pahlavi.
Punct
Var v1.0
Punct/P is the set of Unicode punctuation characters, category P.
Quotation_Mark
Var v1.0
Quotation_Mark is the set of Unicode characters with property Quotation_Mark.
Radical
Var v1.0
Radical is the set of Unicode characters with property Radical.
Regional_Indicator
Var v1.0
Regional_Indicator is the set of Unicode characters with property Regional_Indicator.
Rejang
Var v1.0
Rejang is the set of Unicode characters in script Rejang.
Runic
Var v1.0
Runic is the set of Unicode characters in script Runic.
S
Var v1.0
These variables have type *RangeTable.
STerm
Var v1.0
STerm is an alias for Sentence_Terminal.
Samaritan
Var v1.0
Samaritan is the set of Unicode characters in script Samaritan.
Saurashtra
Var v1.0
Saurashtra is the set of Unicode characters in script Saurashtra.
Sc
Var v1.0
Sc is the set of Unicode characters in category Sc (Symbol, currency).
Scripts
Var v1.0
Scripts is the set of Unicode script tables.
Sentence_Terminal
Var v1.0
Sentence_Terminal is the set of Unicode characters with property Sentence_Terminal.
Sharada
Var v1.0
Sharada is the set of Unicode characters in script Sharada.
Shavian
Var v1.0
Shavian is the set of Unicode characters in script Shavian.
Siddham
Var v1.0
Siddham is the set of Unicode characters in script Siddham.
SignWriting
Var v1.0
SignWriting is the set of Unicode characters in script SignWriting.
Sinhala
Var v1.0
Sinhala is the set of Unicode characters in script Sinhala.
Sk
Var v1.0
Sk is the set of Unicode characters in category Sk (Symbol, modifier).
Sm
Var v1.0
Sm is the set of Unicode characters in category Sm (Symbol, math).
So
Var v1.0
So is the set of Unicode characters in category So (Symbol, other).
Soft_Dotted
Var v1.0
Soft_Dotted is the set of Unicode characters with property Soft_Dotted.
Sogdian
Var v1.0
Sogdian is the set of Unicode characters in script Sogdian.
Sora_Sompeng
Var v1.0
Sora_Sompeng is the set of Unicode characters in script Sora_Sompeng.
Soyombo
Var v1.0
Soyombo is the set of Unicode characters in script Soyombo.
Space
Var v1.0
Space/Z is the set of Unicode space characters, category Z.
Sundanese
Var v1.0
Sundanese is the set of Unicode characters in script Sundanese.
Syloti_Nagri
Var v1.0
Syloti_Nagri is the set of Unicode characters in script Syloti_Nagri.
Symbol
Var v1.0
Symbol/S is the set of Unicode symbol characters, category S.
Syriac
Var v1.0
Syriac is the set of Unicode characters in script Syriac.
Tagalog
Var v1.0
Tagalog is the set of Unicode characters in script Tagalog.
Tagbanwa
Var v1.0
Tagbanwa is the set of Unicode characters in script Tagbanwa.
Tai_Le
Var v1.0
Tai_Le is the set of Unicode characters in script Tai_Le.
Tai_Tham
Var v1.0
Tai_Tham is the set of Unicode characters in script Tai_Tham.
Tai_Viet
Var v1.0
Tai_Viet is the set of Unicode characters in script Tai_Viet.
Takri
Var v1.0
Takri is the set of Unicode characters in script Takri.
Tamil
Var v1.0
Tamil is the set of Unicode characters in script Tamil.
Tangut
Var v1.0
Tangut is the set of Unicode characters in script Tangut.
Telugu
Var v1.0
Telugu is the set of Unicode characters in script Telugu.
Terminal_Punctuation
Var v1.0
Terminal_Punctuation is the set of Unicode characters with property Terminal_Punctuation.
Thaana
Var v1.0
Thaana is the set of Unicode characters in script Thaana.
Thai
Var v1.0
Thai is the set of Unicode characters in script Thai.
Tibetan
Var v1.0
Tibetan is the set of Unicode characters in script Tibetan.
Tifinagh
Var v1.0
Tifinagh is the set of Unicode characters in script Tifinagh.
Tirhuta
Var v1.0
Tirhuta is the set of Unicode characters in script Tirhuta.
Title
Var v1.0
Title is the set of Unicode title case letters.
TurkishCase
Var v1.0
Ugaritic
Var v1.0
Ugaritic is the set of Unicode characters in script Ugaritic.
Unified_Ideograph
Var v1.0
Unified_Ideograph is the set of Unicode characters with property Unified_Ideograph.
Upper
Var v1.0
Upper is the set of Unicode upper case letters.
Vai
Var v1.0
Vai is the set of Unicode characters in script Vai.
Variation_Selector
Var v1.0
Variation_Selector is the set of Unicode characters with property Variation_Selector.
Wancho
Var v1.0
Wancho is the set of Unicode characters in script Wancho.
Warang_Citi
Var v1.0
Warang_Citi is the set of Unicode characters in script Warang_Citi.
White_Space
Var v1.0
White_Space is the set of Unicode characters with property White_Space.
Yezidi
Var v1.0
Yezidi is the set of Unicode characters in script Yezidi.
Yi
Var v1.0
Yi is the set of Unicode characters in script Yi.
Z
Var v1.0
These variables have type *RangeTable.
Zanabazar_Square
Var v1.0
Zanabazar_Square is the set of Unicode characters in script Zanabazar_Square.
Zl
Var v1.0
Zl is the set of Unicode characters in category Zl (Separator, line).
Zp
Var v1.0
Zp is the set of Unicode characters in category Zp (Separator, paragraph).
Zs
Var v1.0
Zs is the set of Unicode characters in category Zs (Separator, space).

Functions, Macros, and Special Forms

In
Function v1.0
```
(In r & ranges)
```
In reports whether the rune is a member of one of the ranges.

Go input arguments: (r rune, ranges ...*RangeTable)

Go returns: bool

Joker input arguments: [^Char r, & ^*RangeTable ranges]

Joker returns: ^Boolean
Is
Function v1.0
```
(Is rangeTab r)
```
Is reports whether the rune is in the specified table of ranges.

Go input arguments: (rangeTab *RangeTable, r rune)

Go returns: bool

Joker input arguments: [^*RangeTable rangeTab, ^Char r]

Joker returns: ^Boolean
IsControl
Function v1.0
```
(IsControl r)
```
IsControl reports whether the rune is a control character.
The C (Other) Unicode category includes more code points
such as surrogates; use Is(C, r) to test for them.

Go input arguments: (r rune)

Go returns: bool

Joker input arguments: [^Char r]

Joker returns: ^Boolean
IsDigit
Function v1.0
```
(IsDigit r)
```
IsDigit reports whether the rune is a decimal digit.

Go input arguments: (r rune)

Go returns: bool

Joker input arguments: [^Char r]

Joker returns: ^Boolean
IsGraphic
Function v1.0
```
(IsGraphic r)
```
IsGraphic reports whether the rune is defined as a Graphic by Unicode.
Such characters include letters, marks, numbers, punctuation, symbols, and
spaces, from categories L, M, N, P, S, Zs.

Go input arguments: (r rune)

Go returns: bool

Joker input arguments: [^Char r]

Joker returns: ^Boolean
IsLetter
Function v1.0
```
(IsLetter r)
```
IsLetter reports whether the rune is a letter (category L).

Go input arguments: (r rune)

Go returns: bool

Joker input arguments: [^Char r]

Joker returns: ^Boolean
IsLower
Function v1.0
```
(IsLower r)
```
IsLower reports whether the rune is a lower case letter.

Go input arguments: (r rune)

Go returns: bool

Joker input arguments: [^Char r]

Joker returns: ^Boolean
IsMark
Function v1.0
```
(IsMark r)
```
IsMark reports whether the rune is a mark character (category M).

Go input arguments: (r rune)

Go returns: bool

Joker input arguments: [^Char r]

Joker returns: ^Boolean
IsNumber
Function v1.0
```
(IsNumber r)
```
IsNumber reports whether the rune is a number (category N).

Go input arguments: (r rune)

Go returns: bool

Joker input arguments: [^Char r]

Joker returns: ^Boolean
IsOneOf
Function v1.0
```
(IsOneOf ranges r)
```
IsOneOf reports whether the rune is a member of one of the ranges.
The function "In" provides a nicer signature and should be used in preference to IsOneOf.

Go input arguments: (ranges []*RangeTable, r rune)

Go returns: bool

Joker input arguments: [^arrayOf*RangeTable ranges, ^Char r]

Joker returns: ^Boolean
IsPrint
Function v1.0
```
(IsPrint r)
```
IsPrint reports whether the rune is defined as printable by Go. Such
characters include letters, marks, numbers, punctuation, symbols, and the
ASCII space character, from categories L, M, N, P, S and the ASCII space
character. This categorization is the same as IsGraphic except that the
only spacing character is ASCII space, U+0020.

Go input arguments: (r rune)

Go returns: bool

Joker input arguments: [^Char r]

Joker returns: ^Boolean
IsPunct
Function v1.0
```
(IsPunct r)
```
IsPunct reports whether the rune is a Unicode punctuation character
(category P).

Go input arguments: (r rune)

Go returns: bool

Joker input arguments: [^Char r]

Joker returns: ^Boolean
IsSpace
Function v1.0
```
(IsSpace r)
```
IsSpace reports whether the rune is a space character as defined
by Unicode's White Space property; in the Latin-1 space
this is

'\t', '\n', '\v', '\f', '\r', ' ', U+0085 (NEL), U+00A0 (NBSP).

Other definitions of spacing characters are set by category
Z and property Pattern_White_Space.

Go input arguments: (r rune)

Go returns: bool

Joker input arguments: [^Char r]

Joker returns: ^Boolean
IsSymbol
Function v1.0
```
(IsSymbol r)
```
IsSymbol reports whether the rune is a symbolic character.

Go input arguments: (r rune)

Go returns: bool

Joker input arguments: [^Char r]

Joker returns: ^Boolean
IsTitle
Function v1.0
```
(IsTitle r)
```
IsTitle reports whether the rune is a title case letter.

Go input arguments: (r rune)

Go returns: bool

Joker input arguments: [^Char r]

Joker returns: ^Boolean
IsUpper
Function v1.0
```
(IsUpper r)
```
IsUpper reports whether the rune is an upper case letter.

Go input arguments: (r rune)

Go returns: bool

Joker input arguments: [^Char r]

Joker returns: ^Boolean
SimpleFold
Function v1.0
```
(SimpleFold r)
```
SimpleFold iterates over Unicode code points equivalent under
the Unicode-defined simple case folding. Among the code points
equivalent to rune (including rune itself), SimpleFold returns the
smallest rune > r if one exists, or else the smallest rune >= 0.
If r is not a valid Unicode code point, SimpleFold(r) returns r.

For example:

SimpleFold('A') = 'a'
SimpleFold('a') = 'A'

SimpleFold('K') = 'k'
SimpleFold('k') = '\u212A' (Kelvin symbol, K)
SimpleFold('\u212A') = 'K'

SimpleFold('1') = '1'

SimpleFold(-2) = -2

Go input arguments: (r rune)

Go returns: rune

Joker input arguments: [^Char r]

Joker returns: ^Char
To
Function v1.0
```
(To _case r)
```
To maps the rune to the specified case: UpperCase, LowerCase, or TitleCase.

Go input arguments: (_case int, r rune)

Go returns: rune

Joker input arguments: [^Int _case, ^Char r]

Joker returns: ^Char
ToLower
Function v1.0
```
(ToLower r)
```
ToLower maps the rune to lower case.

Go input arguments: (r rune)

Go returns: rune

Joker input arguments: [^Char r]

Joker returns: ^Char
ToTitle
Function v1.0
```
(ToTitle r)
```
ToTitle maps the rune to title case.

Go input arguments: (r rune)

Go returns: rune

Joker input arguments: [^Char r]

Joker returns: ^Char
ToUpper
Function v1.0
```
(ToUpper r)
```
ToUpper maps the rune to upper case.

Go input arguments: (r rune)

Go returns: rune

Joker input arguments: [^Char r]

Joker returns: ^Char

Types

*CaseRange
Concrete Type v1.0
CaseRange represents a range of Unicode code points for simple (one
code point to one code point) case conversion.
The range runs from Lo to Hi inclusive, with a fixed stride of 1. Deltas
are the number to add to the code point to reach the code point for a
different case for that character. They may be negative. If zero, it
means the character is in the corresponding case. There is a special
case representing sequences of alternating corresponding Upper and Lower
pairs. It appears with a fixed Delta of

{UpperLower, UpperLower, UpperLower}

The constant UpperLower has an otherwise impossible delta value.
*Range16
Concrete Type v1.0
Range16 represents of a range of 16-bit Unicode code points. The range runs from Lo to Hi
inclusive and has the specified stride.
*Range32
Concrete Type v1.0
Range32 represents of a range of Unicode code points and is used when one or
more of the values will not fit in 16 bits. The range runs from Lo to Hi
inclusive and has the specified stride. Lo and Hi must always be >= 1<<16.
*RangeTable
Concrete Type v1.0
RangeTable defines a set of Unicode code points by listing the ranges of
code points within the set. The ranges are listed in two slices
to save space: a slice of 16-bit ranges and a slice of 32-bit ranges.
The two slices must be in sorted order and non-overlapping.
Also, R32 should contain only values >= 0x10000 (1<<16).
*SpecialCase
Concrete Type v1.0
SpecialCase represents language-specific case mappings such as Turkish.
Methods of SpecialCase customize (by overriding) the standard mappings.
CaseRange
Concrete Type v1.0
CaseRange represents a range of Unicode code points for simple (one
code point to one code point) case conversion.
The range runs from Lo to Hi inclusive, with a fixed stride of 1. Deltas
are the number to add to the code point to reach the code point for a
different case for that character. They may be negative. If zero, it
means the character is in the corresponding case. There is a special
case representing sequences of alternating corresponding Upper and Lower
pairs. It appears with a fixed Delta of

{UpperLower, UpperLower, UpperLower}

The constant UpperLower has an otherwise impossible delta value.
Range16
Concrete Type v1.0
Range16 represents of a range of 16-bit Unicode code points. The range runs from Lo to Hi
inclusive and has the specified stride.
Range32
Concrete Type v1.0
Range32 represents of a range of Unicode code points and is used when one or
more of the values will not fit in 16 bits. The range runs from Lo to Hi
inclusive and has the specified stride. Lo and Hi must always be >= 1<<16.
RangeTable
Concrete Type v1.0
RangeTable defines a set of Unicode code points by listing the ranges of
code points within the set. The ranges are listed in two slices
to save space: a slice of 16-bit ranges and a slice of 32-bit ranges.
The two slices must be in sorted order and non-overlapping.
Also, R32 should contain only values >= 0x10000 (1<<16).
SpecialCase
Concrete Type v1.0
SpecialCase represents language-specific case mappings such as Turkish.
Methods of SpecialCase customize (by overriding) the standard mappings.
ToLower
Receiver for SpecialCase v1.0
```
([r])
```
ToLower maps the rune to lower case giving priority to the special mapping.
ToTitle
Receiver for SpecialCase v1.0
```
([r])
```
ToTitle maps the rune to title case giving priority to the special mapping.
ToUpper
Receiver for SpecialCase v1.0
```
([r])
```
ToUpper maps the rune to upper case giving priority to the special mapping.
arrayOfCaseRange
Concrete Type v1.0
CaseRange represents a range of Unicode code points for simple (one
code point to one code point) case conversion.
The range runs from Lo to Hi inclusive, with a fixed stride of 1. Deltas
are the number to add to the code point to reach the code point for a
different case for that character. They may be negative. If zero, it
means the character is in the corresponding case. There is a special
case representing sequences of alternating corresponding Upper and Lower
pairs. It appears with a fixed Delta of

{UpperLower, UpperLower, UpperLower}

The constant UpperLower has an otherwise impossible delta value.
arrayOfRange16
Concrete Type v1.0
Range16 represents of a range of 16-bit Unicode code points. The range runs from Lo to Hi
inclusive and has the specified stride.
arrayOfRange32
Concrete Type v1.0
Range32 represents of a range of Unicode code points and is used when one or
more of the values will not fit in 16 bits. The range runs from Lo to Hi
inclusive and has the specified stride. Lo and Hi must always be >= 1<<16.
arrayOfRangeTable
Concrete Type v1.0
RangeTable defines a set of Unicode code points by listing the ranges of
code points within the set. The ranges are listed in two slices
to save space: a slice of 16-bit ranges and a slice of 32-bit ranges.
The two slices must be in sorted order and non-overlapping.
Also, R32 should contain only values >= 0x10000 (1<<16).
arrayOfSpecialCase
Concrete Type v1.0
SpecialCase represents language-specific case mappings such as Turkish.
Methods of SpecialCase customize (by overriding) the standard mappings.

Namespace: go.std.unicode

Contents

Summary

Index

Legend

Constants

LowerCase

MaxASCII

MaxCase

MaxLatin1

MaxRune

ReplacementChar

TitleCase

UpperCase

UpperLower

Version

Variables

ASCII_Hex_Digit

Adlam

Ahom

Anatolian_Hieroglyphs

Arabic

Armenian

Avestan

AzeriCase

Balinese

Bamum

Bassa_Vah

Batak

Bengali

Bhaiksuki

Bidi_Control

Bopomofo

Brahmi

Braille

Buginese

Buhid

C

Canadian_Aboriginal

Carian

CaseRanges

Categories

Caucasian_Albanian

Cc

Cf

Chakma

Cham

Cherokee

Chorasmian

Co

Common

Coptic

Cs

Cuneiform

Cypriot

Cyrillic

Dash

Deprecated

Deseret

Devanagari

Diacritic

Digit

Dives_Akuru

Dogra

Duployan

Egyptian_Hieroglyphs

Elbasan

Elymaic

Ethiopic

Extender

FoldCategory

FoldScript

Georgian

Glagolitic

Gothic

Grantha

GraphicRanges

Greek

Gujarati

Gunjala_Gondi