Ucs_db_auxUnicode character set properties.
This module provides an interface to the Unicode character set database.
An alias for the abstract type representing a map of all Unicode code points to the value of its corresponding property.
The property index type. The full Unicode character database is large, and the portion required by the Orsetto Ucs library itself is small, so values of this type provide an abstraction of the relevant portion of the database available to the application.
type utyp += The core population of the extensible universal property type.
val create_index : (string * 'a) list -> 'a indexUse create_index s to compose an index from a sequence of pairs.
Use query m c to resolve the value property m for character c.
val search_index : 'a index -> string -> 'a optionUse search_index idx nym to query the index idx for the entry named by nym. Index keys are loosely matched.
Use search_property idx nym to query the property database index idx for the property named nym. Property names are loosely matched.
Use require_property idx nym to query the property database index idx for the property named nym. Property names are loosedly matched. Raises Not_found if no property named nym is indexed.
type blk = [ | `ASCII| `Adlam| `Aegean_Numbers| `Ahom| `Alchemical| `Alphabetic_PF| `Anatolian_Hieroglyphs| `Ancient_Greek_Music| `Ancient_Greek_Numbers| `Ancient_Symbols| `Arabic| `Arabic_Ext_A| `Arabic_Math| `Arabic_PF_A| `Arabic_PF_B| `Arabic_Sup| `Armenian| `Arrows| `Avestan| `Balinese| `Bamum| `Bamum_Sup| `Bassa_Vah| `Batak| `Bengali| `Bhaiksuki| `Block_Elements| `Bopomofo| `Bopomofo_Ext| `Box_Drawing| `Brahmi| `Braille| `Buginese| `Buhid| `Byzantine_Music| `CJK| `CJK_Compat| `CJK_Compat_Forms| `CJK_Compat_Ideographs| `CJK_Compat_Ideographs_Sup| `CJK_Ext_A| `CJK_Ext_B| `CJK_Ext_C| `CJK_Ext_D| `CJK_Ext_E| `CJK_Ext_F| `CJK_Radicals_Sup| `CJK_Strokes| `CJK_Symbols| `Carian| `Caucasian_Albanian| `Chakma| `Cham| `Cherokee| `Cherokee_Sup| `Chess_Symbols| `Compat_Jamo| `Control_Pictures| `Coptic| `Coptic_Epact_Numbers| `Counting_Rod| `Cuneiform| `Cuneiform_Numbers| `Currency_Symbols| `Cypriot_Syllabary| `Cyrillic| `Cyrillic_Ext_A| `Cyrillic_Ext_B| `Cyrillic_Ext_C| `Cyrillic_Sup| `Deseret| `Devanagari| `Devanagari_Ext| `Diacriticals| `Diacriticals_Ext| `Diacriticals_For_Symbols| `Diacriticals_Sup| `Dingbats| `Dogra| `Domino| `Duployan| `Early_Dynastic_Cuneiform| `Egyptian_Hieroglyphs| `Egyptian_Hieroglyph_Format_Controls| `Elbasan| `Elymaic| `Emoticons| `Enclosed_Alphanum| `Enclosed_Alphanum_Sup| `Enclosed_CJK| `Enclosed_Ideographic_Sup| `Ethiopic| `Ethiopic_Ext| `Ethiopic_Ext_A| `Ethiopic_Sup| `Geometric_Shapes| `Geometric_Shapes_Ext| `Georgian| `Georgian_Ext| `Georgian_Sup| `Glagolitic| `Glagolitic_Sup| `Gothic| `Grantha| `Greek| `Greek_Ext| `Gujarati| `Gunjala_Gondi| `Gurmukhi| `Half_And_Full_Forms| `Half_Marks| `Hangul| `Hanifi_Rohingya| `Hanunoo| `Hatran| `Hebrew| `High_PU_Surrogates| `High_Surrogates| `Hiragana| `IDC| `IPA_Ext| `Ideographic_Symbols| `Imperial_Aramaic| `Indic_Number_Forms| `Indic_Siyaq_Numbers| `Inscriptional_Pahlavi| `Inscriptional_Parthian| `Jamo| `Jamo_Ext_A| `Jamo_Ext_B| `Javanese| `Kaithi| `Kana_Ext_A| `Kana_Sup| `Kanbun| `Kangxi| `Kannada| `Katakana| `Katakana_Ext| `Kayah_Li| `Kharoshthi| `Khmer| `Khmer_Symbols| `Khojki| `Khudawadi| `Lao| `Latin_1_Sup| `Latin_Ext_A| `Latin_Ext_Additional| `Latin_Ext_B| `Latin_Ext_C| `Latin_Ext_D| `Latin_Ext_E| `Lepcha| `Letterlike_Symbols| `Limbu| `Linear_A| `Linear_B_Ideograms| `Linear_B_Syllabary| `Lisu| `Low_Surrogates| `Lycian| `Lydian| `Mahajani| `Mahjong| `Makasar| `Malayalam| `Mandaic| `Manichaean| `Marchen| `Masaram_Gondi| `Math_Alphanum| `Math_Operators| `Mayan_Numerals| `Medefaidrin| `Meetei_Mayek| `Meetei_Mayek_Ext| `Mende_Kikakui| `Meroitic_Cursive| `Meroitic_Hieroglyphs| `Miao| `Misc_Arrows| `Misc_Math_Symbols_A| `Misc_Math_Symbols_B| `Misc_Pictographs| `Misc_Symbols| `Misc_Technical| `Modi| `Modifier_Letters| `Modifier_Tone_Letters| `Mongolian| `Mongolian_Sup| `Mro| `Multani| `Music| `Myanmar| `Myanmar_Ext_A| `Myanmar_Ext_B| `NB| `NKo| `Nabataean| `Nandinagari| `New_Tai_Lue| `Newa| `No_Block_Assigned| `Number_Forms| `Nushu| `Nyiakeng_Puachue_Hmong| `OCR| `Ogham| `Ol_Chiki| `Old_Hungarian| `Old_Italic| `Old_North_Arabian| `Old_Permic| `Old_Persian| `Old_Sogdian| `Old_South_Arabian| `Old_Turkic| `Oriya| `Ornamental_Dingbats| `Osage| `Osmanya| `Ottoman_Siyaq_Numbers| `PUA| `Pahawh_Hmong| `Palmyrene| `Pau_Cin_Hau| `Phags_Pa| `Phaistos| `Phoenician| `Phonetic_Ext| `Phonetic_Ext_Sup| `Playing_Cards| `Psalter_Pahlavi| `Punctuation| `Rejang| `Rumi| `Runic| `Samaritan| `Saurashtra| `Sharada| `Shavian| `Shorthand_Format_Controls| `Siddham| `Sinhala| `Sinhala_Archaic_Numbers| `Small_Forms| `Small_Kana_Ext| `Sogdian| `Sora_Sompeng| `Soyombo| `Specials| `Sundanese| `Sundanese_Sup| `Sup_Arrows_A| `Sup_Arrows_B| `Sup_Arrows_C| `Sup_Math_Operators| `Sup_PUA_A| `Sup_PUA_B| `Sup_Punctuation| `Sup_Symbols_And_Pictographs| `Super_And_Sub| `Sutton_SignWriting| `Syloti_Nagri| `Symbols_And_Pictographs_Ext_A| `Syriac| `Syriac_Sup| `Tagalog| `Tagbanwa| `Tags| `Tai_Le| `Tai_Tham| `Tai_Viet| `Tai_Xuan_Jing| `Takri| `Tamil| `Tamil_Sup| `Tangut| `Tangut_Components| `Telugu| `Thaana| `Thai| `Tibetan| `Tifinagh| `Tirhuta| `Transport_And_Map| `UCAS| `UCAS_Ext| `Ugaritic| `VS| `VS_Sup| `Vai| `Vedic_Ext| `Vertical_Forms| `Wancho| `Warang_Citi| `Yi_Radicals| `Yi_Syllables| `Yijing| `Zanabazar_Square ]Unicode code block
val show_blk : blk -> stringString representation
type gc = [ | `C| `Cc| `Cf| `Cs| `Co| `Cn| `L| `LC| `Lu| `Ll| `Lt| `Lm| `Lo| `M| `Mn| `Mc| `Me| `N| `Nd| `Nl| `No| `P| `Pc| `Pd| `Ps| `Pe| `Pi| `Pf| `Po| `S| `Sm| `Sc| `Sk| `So| `Z| `Zs| `Zl| `Zp ]The general category property value type.
val show_gc : gc -> stringString representation
val show_qc : qc -> stringString representation
type script = [ | `Adlm| `Aghb| `Ahom| `Arab| `Armi| `Armn| `Avst| `Bali| `Bamu| `Bass| `Batk| `Beng| `Bhks| `Bopo| `Brah| `Brai| `Bugi| `Buhd| `Cakm| `Cans| `Cari| `Cham| `Cher| `Copt| `Cprt| `Cyrl| `Deva| `Dogr| `Dsrt| `Dupl| `Egyp| `Elba| `Elym| `Ethi| `Geor| `Glag| `Gong| `Gonm| `Goth| `Gran| `Grek| `Gujr| `Guru| `Hang| `Hani| `Hano| `Hatr| `Hebr| `Hira| `Hluw| `Hmng| `Hmnp| `Hrkt| `Hung| `Ital| `Java| `Kali| `Kana| `Khar| `Khmr| `Khoj| `Knda| `Kthi| `Lana| `Laoo| `Latn| `Lepc| `Limb| `Lina| `Linb| `Lisu| `Lyci| `Lydi| `Mahj| `Maka| `Mand| `Mani| `Marc| `Medf| `Mend| `Merc| `Mero| `Mlym| `Modi| `Mong| `Mroo| `Mtei| `Mult| `Mymr| `Nand| `Narb| `Nbat| `Newa| `Nkoo| `Nshu| `Ogam| `Olck| `Orkh| `Orya| `Osge| `Osma| `Palm| `Pauc| `Perm| `Phag| `Phli| `Phlp| `Phnx| `Plrd| `Prti| `Qaai| `Rjng| `Rohg| `Runr| `Samr| `Sarb| `Saur| `Sgnw| `Shaw| `Shrd| `Sidd| `Sind| `Sinh| `Sogd| `Sogo| `Sora| `Soyo| `Sund| `Sylo| `Syrc| `Tagb| `Takr| `Tale| `Talu| `Taml| `Tang| `Tavt| `Telu| `Tfng| `Tglg| `Thaa| `Thai| `Tibt| `Tirh| `Ugar| `Vaii| `Wara| `Wcho| `Xpeo| `Xsux| `Yiii| `Zanb| `Zinh| `Zyyy| `Zzzz ]Unicode script identifier
val show_script : script -> stringString representation
module Quick : sig ... endThis module contains internal fast-path functions for property query.