Gutf8
Pure OCaml module for conversion between UCS and UTF-8.
type unistring = unichar array
val from_unichar : unichar -> string
from_unichar 0xiii converts the code point iii (usually written in hexadecimal) into a string containing the UTF-8 encoding of that character. See unicode.org for character maps. It does not check that the given code point is a valid Unicode code point.
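
The conversion described for from_unichar can be illustrated with a minimal, self-contained sketch. This is not the Gutf8 implementation: encode_code_point is a hypothetical helper name, and the code assumes the argument lies in the range 0 to 0x10FFFF. Like the documented function, it does not reject surrogates or unassigned code points.

```ocaml
(* Hypothetical helper, not part of Gutf8: encode one code point as UTF-8.
   Assumes 0 <= cp <= 0x10FFFF; no validity check is performed. *)
let encode_code_point (cp : int) : string =
  let b = Buffer.create 4 in
  let add n = Buffer.add_char b (Char.chr n) in
  if cp < 0x80 then
    (* 1 byte: 0xxxxxxx *)
    add cp
  else if cp < 0x800 then begin
    (* 2 bytes: 110xxxxx 10xxxxxx *)
    add (0xC0 lor (cp lsr 6));
    add (0x80 lor (cp land 0x3F))
  end else if cp < 0x10000 then begin
    (* 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx *)
    add (0xE0 lor (cp lsr 12));
    add (0x80 lor ((cp lsr 6) land 0x3F));
    add (0x80 lor (cp land 0x3F))
  end else begin
    (* 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx *)
    add (0xF0 lor ((cp lsr 18) land 0x07));
    add (0x80 lor ((cp lsr 12) land 0x3F));
    add (0x80 lor ((cp lsr 6) land 0x3F));
    add (0x80 lor (cp land 0x3F))
  end;
  Buffer.contents b
```

For instance, encode_code_point 0x20AC yields the three bytes E2 82 AC, the UTF-8 encoding of the euro sign.
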
val from_unistring : unistring -> string
to_unichar_validated decodes a UTF-8 encoded code point and checks for incomplete characters, invalid characters and overlong encodings.
to_unichar decodes a UTF-8 encoded code point. The result is undefined if pos does not point to a valid UTF-8 encoded character.
val to_unistring : string -> unistring
to_unistring decodes a UTF-8 encoded string into an array of unichar. The string must be valid UTF-8.
val first_char : string -> unichar
first_char returns the first code point of a UTF-8 encoded string.
next returns the position of the code point following the one at pos.
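
Stepping from one code point to the next can be sketched as follows. The signature of next is not shown on this page, so next_pos and count_code_points are hypothetical names with assumed argument order (string, then byte position); the sketch assumes the input is valid UTF-8 and that pos points at the first byte of a character.

```ocaml
(* Hypothetical helper, not part of Gutf8: given the byte offset [pos] of the
   first byte of a character, return the offset of the following character.
   Assumes valid UTF-8; the lead byte alone determines the sequence length. *)
let next_pos (s : string) (pos : int) : int =
  let b = Char.code s.[pos] in
  if b < 0x80 then pos + 1                  (* 0xxxxxxx: 1 byte  *)
  else if b land 0xE0 = 0xC0 then pos + 2   (* 110xxxxx: 2 bytes *)
  else if b land 0xF0 = 0xE0 then pos + 3   (* 1110xxxx: 3 bytes *)
  else pos + 4                              (* 11110xxx: 4 bytes *)

(* Walk a valid UTF-8 string and count its code points. *)
let count_code_points (s : string) : int =
  let len = String.length s in
  let rec go pos n = if pos >= len then n else go (next_pos s pos) (n + 1) in
  go 0 0
```

Applied to a string holding "a", "é" and "€" (1 + 2 + 3 bytes), count_code_points returns 3 even though String.length returns 6.
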
module Error : sig ... end
to_unichar_validated may raise PARTIAL_INPUT or ILLEGAL_SEQUENCE.
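
The three checks attributed to to_unichar_validated (incomplete characters, invalid characters, overlong encodings) can be sketched as below. This is an illustration under assumptions, not the module's code: the exceptions Partial_input and Illegal_sequence are local stand-ins for the errors named above, decode_validated is a hypothetical name, and its signature and return value (code point plus next position) are assumed, since the real signature is not shown on this page.

```ocaml
(* Local stand-ins for the errors named in the documentation (assumption). *)
exception Partial_input
exception Illegal_sequence

(* Hypothetical helper, not part of Gutf8: decode the code point starting at
   byte offset [pos]. Returns the code point and the offset just past it. *)
let decode_validated (s : string) (pos : int) : int * int =
  let len = String.length s in
  if pos < 0 || pos >= len then raise Partial_input;
  let b0 = Char.code s.[pos] in
  (* (continuation bytes, payload bits of the lead byte, smallest code point
     that legitimately needs this many bytes) *)
  let n, init, min_cp =
    if b0 < 0x80 then (0, b0, 0)
    else if b0 land 0xE0 = 0xC0 then (1, b0 land 0x1F, 0x80)
    else if b0 land 0xF0 = 0xE0 then (2, b0 land 0x0F, 0x800)
    else if b0 land 0xF8 = 0xF0 then (3, b0 land 0x07, 0x10000)
    else raise Illegal_sequence           (* stray continuation or bad lead byte *)
  in
  (* Incomplete character: the string ends before the sequence does. *)
  if pos + n >= len then raise Partial_input;
  let cp = ref init in
  for i = 1 to n do
    let b = Char.code s.[pos + i] in
    (* Every continuation byte must look like 10xxxxxx. *)
    if b land 0xC0 <> 0x80 then raise Illegal_sequence;
    cp := (!cp lsl 6) lor (b land 0x3F)
  done;
  (* Overlong encoding: the value would have fit in fewer bytes. *)
  if !cp < min_cp then raise Illegal_sequence;
  (* Out of range or UTF-16 surrogate: not a valid scalar value. *)
  if !cp > 0x10FFFF || (!cp >= 0xD800 && !cp <= 0xDFFF) then
    raise Illegal_sequence;
  (!cp, pos + n + 1)
```

A caller can treat the two exceptions differently: Partial_input suggests waiting for more bytes (for example when reading from a stream), while Illegal_sequence indicates input that no further bytes can repair.
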