charset library
Classes
- Charset
- Global method for Encoders
- CodePage
- A mapping between bytes and characters.
- CodePageDecoder
- A code page decoder, converts from bytes to characters.
- CodePageEncoder
- Encoder for a code page.
- EucJPCodec
- The EUC-JP Codec
- EucJPDecoder
- The EUC-JP Decoder
- EucJPEncoder
- The EUC-JP Encoder
- EucKRCodec
- EUC-KR Codec
- EucKRDecoder
- EUC-KR Decoder
- EucKREncoder
- EUC-KR Encoder
- GbkCodec
- A GbkCodec encodes strings to GBK code units (bytes) and decodes GBK code units to strings.
- GbkDecoder
- This class converts GBK code units (lists of unsigned 8-bit integers) to a string.
- GbkEncoder
- This class converts strings to their GBK code units (a list of unsigned 8-bit integers).
- IterableUtf32Decoder
- Return type of decodeUtf32AsIterable and variants. The Iterable type provides an iterator on demand, and the iterator only translates bytes as requested by the user of the iterator. (Note: results are not cached.)
- ShiftJISCodec
- ShiftJis Codec
- ShiftJISDecoder
- ShiftJis Decoder
- ShiftJISEncoder
- ShiftJis Encoder
- Utf16beBytesToCodeUnitsDecoder
- Converts UTF-16BE encoded bytes to UTF-16 code units by grouping 1-2 bytes to produce each code unit (0 to 2^16 - 1).
- Utf16BytesToCodeUnitsDecoder
- Converts UTF-16 encoded bytes to UTF-16 code units by grouping 1-2 bytes to produce each code unit (0 to 2^16 - 1). Relies on the BOM to determine endianness, and defaults to big-endian (BE).
- Utf16Codec
- Utf16 Codec
- Utf16Decoder
- Utf16Decoder
- Utf16Encoder
- Utf16 Encoder
- Utf16leBytesToCodeUnitsDecoder
- Converts UTF-16LE encoded bytes to UTF-16 code units by grouping 1-2 bytes to produce each code unit (0 to 2^16 - 1).
- Utf32beBytesDecoder
- Converts UTF-32BE encoded bytes to code points by grouping 4 bytes to produce each Unicode code point.
- Utf32BytesDecoder
- Abstract parent class that converts encoded bytes to code points.
- Utf32Codec
- Utf32Codec
- Utf32Decoder
- Utf32Decoder
- Utf32Encoder
- Utf32Encoder
- Utf32leBytesDecoder
- Converts UTF-32LE encoded bytes to code points by grouping 4 bytes to produce each Unicode code point.
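The byte-grouping step that the Utf16*BytesToCodeUnitsDecoder and Utf32*BytesDecoder descriptions refer to can be sketched in plain Dart. This is a minimal illustration under stated assumptions, not the library's implementation: the function names `utf16BytesToCodeUnits` and `utf32BytesToCodePoints` and the `bigEndian` parameter are hypothetical, and replacing a trailing incomplete group with U+FFFD is a choice made for this sketch only.

```dart
/// Groups byte pairs into UTF-16 code units.
///
/// Hypothetical helper for illustration; not part of the charset library.
List<int> utf16BytesToCodeUnits(List<int> bytes, {bool bigEndian = true}) {
  final codeUnits = <int>[];
  var i = 0;
  for (; i + 1 < bytes.length; i += 2) {
    final first = bytes[i] & 0xFF;
    final second = bytes[i + 1] & 0xFF;
    // Big-endian stores the high-order byte first; little-endian the reverse.
    codeUnits.add(bigEndian ? (first << 8) | second : (second << 8) | first);
  }
  // Assumption of this sketch: a trailing unpaired byte becomes U+FFFD.
  if (i < bytes.length) codeUnits.add(0xFFFD);
  return codeUnits;
}

/// Groups four bytes at a time into Unicode code points.
///
/// Hypothetical helper for illustration; not part of the charset library.
List<int> utf32BytesToCodePoints(List<int> bytes, {bool bigEndian = true}) {
  final codePoints = <int>[];
  var i = 0;
  for (; i + 3 < bytes.length; i += 4) {
    final group = [for (var j = 0; j < 4; j++) bytes[i + j] & 0xFF];
    // Normalize to big-endian order before combining the four bytes.
    final ordered = bigEndian ? group : group.reversed.toList();
    codePoints.add((ordered[0] << 24) |
        (ordered[1] << 16) |
        (ordered[2] << 8) |
        ordered[3]);
  }
  // Assumption of this sketch: an incomplete trailing group becomes U+FFFD.
  if (i < bytes.length) codePoints.add(0xFFFD);
  return codePoints;
}
```

For instance, passing the bytes `[0x00, 0x41, 0x4E, 0x2D]` to `utf16BytesToCodeUnits` yields the code units `0x0041` and `0x4E2D`, which `String.fromCharCodes` turns back into a two-character string.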
Constants
- eucJp → const EucJPCodec
- The instance of EUC-JP codec
- eucKr → const EucKRCodec
- The instance of EucKRCodec
- gbk → const GbkCodec
- An instance of the default implementation of the GbkCodec.
- replacementCharacterGBK → const int
- The GBK replacement character, 0xA7F6 in GBK.
- replacementCharacterUnicode → const int
- The GBK replacement character, U+E7B3 in Unicode.
- shiftJis → const ShiftJISCodec
- The instance of ShiftJIS Codec
- unicodeBomCharacterRune → const int
- The Unicode Byte Order Marker (BOM) character U+FEFF.
- utf16 → const Utf16Codec
- The instance of Utf16Codec
- utf32 → const Utf32Codec
- The instance of Utf32Codec
Properties
- cp1046 → final CodePage
- The cp1046 codec
- cp1124 → final CodePage
- The cp1124 codec
- cp1125 → final CodePage
- The cp1125 codec
- cp1129 → final CodePage
- The cp1129 codec
- cp1133 → final CodePage
- The cp1133 codec
- cp1161 → final CodePage
- The cp1161 codec
- cp1162 → final CodePage
- The cp1162 codec
- cp1163 → final CodePage
- The cp1163 codec
- cp437 → final CodePage
- The cp437 codec
- cp737 → final CodePage
- The cp737 codec
- cp775 → final CodePage
- The cp775 codec
- cp850 → final CodePage
- The cp850 codec
- cp852 → final CodePage
- The cp852 codec
- cp855 → final CodePage
- The cp855 codec
- cp856 → final CodePage
- The cp856 codec
- cp857 → final CodePage
- The cp857 codec
- cp858 → final CodePage
- The cp858 codec
- cp860 → final CodePage
- The cp860 codec
- cp861 → final CodePage
- The cp861 codec
- cp862 → final CodePage
- The cp862 codec
- cp863 → final CodePage
- The cp863 codec
- cp864 → final CodePage
- The cp864 codec
- cp865 → final CodePage
- The cp865 codec
- cp866 → final CodePage
- The cp866 codec
- cp869 → final CodePage
- The cp869 codec
- cp922 → final CodePage
- The cp922 codec
- latin10 → final CodePage
- The ISO-8859-16/Latin-10 (South Eastern European) code page.
- latin2 → final CodePage
- The ISO-8859-2/Latin-2 (Eastern European) code page.
- latin3 → final CodePage
- The ISO-8859-3/Latin-3 (South European) code page.
- latin4 → final CodePage
- The ISO-8859-4/Latin-4 (North European) code page.
- latin5 → final CodePage
- The ISO-8859-9/Latin-5 (Turkish) code page.
- latin6 → final CodePage
- The ISO-8859-10/Latin-6 (Nordic) code page.
- latin7 → final CodePage
- The ISO-8859-13/Latin-7 (Baltic Rim) code page.
- latin8 → final CodePage
- The ISO-8859-14/Latin-8 (Celtic) code page.
- latin9 → final CodePage
- The ISO-8859-15/Latin-9 (Western European revised) code page.
- latinArabic → final CodePage
- The ISO-8859-6/Latin-Arabic code page.
- latinCyrillic → final CodePage
- The ISO-8859-5/Latin-Cyrillic code page.
- latinGreek → final CodePage
- The ISO-8859-7/Latin-Greek code page.
- latinHebrew → final CodePage
- The ISO-8859-8/Latin-Hebrew code page.
- latinThai → final CodePage
- The ISO-8859-11/Latin-Thai code page.
- windows1250 → final CodePage
- The windows1250 codec
- windows1251 → final CodePage
- The windows1251 codec
- windows1252 → final CodePage
- The windows1252 codec
- windows1253 → final CodePage
- The windows1253 codec
- windows1254 → final CodePage
- The windows1254 codec
- windows1255 → final CodePage
- The windows1255 codec
- windows1256 → final CodePage
- The windows1256 codec
- windows1257 → final CodePage
- The windows1257 codec
- windows1258 → final CodePage
- The windows1258 codec
- windows874 → final CodePage
- The windows874 codec
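Since a CodePage is described above as a mapping between bytes and characters, a toy version of that idea may help. The sketch below is self-contained Dart and does not use the library: `ToyCodePage` and `toyCp437` are hypothetical names, only three high-byte entries of code page 437 are filled in, and the handling of unmapped values (U+FFFD on decode, an exception on encode) is an assumption of this example rather than documented library behavior.

```dart
/// A toy single-byte code page: ASCII passes through unchanged and a small
/// table maps bytes 0x80 and above to Unicode code points.
///
/// Hypothetical class for illustration; the library's CodePage may differ.
class ToyCodePage {
  ToyCodePage(Map<int, int> highBytes)
      : _byteToRune = Map.unmodifiable(highBytes),
        _runeToByte = {for (final e in highBytes.entries) e.value: e.key};

  final Map<int, int> _byteToRune;
  final Map<int, int> _runeToByte;

  /// Decodes bytes to a string; unmapped high bytes become U+FFFD.
  String decode(List<int> bytes) => String.fromCharCodes(
      bytes.map((b) => b < 0x80 ? b : (_byteToRune[b] ?? 0xFFFD)));

  /// Encodes a string to bytes; throws on characters outside the table.
  List<int> encode(String text) {
    final out = <int>[];
    for (final rune in text.runes) {
      if (rune < 0x80) {
        out.add(rune);
        continue;
      }
      final byte = _runeToByte[rune];
      if (byte == null) {
        throw FormatException('Character not in code page', text);
      }
      out.add(byte);
    }
    return out;
  }
}

/// Three entries from code page 437 (0x80 Ç, 0x81 ü, 0x82 é); the full
/// table is omitted in this sketch.
final toyCp437 = ToyCodePage({0x80: 0xC7, 0x81: 0xFC, 0x82: 0xE9});
```

With this table, `toyCp437.decode([0x63, 0x61, 0x66, 0x82])` produces "café", and encoding that string gives the same four bytes back.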
Functions
- hasUtf16BeBom(List<int> utf16EncodedBytes, [int offset = 0, int? length]) → bool
- Identifies whether a List of bytes starts (based on offset) with a big-endian byte-order marker (BOM).
- hasUtf16Bom(List<int> utf32EncodedBytes, [int offset = 0, int? length]) → bool
- Identifies whether a List of bytes starts (based on offset) with a byte-order marker (BOM).
- hasUtf16LeBom(List<int> utf16EncodedBytes, [int offset = 0, int? length]) → bool
- Identifies whether a List of bytes starts (based on offset) with a little-endian byte-order marker (BOM).
- hasUtf32beBom(List<int> utf32EncodedBytes, [int offset = 0, int? length]) → bool
- Identifies whether a List of bytes starts (based on offset) with a big-endian byte-order marker (BOM).
- hasUtf32Bom(List<int> utf32EncodedBytes, [int offset = 0, int? length]) → bool
- Identifies whether a List of bytes starts (based on offset) with a byte-order marker (BOM).
- hasUtf32leBom(List<int> utf32EncodedBytes, [int offset = 0, int? length]) → bool
- Identifies whether a List of bytes starts (based on offset) with a little-endian byte-order marker (BOM).
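The BOM checks listed above come down to comparing a few leading bytes against fixed markers. The following is a minimal sketch of that comparison, assuming the standard Unicode BOM byte sequences; `startsWithBytes`, `sniffUnicodeBom`, and the marker constants are hypothetical names for this example and are not the library's functions.

```dart
// Standard Unicode byte-order marks, as they appear in each encoding scheme.
const utf16BeBom = [0xFE, 0xFF];
const utf16LeBom = [0xFF, 0xFE];
const utf32BeBom = [0x00, 0x00, 0xFE, 0xFF];
const utf32LeBom = [0xFF, 0xFE, 0x00, 0x00];

/// Returns true if [bytes] contains [marker] starting at [offset], looking
/// only at the window of [length] bytes (or to the end of the list if null).
///
/// Hypothetical helper for illustration; not part of the charset library.
bool startsWithBytes(List<int> bytes, List<int> marker,
    [int offset = 0, int? length]) {
  final end = length == null ? bytes.length : offset + length;
  if (end > bytes.length || offset + marker.length > end) return false;
  for (var i = 0; i < marker.length; i++) {
    if (bytes[offset + i] != marker[i]) return false;
  }
  return true;
}

/// Names the encoding implied by a leading BOM, or returns null if none.
///
/// UTF-32 is tested first because the UTF-32LE BOM begins with the same two
/// bytes as the UTF-16LE BOM.
String? sniffUnicodeBom(List<int> bytes) {
  if (startsWithBytes(bytes, utf32BeBom)) return 'UTF-32BE';
  if (startsWithBytes(bytes, utf32LeBom)) return 'UTF-32LE';
  if (startsWithBytes(bytes, utf16BeBom)) return 'UTF-16BE';
  if (startsWithBytes(bytes, utf16LeBom)) return 'UTF-16LE';
  return null;
}
```

The ordering in `sniffUnicodeBom` is the main design point: a check for the UTF-16LE marker alone would also match UTF-32LE input, so the longer markers are tried first.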
Typedefs
- Utf32BytesDecoderProvider = Utf32BytesDecoder Function()
- A function that creates a Utf32BytesDecoder.