encodingType property
The encoding type used by the API to calculate sentence offsets. Possible string values are:
- "NONE" : If
EncodingType
is not specified, encoding-dependent information (such asbegin_offset
) will be set at-1
. - "UTF8" : Encoding-dependent information (such as
begin_offset
) is calculated based on the UTF-8 encoding of the input. C++ and Go are examples of languages that use this encoding natively. - "UTF16" : Encoding-dependent information (such as
begin_offset
) is calculated based on the UTF-16 encoding of the input. Java and JavaScript are examples of languages that use this encoding natively. - "UTF32" : Encoding-dependent information (such as
begin_offset
) is calculated based on the UTF-32 encoding of the input. Python is an example of a language that uses this encoding natively.
Implementation
core.String? encodingType;