TextAnalyzer class abstract

An interface that exposes language-specific properties and methods used in text analysis:

  • characterFilter is a function that manipulates text prior to stemming and tokenization;
  • termFilter is a filter function that returns a collection of terms from a single term. The collection is empty if the term is to be excluded from analysis, contains multiple terms if the term is split (e.g. at hyphens), and may contain modified terms (e.g. after applying a stemming algorithm);
  • termSplitter returns a list of terms from text;
  • sentenceSplitter splits text into a list of sentences at sentence and line endings;
  • paragraphSplitter splits text into a list of paragraphs at line endings;
  • syllableCounter returns the number of syllables in a word or text.
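
The behaviors listed above can be illustrated with a minimal, standalone Dart sketch. Every function name below (splitParagraphs, splitSentences, splitTerms, filterTerm, countSyllables) is hypothetical and is not part of the package; the signatures, the stop-word list, and the English-only vowel-group heuristic are assumptions chosen for illustration, and a real implementation of the interface may behave differently.

```dart
// Standalone sketch: none of these functions belong to the package API.

/// Splits text into paragraphs at line endings, dropping blank entries.
List<String> splitParagraphs(String text) => text
    .split(RegExp(r'[\r\n]+'))
    .map((p) => p.trim())
    .where((p) => p.isNotEmpty)
    .toList();

/// Splits text into sentences at sentence endings (. ! ?) and line endings.
List<String> splitSentences(String text) => RegExp(r'[^.!?\r\n]+[.!?]*')
    .allMatches(text)
    .map((m) => m.group(0)!.trim())
    .where((s) => s.isNotEmpty)
    .toList();

/// Splits text into terms at white-space, trimming surrounding punctuation.
List<String> splitTerms(String text) => text
    .split(RegExp(r'\s+'))
    .map((t) => t.replaceAll(RegExp(r'^[^A-Za-z0-9]+|[^A-Za-z0-9]+$'), ''))
    .where((t) => t.isNotEmpty)
    .toList();

/// Returns zero, one or several terms for a single input term.
Future<Set<String>> filterTerm(String term) async {
  const stopWords = {'a', 'an', 'the', 'of'}; // assumed stop-word list
  final lower = term.toLowerCase();
  if (stopWords.contains(lower)) return {}; // excluded from analysis
  if (lower.contains('-')) {
    // Keep the hyphenated form and add each of its parts.
    return {lower, ...lower.split('-').where((t) => t.isNotEmpty)};
  }
  return {lower};
}

/// Rough syllable count: one syllable per run of vowels in each word.
/// Over-counts words with a silent "e" such as "make".
int countSyllables(String text) {
  var total = 0;
  for (final word in text.toLowerCase().split(RegExp(r'\s+'))) {
    final letters = word.replaceAll(RegExp(r'[^a-z]'), '');
    if (letters.isEmpty) continue;
    final groups = RegExp(r'[aeiouy]+').allMatches(letters).length;
    total += groups == 0 ? 1 : groups;
  }
  return total;
}

Future<void> main() async {
  const text = 'The quick-witted fox ran. It escaped!\nA new paragraph.';
  print(splitParagraphs(text));
  print(splitSentences(text));
  print(splitTerms('The quick-witted fox'));
  print(await filterTerm('quick-witted'));
  print(await filterTerm('the'));
  print(countSyllables('text analysis'));
}
```

The filter is written as an async function returning a set only because the description above speaks of a collection of terms; that return type is a guess, not a statement about AsyncTermModifier.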

Constructors

TextAnalyzer()

Properties

characterFilter AsyncTermModifier
A function that filters out unwanted characters or replaces them with other characters.
no setter
hashCode int
The hash code for this object.
no setter, inherited
jsonTokenizer JsonTokenizer
Extracts tokens from the fields in a JSON document for use in full-text search queries and indexes.
no setter
nGrammer NGrammer
A language-specific function that generates n-grams from text.
no setter
paragraphSplitter TextSplitter
Returns a list of paragraphs from text.
no setter
phraseSplitter PhraseSplitter
Returns a list of keywords from text.
no setter
runtimeType Type
A representation of the runtime type of the object.
no setter, inherited
sentenceSplitter TextSplitter
Returns a list of sentences from text.
no setter
syllableCounter SyllableCounter
Returns the number of syllables in a string after stripping out all white-space and punctuation.
no setter
termExpander TermExpander?
Expands text to a collection of related Strings, e.g. synonyms, abbreviations or spelling suggestions.
no setter
termFilter AsyncTermModifier
A filter function that returns a modified term or null.
no setter
termSplitter TextSplitter
Returns a list of words from text.
no setter
tokenizer Tokenizer
Extracts one or more tokens from text for use in full-text search queries and indexes.
no setter
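
As an illustration of what an n-gram generator such as the one described for nGrammer might produce, here is a small standalone Dart sketch. It assumes word-level n-grams built from an already split list of terms; the function name wordNGrams and its signature are hypothetical and do not reflect the NGrammer typedef.

```dart
/// Hypothetical helper: joins every run of [n] consecutive terms with a space.
/// Returns an empty list when n < 1 or there are fewer than n terms.
List<String> wordNGrams(List<String> terms, int n) {
  if (n < 1 || terms.length < n) return [];
  return [
    for (var i = 0; i + n <= terms.length; i++)
      terms.sublist(i, i + n).join(' '),
  ];
}

void main() {
  final terms = ['full', 'text', 'search', 'index'];
  print(wordNGrams(terms, 2));
  print(wordNGrams(terms, 3));
}
```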

Methods

noSuchMethod(Invocation invocation) → dynamic
Invoked when a nonexistent method or property is accessed.
inherited
toString() → String
A string representation of this object.
inherited

Operators

operator ==(Object other) → bool
The equality operator.
inherited