TextAnalyzer class abstract
An interface exposes language-specific properties and methods used in text analysis.
- characterFilter is a function that manipulates text prior to stemming and tokenization;
- termFilter is a filter function that returns a collection of terms from a term. It returns an empty collection if the term is to be excluded from analysis or, returns multiple terms if the term is split (e.g. at hyphens) and / or, returns modified term(s), such as applying a stemmer algorithm; and
- termSplitter returns a list of terms from text;
- sentenceSplitter splits text into a list of sentences at sentence and line endings;
- paragraphSplitter splits text into a list of paragraphs at line endings;
- syllableCounter returns the number of syllables in a word or text.
Constructors
Properties
- characterFilter → AsyncTermModifier
-
A function that filters out unwanted characters or replaces them with
other characters.
no setter
- hashCode → int
-
The hash code for this object.
no setterinherited
- jsonTokenizer → JsonTokenizer
-
Extracts tokens from the fields in a JSON document for use in full-text
search queries and indexes.
no setter
- nGrammer → NGrammer
-
A language-specific function that generates n-grams from text.
no setter
- paragraphSplitter → TextSplitter
-
Returns a list of paragraphs from text.
no setter
- phraseSplitter → PhraseSplitter
-
Returns a list of keywords from text.
no setter
- runtimeType → Type
-
A representation of the runtime type of the object.
no setterinherited
- sentenceSplitter → TextSplitter
-
Returns a list of sentences from text.
no setter
- syllableCounter → SyllableCounter
-
Returns the number of syllables in a string after stripping out all
white-space and punctuation.
no setter
- termExpander → TermExpander?
-
Expands text to a collection of related Strings, e.g. synonyms,
abbreviations or spelling suggestions.
no setter
- termFilter → AsyncTermModifier
-
A filter function that returns a modified term or null.
no setter
- termSplitter → TextSplitter
-
Returns a list of words from text.
no setter
- tokenizer → Tokenizer
-
Extracts one or more tokens from text for use in full-text search queries
and indexes.
no setter
Methods
-
noSuchMethod(
Invocation invocation) → dynamic -
Invoked when a nonexistent method or property is accessed.
inherited
-
toString(
) → String -
A string representation of this object.
inherited
Operators
-
operator ==(
Object other) → bool -
The equality operator.
inherited