TextAnalysisExtensionsOnString extension

Extension methods on term that exposes methods analysing and tokenizing text.

on

Properties

diphtongs List<String>
Returns all the diphtongs in the String.
no setter
termCount int
Returns the number of individual terms in the String.
no setter
triptongs List<String>
Returns all the diphtongs in the String.
no setter
vowelDipthongAndTriptongCount int
Returns the number of single vowels, diphtongs and triptongs in the String.
no setter
vowels List<String>
Returns all the vowels in the String.
no setter
words List<String>
Returns all the words in the String.
no setter

Methods

normalizeHyphens() String
Replaces all dashes and hyphens (U+2011 through U+2014) with a standard hyphen (U+2011).
normalizeQuotes() String
Replaces all double quote characters with U+0022, and single quote characters with +U0027
normalizeWhitespace() String
Replace all white-space sequences with single space and trim.
removeEnclosingQuotes() String
Trims all forms of quotation marks from start of the String.
removePossessives() String
Removes all posessive apostropes.
removeQuotes() String
Removes all quote marks from the string, except where within a word or where it is a apostrophe preceded by n "s" or "S" (possessive plural).
splitAtWhitespace() List<String>
Split the String at (one or more) white-space characters.