TextAnalysisExtensionsOnString extension
Extension methods on term that exposes methods analysing and tokenizing text.
- on
Properties
-
diphtongs
→ List<
String> -
Returns all the diphtongs in the String.
no setter
- termCount → int
-
Returns the number of individual terms in the String.
no setter
-
triptongs
→ List<
String> -
Returns all the diphtongs in the String.
no setter
- vowelDipthongAndTriptongCount → int
-
Returns the number of single vowels, diphtongs and triptongs in the
String.
no setter
-
vowels
→ List<
String> -
Returns all the vowels in the String.
no setter
-
words
→ List<
String> -
Returns all the words in the String.
no setter
Methods
-
normalizeHyphens(
) → String - Replaces all dashes and hyphens (U+2011 through U+2014) with a standard hyphen (U+2011).
-
normalizeQuotes(
) → String - Replaces all double quote characters with U+0022, and single quote characters with +U0027
-
normalizeWhitespace(
) → String - Replace all white-space sequences with single space and trim.
-
removeEnclosingQuotes(
) → String - Trims all forms of quotation marks from start of the String.
-
removePossessives(
) → String - Removes all posessive apostropes.
-
removeQuotes(
) → String - Removes all quote marks from the string, except where within a word or where it is a apostrophe preceded by n "s" or "S" (possessive plural).
-
splitAtWhitespace(
) → List< String> - Split the String at (one or more) white-space characters.