post_process_text library
Text-level normalization passes for OCR post-processing.
Handles punctuation spacing, multi-character letter confusion resolution, and other text-wide normalization.
Functions
-
normalizeLetterConfusions(
String text) → String - Normalizes common multi-character letter confusions within non-dictionary words.
-
normalizePunctuationSpacing(
String text) → String - Normalizes punctuation spacing errors common in OCR.