post_process_text library

Text-level normalization passes for OCR post-processing.

Handles punctuation spacing, resolution of multi-character letter confusions, and other text-wide normalization.

Functions

normalizeLetterConfusions(String text) → String
Normalizes common multi-character letter confusions within non-dictionary words.
normalizePunctuationSpacing(String text) → String
Corrects punctuation spacing errors common in OCR output.
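
Example

The two passes listed above can be illustrated with a minimal sketch. This is not the library's implementation: the names sketchDictionary, sketchConfusions, sketchNormalizeLetterConfusions, and sketchNormalizePunctuationSpacing are hypothetical, and the confusion pairs, the tiny word list, and the spacing rules are assumptions chosen only to show the general idea of each pass.

```dart
// Illustrative sketch only. Every name and rule here is an assumption,
// not the behavior of the post_process_text library itself.

/// Tiny stand-in word list (assumption); a real pass would use a full dictionary.
const Set<String> sketchDictionary = {'the', 'time', 'word', 'corner'};

/// Assumed multi-character confusions: OCR output -> intended letter.
const Map<String, String> sketchConfusions = {'rn': 'm', 'vv': 'w', 'cl': 'd'};

/// Rewrites a word only when it is NOT in the dictionary and substituting
/// one confusion pair turns it into a dictionary word.
String sketchNormalizeLetterConfusions(String text) {
  return text.replaceAllMapped(RegExp(r'[A-Za-z]+'), (Match m) {
    final word = m.group(0)!;
    // Dictionary words are left alone, even if they contain "rn" or "cl".
    if (sketchDictionary.contains(word.toLowerCase())) return word;
    for (final entry in sketchConfusions.entries) {
      final candidate = word.replaceAll(entry.key, entry.value);
      if (candidate != word &&
          sketchDictionary.contains(candidate.toLowerCase())) {
        return candidate;
      }
    }
    return word; // No substitution produced a known word.
  });
}

/// Removes whitespace before punctuation and adds a missing space after it.
String sketchNormalizePunctuationSpacing(String text) {
  return text
      // Drop spaces or tabs inserted before punctuation marks.
      .replaceAll(RegExp(r'[ \t]+(?=[,.;:!?])'), '')
      // Add a space after punctuation that is directly followed by a letter.
      // The period is skipped here so strings like "e.g." are not split.
      .replaceAllMapped(
          RegExp(r'([,;:!?])(?=[A-Za-z])'), (Match m) => '${m.group(1)} ');
}
```

With these assumed rules, sketchNormalizeLetterConfusions('the tirne vvord corner') yields 'the time word corner' ("corner" is kept because it is already a dictionary word), and sketchNormalizePunctuationSpacing('Hello , world !How are you ?') yields 'Hello, world! How are you?'. Restricting substitutions to non-dictionary words is what keeps valid words containing "rn" or "cl" from being damaged.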