string/text_fingerprint_utils library

Text fingerprinting (simhash-style) — roadmap #417.

Functions

fingerprintDistance(int a, int b) int
Hamming distance between two 32-bit fingerprints (number of differing bits).
textFingerprint(String text) int
Simple 32-bit fingerprint: hash of word shingles. text split on non-letters.