string/text_fingerprint_utils library
Text fingerprinting (simhash-style) — roadmap #417.
Functions
-
fingerprintDistance(
int a, int b) → int - Hamming distance between two 32-bit fingerprints (number of differing bits).
-
textFingerprint(
String text) → int -
Simple 32-bit fingerprint: hash of word shingles.
textsplit on non-letters.