Bytes Speak All Languages: Cross-Script Name Retrieval via Contrastive Learning
screening system checks a name against a watchlist, it faces a silent failure mode that nobody talks about. Type “Владимир Путин” into a system indexed on “Vladimir Putin” and most name-matching approaches return nothing. The two strings share zero characters, so edit distance is meaningless, phonetic codes fail (they assume Latin), and BM25 gives up …
Bytes Speak All Languages: Cross-Script Name Retrieval via Contrastive Learning Read More »










