From: | Dag Lem <dag(at)nimrod(dot)no> |
---|---|
To: | PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org> |
Subject: | daitch_mokotoff module |
Date: | 2021-12-03 20:07:29 |
Message-ID: | yger1atbgfy.fsf@sid.nimrod.no |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
Hello,
Please find attached a patch for the daitch_mokotoff module.
This implements the Daitch-Mokotoff Soundex System, as described in
https://www.avotaynu.com/soundex.htm
The module is used in production at Finance Norway.
In order to verify correctness, I have compared generated soundex codes
with corresponding results from the implementation by Stephen P. Morse
at https://stevemorse.org/census/soundex.html
Where soundex codes differ, the daitch_mokotoff module has been found
to be correct. The Morse implementation uses a few unofficial rules,
and also has an error in the handling of adjacent identical code
digits. Please see daitch_mokotoff.c for further references and
comments.
For reference, detailed instructions for soundex code comparison are
attached.
Best regards
Dag Lem
Attachment | Content-Type | Size |
---|---|---|
daitch_mokotoff.patch | text/x-patch | 51.2 KB |
soundex-comparison.sh | application/x-sh | 1.0 KB |
From | Date | Subject | |
---|---|---|---|
Next Message | Andrew Dunstan | 2021-12-03 20:11:11 | Re: The "char" type versus non-ASCII characters |
Previous Message | Melanie Plageman | 2021-12-03 20:02:24 | Re: pg_stat_bgwriter.buffers_backend is pretty meaningless (and more?) |