From: | Andrew Sullivan <ajs(at)crankycanuck(dot)ca> |
---|---|
To: | pgsql-general(at)postgresql(dot)org |
Subject: | Re: something better than pgtrgm? |
Date: | 2012-10-09 15:16:03 |
Message-ID: | 20121009151601.GH594@crankycanuck.ca |
Lists: | pgsql-general |
On Tue, Oct 09, 2012 at 03:54:35PM +0200, Willy-Bas Loos wrote:
>
> > If so, I
> > can almost imagine a way this could work
> >
>
> Great! How?

Well, it involves very large tables. But basically, you work out a
"variant" table for any language you like, and then query across it
with subsets of the trigrams you were just working with. It probably
sucks in performance, but at least you're likely to get valid
sequences this way.
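
One possible reading of the idea above, as a minimal Python sketch rather than anything tested against pg_trgm itself. The `VARIANTS` table, the function names, and the sample words are all hypothetical, and the padding in `trigrams` only roughly imitates what pg_trgm does. Each query trigram is expanded through the variant table, and stored words are ranked by how many of the expanded trigrams they contain.

```python
from collections import defaultdict
from itertools import product

# Hypothetical per-language variant table (an assumption for illustration):
# each character maps to the characters treated as interchangeable with it.
VARIANTS = {"o": "oö", "ö": "öo", "e": "eé", "é": "ée"}


def trigrams(word):
    """Return the set of trigrams of a word, padded with two leading
    spaces and one trailing space (roughly what pg_trgm does)."""
    padded = "  " + word.lower() + " "
    return {padded[i:i + 3] for i in range(len(padded) - 2)}


def variant_trigrams(trigram):
    """Expand one trigram into every trigram reachable via VARIANTS."""
    choices = [VARIANTS.get(ch, ch) for ch in trigram]
    return {"".join(combo) for combo in product(*choices)}


def build_index(words):
    """Map each trigram to the set of stored words containing it."""
    index = defaultdict(set)
    for word in words:
        for tri in trigrams(word):
            index[tri].add(word)
    return index


def candidates(query, index):
    """Rank stored words by the number of query trigrams they match,
    counting a match through any variant of the trigram."""
    scores = defaultdict(int)
    for tri in trigrams(query):
        matched = set()
        for var in variant_trigrams(tri):
            matched |= index.get(var, set())
        for word in matched:
            scores[word] += 1
    # Highest score first; ties broken alphabetically.
    return sorted(scores, key=lambda w: (-scores[w], w))


index = build_index(["göteborg", "gothenburg", "oslo"])
print(candidates("goteborg", index))
```

The expansion grows multiplicatively with the number of variants per character, which is one way to see why the tables get very large and why performance is likely to suffer.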

For inspiration on this (and why I have so much depressing news on the
subject of internationalization in a multi-script and multi-lingual
environment), see RFC 3743 and RFC 4290. These are related (among
other things) to how to make "variants" of different DNS labels
somehow hang together. The problem is not directly related to what
you're working on, but it's a similar sort of problem: people have
rough ideas of what they're entering, and they need an exact match.
You have the good fortune of being able to provide them with a hint!
I wish I were in your shoes.
A
--
Andrew Sullivan
ajs(at)crankycanuck(dot)ca