Re: something better than pgtrgm?

From: Andrew Sullivan <ajs(at)crankycanuck(dot)ca>
To: pgsql-general(at)postgresql(dot)org
Subject: Re: something better than pgtrgm?
Date: 2012-10-09 15:16:03
Message-ID: 20121009151601.GH594@crankycanuck.ca
Lists: pgsql-general

On Tue, Oct 09, 2012 at 03:54:35PM +0200, Willy-Bas Loos wrote:
>
> > If so, I
> > can almost imagine a way this could work
> >
>
> Great! How?

Well, it involves very large tables. But basically, you work out a
"variant" table for any language you like, and then query across it
with subsets of the trigrams you were just working with. It probably
sucks in performance, but at least you're likely to get valid
sequences this way.
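
To make the idea a bit more concrete, here is a rough sketch in Python rather than SQL. Everything in it is an assumption for illustration: the VARIANTS mapping is a made-up stand-in for a real per-language variant table, the padding only imitates what pg_trgm does, and score() and suggest() are hypothetical helpers, not anything that exists in PostgreSQL. It expands each trigram of the query through the variant table and counts how many of them land on a trigram of the candidate.

```python
from itertools import product

# Hypothetical variant table: each character maps to the characters treated
# as interchangeable with it. A real table would be built per language.
VARIANTS = {
    "e": {"e", "é", "è"},
    "é": {"e", "é", "è"},
    "è": {"e", "é", "è"},
    "o": {"o", "ö"},
    "ö": {"o", "ö"},
}


def trigrams(word):
    """Return the 3-character windows of a lowercased, space-padded word."""
    # Two leading spaces and one trailing space, imitating pg_trgm's padding.
    padded = "  " + word.lower() + " "
    return {padded[i:i + 3] for i in range(len(padded) - 2)}


def variant_trigrams(trigram):
    """Expand one trigram into every trigram reachable via VARIANTS."""
    choices = [sorted(VARIANTS.get(ch, {ch})) for ch in trigram]
    return {"".join(combo) for combo in product(*choices)}


def score(query, candidate):
    """Fraction of the query's trigrams found in the candidate, allowing variants."""
    candidate_set = trigrams(candidate)
    query_set = trigrams(query)
    hits = sum(1 for t in query_set if variant_trigrams(t) & candidate_set)
    return hits / len(query_set)


def suggest(query, vocabulary, threshold=0.5):
    """Return vocabulary entries scoring at or above threshold, best first."""
    scored = [(score(query, word), word) for word in vocabulary]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [word for s, word in scored if s >= threshold]


if __name__ == "__main__":
    print(suggest("cafe", ["café", "cave", "table"]))
```

The expansion grows multiplicatively with the number of variants per character, which is where the very large tables and the poor performance come from.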

For inspiration on this (and why I have so much depressing news on the
subject of internationalization in a multi-script and multi-lingual
environment), see RFC 3743 and RFC 4290. These are related (among
other things) to how to make "variants" of different DNS labels
somehow hang together. The problem is not directly related to what
you're working on, but it's a similar sort of problem: people have
rough ideas of what they're entering, and they need an exact match.
You have the good fortune of being able to provide them with a hint!
I wish I were in your shoes.

A

--
Andrew Sullivan
ajs(at)crankycanuck(dot)ca
