Optimizing query?

From: wolfgang(at)noten5(dot)maas-noten(dot)de
To: pgsql-general(at)postgresql(dot)org
Subject: Optimizing query?
Date: 2013-01-30 11:08:05
Message-ID: wolfgang-1130130120805.A0219642@noten5.maas-noten.de
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general


Hi,

I am trying to match items from 2 tables based on a common string.
One is a big table which has one column with entries like XY123, ABC44, etc
The table has an index on that column.
The second table is, typically, much smaller

select .... from tab1, tab2 where tab1.code = tab2.code;

This works fine and fast.
Now, as a variant, I have some entries like XY423A, XY423B, GF55A, GF55D in the
big table and want them to match XY423, GF55 in the second table

Variants I have tried

select .... from tab1, tab2 where tab1.code ~ (tab2.code||'($|[A-Z])');
select .... from tab1, tab2 where tab1.code ~ ('^'||tab2.code||'($|[A-Z])');

both take an enormous time. In the better case that I can subset (e.g. all candidates in table 2
share initial "AX") I get back to manageable times by adding
and tab1.code ~ '^AX'
into the recipe. Actual runtime with about a million entries in tab1 and 800 entries in tab2
is about 40 seconds.

Regards
Wolfgang Hamann

Responses

Browse pgsql-general by date

  From Date Subject
Next Message hamann.w 2013-01-30 11:33:17 optimize query?
Previous Message DANIEL CRISTIAN CRUZ 2013-01-30 10:49:56 Re: Is there a way to add a detail message in a warning with pl/Python?