From: Heikki Linnakangas <hlinnakangas(at)vmware(dot)com>
To: Jeff Davis <pgsql(at)j-davis(dot)com>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: pgsql-hackers(at)postgresql(dot)org
Subject: Re: Allow multi-byte characters as escape in SIMILAR TO and SUBSTRING
Date: 2014-08-25 13:01:13
Message-ID: 53FB3399.2070802@vmware.com
Lists: pgsql-hackers
On 07/12/2014 05:16 AM, Jeff Davis wrote:
> I was able to see about a 2% increase in runtime when using the
> similar_escape function directly. I made a 10M tuple table and did:
>
> explain analyze
> select
> similar_escape('ΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣ','#') from t;
>
> which was the worst reasonable case I could think of. (It appears that
> selecting from a table is faster than from generate_series. I'm curious
> what you use when testing the performance of an individual function at
> the SQL level.)
A large table like that is what I usually do. A large generate_series()
spends a lot of time building the tuplestore, especially if it doesn't
fit in work_mem and spills to disk. Sometimes I use this to avoid it:
explain analyze
select
similar_escape('ΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣΣ','#')
from generate_series(1, 10000) a, generate_series(1,1000);
although in my experience it still has somewhat more overhead than a
straight seqscan.
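
For reference, a test table like the 10M-tuple one Jeff describes can
itself be built with generate_series. This is only a sketch of a
plausible setup: the table name "t" matches his query, but the column
definition is an assumption, since the original post doesn't show it.

-- Hypothetical setup for the 10M-row table used in the EXPLAIN ANALYZE
-- test above; the single integer column is an assumption.
CREATE TABLE t AS SELECT g AS i FROM generate_series(1, 10000000) g;
ANALYZE t;

The cross-join variant quoted above gets the same 10M rows while keeping
each tuplestore small (10000 and 1000 rows respectively), so neither
spills to disk even with a modest work_mem.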
- Heikki