Quick Links

Re: text_position worst case runtime

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Mark Dilger <pgsql(at)markdilger(dot)com>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: text_position worst case runtime
Date:	2006-05-19 00:54:12
Message-ID:	23183.1148000052@sss.pgh.pa.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Mark Dilger <pgsql(at)markdilger(dot)com> writes:
> The function
> static int32 text_position(text *t1, text *t2, int matchnum)
> defined in src/backend/utils/adt/varlena.c uses repeated calls to
> strncmp (or pg_wchar_strncmp) to find the location of the pattern in
> the text. The worst case runtime for such an approach is O(n*m) where
> n and m are the lengths of the pattern and text. The best case would
> be O(n), I guess, because it only takes n comparisons to find the
> pattern at the very start of the text. I'm not sure how to determine
> the average case runtime, because it depends what your data looks like
> on average.

I would think that the worst-case times would be fairly improbable.
I'm disinclined to push something as complicated as Boyer-Moore matching
into this function without considerable evidence that it's a performance
bottleneck for real applications.

The question that comes to mind for me is why we're not using simple
strncmp in all cases in that code. The conversion to pg_wchar format
looks expensive and unnecessary ...

regards, tom lane

In response to

text_position worst case runtime at 2006-05-19 00:17:19 from Mark Dilger

Responses

Re: text_position worst case runtime at 2006-05-19 01:49:38 from Mark Dilger

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Christopher Kings-Lynne	2006-05-19 01:33:16	Re: [OT] MySQL is bad, but THIS bad?
Previous Message	Mark Dilger	2006-05-19 00:44:47	Re: PL/pgSQL 'i = i + 1' Syntax