[pgsql-general] Baffling Performance

From: Poet/Joshua Drake <poet(at)linuxports(dot)com>
To: <pgsql-general(at)postgresql(dot)org>
Subject: [pgsql-general] Baffling Performance
Date: 2001-04-04 18:53:24
Message-ID: Pine.LNX.4.30.0104041113590.22572-100000@crazypenguins.commandprompt.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general pgsql-sql

Good day,

We're running a week-old CVS snapshot of PostgreSQL 7.1, and I'm not sure
if this performance inconsistency is specific to it, or if this is just
something in PostgreSQL in general, but it seems kind of odd, and I could
use some help here. ;)

I have run two queries in table full of invoices where the only different
in the query is the where clause:

SELECT COUNT(DISTINCT l."ORDER_NUM") AS line_orders,
SUM(l."GROSS_PROFIT" + "REPLACEMENT_COST") AS grand_total,
SUM(l."LIST_PRICE" * l."SHIPPED_QUANTITY") AS grand_total_alternative,
SUM(l."GROSS_PROFIT") AS grand_total_profit
FROM cu_invoice_li1_tbl l INNER JOIN cu_invoice_03_tbl h
ON l."ORDER_NUM"=h."ORDER_NUM"

WHERE substr("ORDER_NUM", 1, 1) != 'W'; # (Orders NOT prefixed with W.)

OR...

WHERE substr("ORDER_NUM", 1, 1) = 'I'; # (Orders prefixed with I.)

The first query returns more rows than the second (because it includes
orders prefixed with I, C, F, etc), whereas the second only returns orders
prefixed with I. It seems that the logic of identifying a single character
as *not* being a single value would be identical or at least similar in
efficiency to the logic of a single character *being* being a single value.

However, the first query runs in about 10-15 seconds, and the second query
ran for over 40 minutes before I cancelled it.

Now, I can easily use a series of "!=" statements to get it down so that
it's only the 'I' orders (which runs even faster, at about 5-6 seconds!),
but can anyone explain this to me? :)

Are "!=" substring evaluations inherently faster by an obscene order of
magnitude, or is something really wrong here?

J.

--
--
<COMPANY>CommandPrompt - http://www.commandprompt.com </COMPANY>
<PROJECT>OpenDocs, LLC. - http://www.opendocs.org </PROJECT>
<PROJECT>LinuxPorts - http://www.linuxports.com </PROJECT>
<WEBMASTER>LDP - http://www.linuxdoc.org </WEBMASTER>
--
Instead of asking why a piece of software is using "1970s technology,"
start asking why software is ignoring 30 years of accumulated wisdom.
--

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Gordon A. Runkle 2001-04-04 18:54:14 Re: php Compile question
Previous Message Ahmed Moustafa 2001-04-04 18:34:53 Re: Postgres 7.1RC1 on Solaris 7

Browse pgsql-sql by date

  From Date Subject
Next Message Tom Lane 2001-04-04 18:55:30 Re: Need to do an ALTER TABLE.
Previous Message Srikanth Rao 2001-04-04 18:45:09 Memory exhaustion