Quick Links

Re: count * performance issue

From:	Craig James <craig_james(at)emolecules(dot)com>
To:	pgsql-performance(at)postgresql(dot)org
Subject:	Re: count * performance issue
Date:	2008-03-06 15:28:50
Message-ID:	47D00DB2.7040007@emolecules.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-performance

In the 3 years I've been using Postgres, the problem of count() performance has come up more times than I can recall, and each time the answer is, "It's a sequential scan -- redesign your application."

My question is: What do the other databases do that Postgres can't do, and why not?

Count() on Oracle and MySQL is almost instantaneous, even for very large tables. So why can't Postgres do what they do?

On the one hand, I understand that Postgres has its architecture, and I understand the issue of row visibility, and so forth. On the other hand, my database is just sitting there, nothing going on, no connections except me, and... it takes FIFTY FIVE SECONDS to count 20 million rows, a query that either Oracle or MySQL would answer in a fraction of a second. It's hard for me to believe there isn't a better way.

This is a real problem. Countless people (including me) have spent significant effort rewriting applications because of this performance flaw in Postgres. Over and over, the response is, "You don't really need to do that ... change your application." Well, sure, it's always possible to change the application, but that misses the point. To most of us users, count() seems like it should be a trivial operation. On other relational database systems, it is a trivial operation.

This is really a significant flaw on an otherwise excellent relational database system.

My rant for today...
Craig

In response to

Re: count * performance issue at 2008-03-06 13:18:17 from Greg Smith

Responses

Re: count * performance issue at 2008-03-06 15:33:25 from Bruce Momjian
Re: count * performance issue at 2008-03-06 15:36:39 from Steinar H. Gunderson
Re: count * performance issue at 2008-03-06 15:41:20 from Bill Moran
Re: count * performance issue at 2008-03-06 16:16:31 from Mark Lewis
Re: count * performance issue at 2008-03-06 16:31:44 from D'Arcy J.M. Cain
Re: count * performance issue at 2008-03-06 17:29:22 from Tom Lane
Re: count * performance issue at 2008-03-07 02:40:45 from Mark Kirkwood
Re: count * performance issue at 2008-03-08 08:04:31 from Arjen van der Meijden

Browse pgsql-performance by date

	From	Date	Subject
Next Message	Bruce Momjian	2008-03-06 15:33:25	Re: count * performance issue
Previous Message	Greg Smith	2008-03-06 13:43:43	Re: postgresql Explain command output