Re: 100% CPU pg processes that don't die.

From: Greg Smith <gsmith(at)gregsmith(dot)com>
To: Scott Marlowe <scott(dot)marlowe(at)gmail(dot)com>
Cc: pgsql general <pgsql-general(at)postgresql(dot)org>
Subject: Re: 100% CPU pg processes that don't die.
Date: 2008-08-11 16:41:02
Message-ID: Pine.GSO.4.64.0808111159300.25081@westnet.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Sun, 10 Aug 2008, Scott Marlowe wrote:

> The good news is that both Centos 5.2 and Ubuntu 7.10 seem immune to
> this particular bug, and have been running 13 hours now without a
> hitch.

Not sure if it's relevant here, but you do know that I've been kicking
back to lkml that pgbench has issues on the 2.6.24 kernel, right? I
haven't tried simulating 1000 clients like you're case, but it fails
miserably to scale to even 10 with the select-only workload.

The CFS scheduler used in 2.6.23+ is only about a year old right now, and
it sure seems like it's still a baby going through its share of teething
pains. Lest you think it's just me complaining, read
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/188226 Maybe your
problem is somewhere else instead, but particularly given the known
pgbench issues it may be worth spending a minute investigating that area.

One bit of black magic I was told is that you can turn off some of the new
scheduling "features" (one of which was the main cause of the pgbench
incompatibility) by updating a kernel tunable. You can turn off all the
fancy features with:

echo 0 > /proc/sys/kernel/sched_features

See http://lkml.org/lkml/2008/5/26/288 for more details.

Whether the problem still shows up with that change should help narrow
whether your issue is likely related to the scheduler changes or likely
something else that's different with the newer kernel. Your pgbench
results should be higher after that change, too, but you shouldn't run the
system like that normally--it's optimizing for something bad pgbench does
but will hurt other, more realistic workloads.

--
* Greg Smith gsmith(at)gregsmith(dot)com http://www.gregsmith.com Baltimore, MD

In response to

Browse pgsql-general by date

  From Date Subject
Next Message aravind chandu 2008-08-11 16:41:47 How to calculate number of rows per page in postgresql
Previous Message Joao Ferreira gmail 2008-08-11 16:30:04 big database with very small dump !?