Quick Links

Re: Context switch storm

From:	Richard Huxton <dev(at)archonet(dot)com>
To:	Cosimo Streppone <cosimo(at)streppone(dot)it>
Cc:	Postgresql Performance list <pgsql-performance(at)postgresql(dot)org>
Subject:	Re: Context switch storm
Date:	2006-11-03 13:29:28
Message-ID:	454B4438.9080105@archonet.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-performance

Cosimo Streppone wrote:
> Richard Huxton wrote:
>
>> creimer(at)brturbo(dot)com(dot)br wrote:
>>>
>>> The average context switching for this server as vmstat shows is 1
>>> but when the problem occurs it goes to 250000.
>>
>> You'll tend to see it when you have multiple clients and most queries
>> can use RAM rather than disk I/O. My understanding of what happens is
>> that PG requests data from RAM - it's not in cache so the process gets
>> suspended to wait. The next process does the same, with the same
>> result. You end up with lots of processes all fighting over what
>> data is in the cache and no-one gets much work done.
>
> Does this happen also with 8.0, or is specific to 8.1 ?

All versions suffer to a degree - they just push the old Xeon in the
wrong way. However, more recent versions *should* be better than older
versions. I believe some work was put in to prevent contention on
various locks which should reduce context-switching across the board.

> I seem to have the same exact behaviour for an OLTP-loaded 8.0.1 server

upgrade from 8.0.1 - the most recent is 8.0.9 iirc

> when I raise `shared_buffers' from 8192 to 40000.
> I would expect an increase in tps/concurrent clients, but I see an average
> performance below a certain threshold of users, and when concurrent users
> get above that level, performance starts to drop, no matter what I do.

Are you seeing a jump in context-switching in top? You'll know when you
do - it's a *large* jump. That's the key diagnosis. Otherwise it might
simply be your configuration settings aren't ideal for that workload.

> Server logs and io/vm statistics seem to indicate that there is little
> or no disk activity but machine loads increases to 7.0/8.0.
> After some minutes, the problem goes away, and performance returns
> to acceptable levels.

That sounds like it. Query time increases across the board as all the
clients fail to get any data back.

> When the load increases, *random* database queries show this "slowness",
> even if they are perfectly planned and indexed.
>
> Is there anything we can do?

Well, the client I saw it with just bought a dual-opteron server and
used their quad-Xeon for something else. However, I do remember that 8.1
seemed better than 7.4 before they switched. Part of that might just
have been better query-planning and other efficiences though.

--
Richard Huxton
Archonet Ltd

In response to

Re: Context switch storm at 2006-11-03 13:06:36 from Cosimo Streppone

Responses

Re: Context switch storm at 2006-11-03 13:53:02 from Andreas Kostyrka
Re: Context switch storm at 2006-11-14 09:51:44 from Cosimo Streppone

Browse pgsql-performance by date

	From	Date	Subject
Next Message	Andreas Kostyrka	2006-11-03 13:34:24	Re: Setting "nice" values
Previous Message	creimer	2006-11-03 13:28:36	Re: Context switch storm