Quick Links

Re: Excessive WAL generation and related performance issue

From:	Jim Nasby <jim(at)nasby(dot)net>
To:	Andres Freund <andres(at)2ndquadrant(dot)com>, Joe Conway <mail(at)joeconway(dot)com>
Cc:	"Hackers (PostgreSQL)" <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Excessive WAL generation and related performance issue
Date:	2014-04-14 22:17:18
Message-ID:	534C5E6E.1080604@nasby.net
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On 4/14/14, 4:50 PM, Andres Freund wrote:
> Hi,
>
> On 2014-04-14 14:33:03 -0700, Joe Conway wrote:
>> checkpoint_segments = 96
>> checkpoint_timeout = 10min
>
>> I realize there are many things that can be done to improve my
>> specific scenario, e.g. drop indexes before loading, change various
>> configs, etc. My purpose for this post is to ask if it is really
>> expected to get over 20 times as much WAL as heap data?
>
> I'd bet a large percentage of this will be full page images of the
> index. The values you index are essentially distributed over the whole
> index, so you'll modifiy the same indx values repeatedly. But often
> enough it won't be in the same checkpoint and thus will create full page
> images.

My thought exactly...

ISTM that we should be able to push all the index inserts to the end of the transaction. That should greatly reduce the amount of full page writes. That would also open the door for doing all the index inserts in parallel.
--
Jim C. Nasby, Data Architect jim(at)nasby(dot)net
512.569.9461 (cell) http://jim.nasby.net

In response to

Re: Excessive WAL generation and related performance issue at 2014-04-14 21:50:59 from Andres Freund

Responses

Re: Excessive WAL generation and related performance issue at 2014-04-14 22:51:41 from Joe Conway

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Jim Nasby	2014-04-14 22:36:40	Re: Signaling of waiting for a cleanup lock?
Previous Message	Andres Freund	2014-04-14 21:50:59	Re: Excessive WAL generation and related performance issue