Quick Links

Re: Update table performance

From:	Heikki Linnakangas <heikki(at)enterprisedb(dot)com>
To:	Erik Jones <erik(at)myemma(dot)com>
Cc:	Mark Makarowsky <bedrockconstruction(at)yahoo(dot)com>, pgsql-performance(at)postgresql(dot)org
Subject:	Re: Update table performance
Date:	2007-08-08 08:00:48
Message-ID:	46B97830.1010603@enterprisedb.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-performance

Erik Jones wrote:
> Decibel! wrote:
>> I should mention that if you can handle splitting the
>> update into multiple transactions, that will help a
>> lot since it means you won't be doubling the size of
>> the table.
>
> As I mentioned above, when you do an update you're actually inserting a
> new row and deleting the old one. That deleted row is still considered
> part of the table (for reasons of concurrency, read up on the
> concurrency chapter in the manual for the details) and once it is no
> longer visible by any live transactions can be re-used by future
> inserts. So, if you update one column on every row of a one million row
> table all at once, you have to allocate and write out one million new
> rows. But, if you do the update a quarter million at a time, the last
> three updates would be able to re-use many of the rows deleted in
> earlier updates.

Only if you vacuum between the updates.

--
Heikki Linnakangas
EnterpriseDB http://www.enterprisedb.com

In response to

Re: Update table performance at 2007-08-08 01:46:20 from Erik Jones

Responses

Re: Update table performance at 2007-08-08 15:15:41 from Erik Jones

Browse pgsql-performance by date

	From	Date	Subject
Next Message	Enrico Weigelt	2007-08-08 12:16:11	Implementing an regex filter
Previous Message	Erik Jones	2007-08-08 01:46:20	Re: Update table performance