Quick Links

Re: vacuum analyze again...

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc:	Peter Eisentraut <peter_e(at)gmx(dot)net>, Jean-Christophe Boggio <cat(at)thefreecat(dot)org>, PostgreSQL General <pgsql-general(at)postgresql(dot)org>
Subject:	Re: vacuum analyze again...
Date:	2001-02-20 18:27:33
Message-ID:	25778.982693653@sss.pgh.pa.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us> writes:
>> How's reading a sufficiently large fraction of random rows going to be
>> significantly faster than reading all rows? If you're just going to read
>> the first n rows then that isn't really random, is it?

> Ingres did this too, I thought. You could specify a certain number of
> random rows, perhaps 10%. On a large table, that is often good enough
> and much faster. Often 2% is enough.

Peter's got a good point though. Even 2% is going to mean fetching most
or all of the blocks in the table, for typical-size rows. Furthermore,
fetching (say) every second or third block is likely to be actually
slower than a straight sequential read, because you're now fighting the
kernel's readahead policy instead of working with it.

To get a partial VACUUM ANALYZE that was actually usefully faster than
the current code, I think you'd have to read just a few percent of the
blocks, which means much less than a few percent of the rows ... unless
maybe you picked selected blocks but then used all the rows in those
blocks ... but is that a random sample? It's debatable.

I find it hard to believe that VAC ANALYZE is all that much slower than
plain VACUUM anyway; fixing the indexes is the slowest part of VACUUM in
my experience. It would be useful to know exactly what the columns are
in a table where VAC ANALYZE is considered unusably slow.

regards, tom lane

In response to

Re: vacuum analyze again... at 2001-02-20 17:55:02 from Bruce Momjian

Responses

Re: vacuum analyze again... at 2001-02-20 18:51:26 from Bruce Momjian

Browse pgsql-general by date

	From	Date	Subject
Next Message	Tom Lane	2001-02-20 18:39:28	Re: Re: A How-To: PostgreSQL from Tcl via ODBC
Previous Message	Peter Eisentraut	2001-02-20 18:23:40	Re: Re: A How-To: PostgreSQL from Tcl via ODBC