Re: CLUSTER vs. VACUUM FULL

From:	Ron Johnson <ronljohnsonjr(at)gmail(dot)com>
To:	pgsql-general <pgsql-general(at)postgresql(dot)org>
Subject:	Re: CLUSTER vs. VACUUM FULL
Date:	2024-04-22 18:45:16
Message-ID:	CANzqJaConAabzE942KSQV2BtZdBaEjvY9Cz_A8xvwrgauUX=zA@mail.gmail.com
Lists:	pgsql-general

On Mon, Apr 22, 2024 at 12:29 PM David G. Johnston <david(dot)g(dot)johnston(at)gmail(dot)com> wrote:

>
>
> On Mon, Apr 22, 2024, 08:37 Ron Johnson <ronljohnsonjr(at)gmail(dot)com> wrote:
>
>> On Mon, Apr 22, 2024 at 10:25 AM Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> wrote:
>>
>>> Marcos Pegoraro <marcos(at)f10(dot)com(dot)br> writes:
>>> > But wouldn't it be good that VACUUM FULL uses that index defined by
>>> > Cluster, if it exists ?
>>>
>>> No ... what would be the difference then?
>>>
>>
>> What the VACUUM docs "should" do, it seems, is suggest CLUSTER on the PK,
>> if the PK is a sequence (whether that be an actual sequence, or a timestamp
>> or something else that grows monotonically).
>>
>> That's because the data is already roughly in PK order.
>>
>
> If things are bad enough to require a vacuum full that doesn't seem like a
> good assumption.
>

Sure it does.

For example, I just deleted the oldest half of the records in 30 tables,
tables whose CREATED_ON timestamp values strongly correlate with the
synthetic PK sequence values.

Thus, the remaining records were still mostly in PK order. CLUSTERs on the
PK values would have taken just about as much time as the VACUUM FULL
statements which I *did* run.
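
To illustrate the reasoning (this is a toy model, not a measurement against
PostgreSQL): the sketch below treats a table as a list of PK values in
physical order, deletes the oldest half, and checks that the survivors are
still almost perfectly ordered by PK. The table size, the cutoff, and the
function name physical_order_correlation are all hypothetical, chosen only
for the illustration. On a real system, the correlation column of the
pg_stats view reports a comparable statistic for each column.

```python
def physical_order_correlation(pks):
    """Pearson correlation between physical position and PK value."""
    n = len(pks)
    mean_pos = (n - 1) / 2
    mean_pk = sum(pks) / n
    cov = sum((pos - mean_pos) * (pk - mean_pk) for pos, pk in enumerate(pks))
    var_pos = sum((pos - mean_pos) ** 2 for pos in range(n))
    var_pk = sum((pk - mean_pk) ** 2 for pk in pks)
    return cov / (var_pos * var_pk) ** 0.5


# Hypothetical heap: 10,000 rows inserted in sequence order.
heap = list(range(1, 10001))

# Assumption: a little local disorder, modelled by swapping one pair of
# neighbouring rows in every block of 50.
for i in range(0, len(heap) - 1, 50):
    heap[i], heap[i + 1] = heap[i + 1], heap[i]

# Delete the oldest half. The surviving rows keep their relative
# physical order; nothing is moved by the delete itself.
cutoff = 5000
survivors = [pk for pk in heap if pk > cutoff]

before = physical_order_correlation(heap)
after = physical_order_correlation(survivors)

print(f"rows before: {len(heap)}, correlation: {before:.6f}")
print(f"rows after:  {len(survivors)}, correlation: {after:.6f}")
```

In this model the correlation stays very close to 1 after the delete, which
is the sense in which the remaining records are "still mostly in PK order".
It says nothing about actual CLUSTER or VACUUM FULL run times.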

In response to

Re: CLUSTER vs. VACUUM FULL at 2024-04-22 16:29:21 from David G. Johnston

Responses

Re: CLUSTER vs. VACUUM FULL at 2024-04-22 19:14:38 from Adrian Klaver
