Re: The documentation for storage type 'plain' actually allows single byte header

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Laurenz Albe <laurenz(dot)albe(at)cybertec(dot)at>
Cc: Andres Freund <andres(at)anarazel(dot)de>, suchithjn22(at)gmail(dot)com, pgsql-docs(at)lists(dot)postgresql(dot)org
Subject: Re: The documentation for storage type 'plain' actually allows single byte header
Date: 2023-01-15 21:40:27
Message-ID: 1698750.1673818827@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-docs pgsql-hackers

Laurenz Albe <laurenz(dot)albe(at)cybertec(dot)at> writes:
> On Tue, 2023-01-10 at 15:53 +0000, PG Doc comments form wrote:
>>> PLAIN prevents either compression or out-of-line storage; furthermore it
>>> disables use of single-byte headers for varlena types. This is the only
>>> possible strategy for columns of non-TOAST-able data types.

>> However, it does allow "single byte" headers. How to verify this?
>> CREATE EXTENSION pageinspect;
>> CREATE TABLE test(a VARCHAR(10000) STORAGE PLAIN);
>> INSERT INTO test VALUES (repeat('A',10));
>>
>> Now peek into the page with pageinspect functions
>>
>> SELECT left(encode(t_data, 'hex'), 40) FROM
>> heap_page_items(get_raw_page('test', 0));
>>
>> This returned value of "1741414141414141414141".

> I think that the documentation is wrong. The attached patch removes the
> offending half-sentence.

The documentation is correct, what is broken is the code. I'm not
sure when we broke it, but what I see in tracing through the INSERT
is that we are forming the tuple using a tupdesc with the wrong
value of attstorage. It looks like the tupdesc belongs to the
virtual slot representing the output of the INSERT statement,
which is not identical to the target relation's tupdesc.

(The virtual slot's tupdesc is probably reverse-engineered from
just the data types of the columns, so it'll have whatever is the
default attstorage for the data type. It's blind luck that this
attstorage value isn't used for anything more consequential,
like TOAST decisions.)

regards, tom lane

In response to

Responses

Browse pgsql-docs by date

  From Date Subject
Next Message Andres Freund 2023-01-15 23:03:20 Re: The documentation for storage type 'plain' actually allows single byte header
Previous Message Tatsuo Ishii 2023-01-15 03:28:41 Re: archive_command was duplicated in backup.sgml

Browse pgsql-hackers by date

  From Date Subject
Next Message Andres Freund 2023-01-15 22:06:27 Re: logrep stuck with 'ERROR: int2vector has too many elements'
Previous Message Tom Lane 2023-01-15 20:53:09 Re: logrep stuck with 'ERROR: int2vector has too many elements'