Quick Links

Re: Fix brin_form_tuple to properly detoast data

From:	Tomas Vondra <tomas(dot)vondra(at)enterprisedb(dot)com>
To:	Alvaro Herrera <alvherre(at)alvh(dot)no-ip(dot)org>, Tomas Vondra <tomas(dot)vondra(at)2ndquadrant(dot)com>
Cc:	pgsql-hackers(at)postgresql(dot)org
Subject:	Re: Fix brin_form_tuple to properly detoast data
Date:	2020-11-05 17:42:10
Message-ID:	21cacd39-1ca2-54ef-4f3e-ac493238832c@enterprisedb.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On 11/5/20 6:17 PM, Alvaro Herrera wrote:
> On 2020-Nov-04, Tomas Vondra wrote:
>
>> The first test is fairly trivial - it simply builds index on toasted data
>> and then shows how an insert and select fail. There's a caveat, that this
>> requires a DELETE + VACUUM, and the VACUUM actually has to cleanup the rows.
>> So there must be no concurrent transactions that might need the rows, which
>> is unlikely in regression tests. So this requires waiting for all running
>> transactions to finish - I did that by building an index concurrently. It's
>> a bit strange, but it's better than any other solution I could think of
>> (timeout or some custom wait for xacts).
>
> There are recent changes in vacuum for temp tables (commit 94bc27b57680?)
> that would maybe make this stable enough, without having to have the CIC
> there. At least, I tried it locally a few times and it appears to work well.
> This won't work for older releases though, just master. This is patch
> 0001 attached here.
>

IIUC you're suggesting to use a temporary table in the test?
Unfortunately, this does not work on older releases, and IMHO the test
should be backpatched too. IMHO the CIC "hack" is acceptable, unless
there's a better solution that I'm not aware of.

>> The second test is a bit redundant - it merely checks that both CREATE INDEX
>> and INSERT INTO fail the same way when the index tuple gets too large.
>> Before the fix there were some inconsistencies - the CREATE INDEX succeeded
>> because it used TOASTed data. So ultimately this tests the same thing, but
>> from a different perspective.
>
> Hmm. This one shows page size in the error messages, so it'll fail on
> nonstandard builds. I think we try to stay away from introducing those,
> so I'd leave this test out.

Hmm, OK. I don't think having "redundant" test is a big deal, but I
haven't thought about builds with different block sizes. I'll leave this
out.

>
> The code fix looks all right -- I'd just move the #include lines to
> their place. Patch 0002.
>

> You add this comment:
>
>> + /*
>> + * Do nothing if value is not of varlena type. We don't need to
>> + * care about NULL values here, thanks to bv_allnulls above.
>> + *
>> + * If value is stored EXTERNAL, must fetch it so we are not
>> + * depending on outside storage.
>> + *
>> + * XXX Is this actually true? Could it be that the summary is
>> + * NULL even for range with non-NULL data? E.g. degenerate bloom
>> + * filter may be thrown away, etc.
>> + */
>
> I think the XXX comment points to a bug that we don't have right now,
> since neither minmax nor inclusion can end up with a NULL summary
> starting from non-NULL data. But if the comment about bloom is correct,
> then surely it'll become a bug when bloom is added.
>

Yeah, but that'd be something for the bloom patch to fix, I think.

This got me thinking though - wouldn't it be better to handle too large
values by treating the range as "degenerate" (i.e. store NULL and
consider it as matching all queries), instead of failing the CREATE
INDEX or DML? I find the current behavior rather annoying, because it
depends on the other rows in the page range, not just on the one row the
user deals with. Perhaps this might even be considered an information
leak about the other data. Of course, not something this patch should
deal with.

> I don't think we need the second part of this comment:
>
>> +/*
>> + * This enables de-toasting of index entries. Needed until VACUUM is
>> + * smart enough to rebuild indexes from scratch.
>> + */
>
> ... because, surely, we're now never working on having VACUUM rebuild
> indexes from scratch. In fact, I wonder if we need the #define at
> all. I propose to remove all those #ifdef lines in your patch.
>

That's a verbatim copy of a comment from indextuple.c. IMHO we should
keep it the same in both places.

> The fix looks good to me. I just added a comment in 0003.
>

Thanks. Any opinions on fixing this in existing clusters? Any better
ideas than just giving users the SQL query to list possibly-affected
indexes?

regards

--
Tomas Vondra
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

In response to

Re: Fix brin_form_tuple to properly detoast data at 2020-11-05 17:17:17 from Alvaro Herrera

Responses

Re: Fix brin_form_tuple to properly detoast data at 2020-11-05 21:16:04 from Alvaro Herrera

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Matthias van de Meent	2020-11-05 18:07:34	Re: libpq compression
Previous Message	Justin Pryzby	2020-11-05 17:20:30	Re: Yet another fast GiST build