Re: [PATCH] Add sortsupport for range types and btree_gist

From: Heikki Linnakangas <hlinnaka(at)iki(dot)fi>
To: Bernd Helmle <mailings(at)oopsware(dot)de>, Andrey Borodin <x4mmm(at)yandex-team(dot)ru>
Cc: Michael Paquier <michael(at)paquier(dot)xyz>, jian he <jian(dot)universality(at)gmail(dot)com>, PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>
Subject: Re: [PATCH] Add sortsupport for range types and btree_gist
Date: 2025-04-02 19:41:48
Message-ID: 2d3078fa-4700-431f-99a5-91ae8ee3bf86@iki.fi
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On 02/04/2025 20:18, Heikki Linnakangas wrote:
> So I added it for the btree opfamily too, and moved the function to
> rangetypes.c since it's not just for gist anymore. I Ccmmitted that
> part, and will start looking more closely at the remaining btree_gist
> parts now.

Here's an updated version of the remaining parts. Found a couple of bugs:

* The stuff to save the FmgrInfo for gbt_enum_sortsupport() was broken.
It saved the address of FmgrInfo struct that was allocated in a local
variable, on the stack, in SortSupport->ssup-extra, and expected it to
be valid on subsequent call to gbt_enum_ssup_cmp(). It went unnoticed
because enum_cmp() only accesses the FmgrInfo struct if it encounters
odd enum values, i.e. enum values that have been added with ALTER TYPE
ADD VALUE. I fixed that, and modified the existing enum test to cover
that case.

* The gist_bpchar_ops opfamily was using the built-in
bpchar_sortsupport() function directly. That's not right, you need to
have a shim that extracts and comparse just the 'lower' value, just like
for all other datatypes. I think that was just a simple oversight, but
it happened to pass the tests because bpchar_sortsupport() would not
outright crash, even though we were passing it garbage. It's interesting
that because the gist sortsupport function is used just to order the
input during build, everything still works if it sorts the input to a
completely bogus ordering, it just gets slower. At one point while
fixing that, I also accidentally used "btcharcmp" instead of
"bpcharcmp", and all the tests passed with that too.

Those are now fixed. I also harmonized the comments to use the same
phrasing for all the datatypes, marked all the sortsupport functions as
PARALLEL SAFE, and reformatted slightly.

> + <para>
> + By default <filename>btree_gist</filename> builds <acronym>GiST</acronym> index with
> + <function>sortsupport</function> in <firstterm>sorted</firstterm> mode. This usually results in
> + much faster index built speed. It is still possible to revert to buffered built strategy
> + by using the <literal>buffering</literal> parameter when creating the index.
> + </para>

I'm inclined to leave out this paragraph. That's not specific to the
btree_gist module, you can make any GiST index slower by disabling the
sorted mode; but why would you do that? Let's mention in the release
notes that btree_gist index builds got faster, and leave it at that.

--
Heikki Linnakangas
Neon (https://neon.tech)

Attachment Content-Type Size
v2-0001-Add-support-for-sorted-gist-index-builds-to-btree.patch text/x-patch 43.2 KB

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Sami Imseih 2025-04-02 19:43:13 Re: Proposal - Allow extensions to set a Plan Identifier
Previous Message Andres Freund 2025-04-02 19:36:24 Re: Incorrect result of bitmap heap scan.