Quick Links

Re: Improving btree performance through specializing by key shape, take 2

From:	Matthias van de Meent <boekewurm+postgres(at)gmail(dot)com>
To:
Cc:	Peter Geoghegan <pg(at)bowt(dot)ie>, PostgreSQL Hackers <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Improving btree performance through specializing by key shape, take 2
Date:	2022-07-04 14:18:35
Message-ID:	CAEze2Wjc-1_hFd-MhRrA6Wczd1TUEq4keqc3aP3+u-px6hvwcA@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Sun, 5 Jun 2022 at 21:12, Matthias van de Meent
<boekewurm+postgres(at)gmail(dot)com> wrote:
> While working on benchmarking the v2 patchset, I noticed no
> improvement on reindex, which I attributed to forgetting to also
> specialize comparetup_index_btree in tuplesorth.c. After adding the
> specialization there as well (attached in v3), reindex performance
> improved significantly too.

PFA version 4 of this patchset. Changes:
- Silence the compiler warnings,
- Extract itup_attiter code into its own header, so that we don't get
compiler warnings and pass the cfbot builds,
- Re-order patches to be in a more logical order,
- Updates to the dynamic prefix compression so that we don't always do
a _bt_compare on the pages' highkey. memcmp(parentpage_rightsep,
highkey) == 0 is often true, and allows us to skip the indirect
function calls in _bt_compare most of the time.

Based on local measurements, this patchset improves performance for
all key shapes, with 2% to 600+% increased throughput (2-86% faster
operations), depending on key shape. As can be seen in the attached
pgbench output, the performance results are based on beta1 (f00a4f02,
dated 2022-06-04) and thus not 100% current, but considering that no
significant changes have been made in the btree AM code since, I
believe these measurements are still quite valid.

I also didn't re-run the numbers for the main branch; but I compared
against the results of master in the last mail. This is because I run
the performance tests locally, and a 7-iteration pgbench run for
master requires 9 hours of downtime with this dataset, during which I
can't use the system so as to not interfere with the performance
tests. As such, I considered rerunning the benchmark for master to be
not worth the time/effort/cost with the little changes that were
committed.

Kind regards,

Matthias van de Meent.

Attachment	Content-Type	Size
pgbench_log.patched_v4.txt	text/plain	2.9 KB
v4-0001-Specialize-nbtree-functions-on-btree-key-shape.patch	application/octet-stream	233.6 KB
v4-0002-Use-specialized-attribute-iterators-in-backend-nb.patch	application/octet-stream	9.3 KB
v4-0003-Specialize-the-nbtree-rd_indam-entry.patch	application/octet-stream	4.0 KB
v4-0004-Optimize-attribute-iterator-access-for-single-col.patch	application/octet-stream	3.5 KB
v4-0005-Add-a-function-whose-task-it-is-to-populate-all-a.patch	application/octet-stream	3.6 KB
v4-0006-Implement-specialized-uncacheable-attribute-itera.patch	application/octet-stream	9.5 KB
v4-0007-Add-specialization-to-btree-index-creation.patch	application/octet-stream	12.0 KB
v4-0008-Implement-dynamic-prefix-compression-in-nbtree.patch	application/octet-stream	22.7 KB

In response to

Re: Improving btree performance through specializing by key shape, take 2 at 2022-06-05 19:12:36 from Matthias van de Meent

Responses

Re: Improving btree performance through specializing by key shape, take 2 at 2022-07-27 07:35:24 from Matthias van de Meent

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Peter Eisentraut	2022-07-04 14:27:20	Re: TAP output format in pg_regress
Previous Message	Zhihong Yu	2022-07-04 14:16:31	Re: Removing unneeded self joins