Quick Links

Re: Reducing memory consumed by RestrictInfo list translations in partitionwise join planning

From:	Ashutosh Bapat <ashutosh(dot)bapat(dot)oss(at)gmail(dot)com>
To:	Amit Langote <amitlangote09(at)gmail(dot)com>
Cc:	Alvaro Herrera <alvherre(at)alvh(dot)no-ip(dot)org>, Dmitry Dolgov <9erthalion6(at)gmail(dot)com>, tomas(at)vondra(dot)me, vignesh C <vignesh21(at)gmail(dot)com>, pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>, Richard Guo <guofenglinux(at)gmail(dot)com>, David Rowley <dgrowleyml(at)gmail(dot)com>
Subject:	Re: Reducing memory consumed by RestrictInfo list translations in partitionwise join planning
Date:	2025-03-18 11:48:32
Message-ID:	CAExHW5vNk4f8VSTnizL-avREYhPgocEtH3E7MCEJgp=R33ChyQ@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

Hi Amit,

On Tue, Mar 18, 2025 at 4:02 PM Amit Langote <amitlangote09(at)gmail(dot)com> wrote:
>
> I ran a couple of benchmarks of my own as follows.
>
> cat benchmark_amit.sh
> for p in 0 16 32 64 128 256 512 1024; do
> echo -ne "$p\t";
> pgbench -i --partitions=$p > /dev/null 2>&1
> pgbench -n -T 30 -f /tmp/query.sql | grep latency | awk '{print $4}';
> done
>
> For a 3-way join:
> cat /tmp/query.sql
> select * from pgbench_accounts t1, pgbench_accounts t2,
> pgbench_accounts t3 where t2.aid = t1.aid AND t3.aid = t2.aid;
>
> nparts master patched %change
> 0 35.508 36.066 1.571
> 16 66.79 67.704 1.368
> 32 67.774 68.179 0.598
> 64 51.023 50.471 -1.082
> 128 56.4 55.759 -1.137
> 256 70.134 68.401 -2.471
> 512 120.621 113.552 -5.861
> 1024 405.339 312.726 -22.848
>
> For a 6-way jon
> cat /tmp/query.sql
> select * from pgbench_accounts t1, pgbench_accounts t2,
> pgbench_accounts t3, pgbench_accounts t4, pgbench_accounts t5,
> pgbench_accounts t6 where t2.aid = t1.aid AND t3.aid = t2.aid and
> t4.aid = t3.aid and t5.aid = t4.aid and t6.aid = t5.aid;
>
> nparts master patched %change
> 0 66.144 64.932 -1.832
> 16 100.874 100.491 -0.380
> 32 104.645 104.536 -0.104
> 64 114.415 109.193 -4.564
> 128 145.422 130.458 -10.290
> 256 273.761 209.919 -23.320
> 512 1359.896 616.295 -54.681
> 1024 7183.765 2857.086 -60.229
>
> -60% means 60% reduction in latency due to the patch.
>
> As others have already found, performance improvements become
> substantial as both partition count and join depth increase. The patch
> seems to have minimal impact at low partition counts and low join
> complexity -- base cases (e.g., 0–32 partitions, 3-way joins) are
> essentially unchanged, which is good to see.

I assume these are with enable_partitionwise_join = off since it's not
enabled in the query.

With lower number of partitions and joins the execution time will be
substantial compared to planning time hence the changes in execution
time will affect the changes in latency. So I agree that the
difference in numbers for lower number of partitions and joins is
noise. That agrees with my results posted a few emails before.

At a higher number of partitions and joins, the planning time
dominates the latency. Hence any variation in the planning time
dominates a variation in the latency. Thus the improvements seen here
are due to improvements in the planning time. Again that agrees with
my results in the previous email.

>
> I haven’t measured memory increase from the patch, but let me know if
> that has already been evaluated and shown not to be a showstopper.

There's a slight increase in memory consumption because of the hash
table but it's very minimal.

each column is s=difference in memory consumed, md = memory consumed
without patch, pd = memory consumed with patch. -ve difference shows
increase in memory consumption.

%wise this is 3-4% maximum.
>
> I also noticed using perf that create_join_clause() is a hotspot when
> running without the patch, especially at high partition counts (> 500)
> and the more join relations.

That's an interesting observation. Any hotspot in planning would show
up as hotspot in total execution time. So expected.

>
> Let me know if my methodology seems off or if the results look reasonable.

Thanks for benchmarking using a different method. The results agree
with my results.
--
Best Wishes,
Ashutosh Bapat

In response to

Re: Reducing memory consumed by RestrictInfo list translations in partitionwise join planning at 2025-03-18 10:32:21 from Amit Langote

Responses

Re: Reducing memory consumed by RestrictInfo list translations in partitionwise join planning at 2025-03-19 02:52:13 from Amit Langote

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	John Naylor	2025-03-18 11:50:59	Re: CRC32C Parallel Computation Optimization on ARM
Previous Message	Amit Kapila	2025-03-18 11:47:25	Re: Adding a '--clean-publisher-objects' option to 'pg_createsubscriber' utility.