From: | Luc Vlaming <luc(at)swarm64(dot)com> |
---|---|
To: | Greg Nancarrow <gregn4422(at)gmail(dot)com> |
Cc: | Phil Florent <philflorent(at)hotmail(dot)com>, "pgsql-hackers(at)lists(dot)postgresql(dot)org" <pgsql-hackers(at)lists(dot)postgresql(dot)org> |
Subject: | Re: Parallel plans and "union all" subquery |
Date: | 2020-11-25 07:43:03 |
Message-ID: | 3d8fe7f0-692a-89c5-2ec2-64afd6d91c3e@swarm64.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
On 24-11-2020 01:44, Greg Nancarrow wrote:
> On Tue, Nov 24, 2020 at 2:34 AM Luc Vlaming <luc(at)swarm64(dot)com> wrote:
>>
>> Hi,
>>
>> For this problem there is a patch I created, which is registered under
>> https://commitfest.postgresql.org/30/2787/ that should fix this without
>> any workarounds. Maybe someone can take a look at it?
>>
>
> I tried your patch with the latest PG source code (24/11), but
> unfortunately a non-parallel plan was still produced in this case.
>
> test=# explain
> select count(*)
> from (select
> n1
> from drop_me
> union all
> values(1)) ua;
> QUERY PLAN
> --------------------------------------------------------------------------------
> Aggregate (cost=1889383.54..1889383.55 rows=1 width=8)
> -> Append (cost=0.00..1362834.03 rows=42123961 width=32)
> -> Seq Scan on drop_me (cost=0.00..730974.60 rows=42123960 width=32)
> -> Subquery Scan on "*SELECT* 2" (cost=0.00..0.02 rows=1 width=32)
> -> Result (cost=0.00..0.01 rows=1 width=4)
> (5 rows)
>
>
> That's not to say your patch doesn't have merit - but maybe just not a
> fix for this particular case.
>
> As before, if the SQL is tweaked to align the types for the UNION, you
> get a parallel plan:
>
> test=# explain
> select count(*)
> from (select
> n1
> from drop_me
> union all
> values(1::numeric)) ua;
> QUERY PLAN
> ----------------------------------------------------------------------------------------------------
> Finalize Aggregate (cost=821152.71..821152.72 rows=1 width=8)
> -> Gather (cost=821152.50..821152.71 rows=2 width=8)
> Workers Planned: 2
> -> Partial Aggregate (cost=820152.50..820152.51 rows=1 width=8)
> -> Parallel Append (cost=0.00..747235.71 rows=29166714 width=0)
> -> Result (cost=0.00..0.01 rows=1 width=0)
> -> Parallel Seq Scan on drop_me
> (cost=0.00..601402.13 rows=29166713 width=0)
> (7 rows)
>
>
> Regards,
> Greg Nancarrow
> Fujitsu Australia
>
Hi,
You're completely right, sorry for my error. I was too quick on assuming
my patch would work for this specific case too; I should have tested
that before replying. It looked very similar but turns out to not work
because of the upper rel not being considered parallel.
I would like to extend my patch to support this, or create a second
patch. This would however be significantly more involved because it
would require that we (always?) consider two paths whenever we process a
subquery: the best parallel plan and the best serial plan. Before I
emback on such a journey I would like some input on whether this would
be a very bad idea. Thoughts?
Regards,
Luc
Swarm64
From | Date | Subject | |
---|---|---|---|
Next Message | Peter Eisentraut | 2020-11-25 07:47:09 | Re: abstract Unix-domain sockets |
Previous Message | Fujii Masao | 2020-11-25 07:32:12 | Re: walsender bug: stuck during shutdown |