From: | Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> |
---|---|
To: | Andres Freund <andres(at)anarazel(dot)de> |
Cc: | Peter Eisentraut <peter(dot)eisentraut(at)2ndquadrant(dot)com>, Regina Obe <lr(at)pcorp(dot)us>, "'PostgreSQL-development'" <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Parallel Aggregation support for aggregate functions that use transitions not implemented for array_agg |
Date: | 2017-06-07 04:03:15 |
Message-ID: | 17091.1496808195@sss.pgh.pa.us |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
Andres Freund <andres(at)anarazel(dot)de> writes:
> On 2017-06-06 23:32:53 -0400, Peter Eisentraut wrote:
>> I'm not sure how you would parallelize these, since in most uses you
>> want to have a deterministic output order.
> Unless you specify ORDER BY you don't really have that anyway, consider
> hash-aggregation. If you want deterministic order, you really need an
> ORDER BY inside the aggregate.
Hash aggregation does not destroy the property that array_agg/string_agg
will produce results whose components appear in the order that the
subquery emitted them in. It only causes the various aggregate results
in a GROUP BY query to themselves appear in random order.
Now you could argue that the subquery might've gotten parallelized and
emitted its outputs in some random order, so doing the same thing one
level further up changes nothing. But you can't defend this on this
basis that it was historically unpredictable, because it wasn't.
regards, tom lane
From | Date | Subject | |
---|---|---|---|
Next Message | Andres Freund | 2017-06-07 04:06:13 | Re: Parallel Aggregation support for aggregate functions that use transitions not implemented for array_agg |
Previous Message | Masahiko Sawada | 2017-06-07 03:52:28 | Re: Fix tab-completion of ALTER SUBSCRIPTION SET PUBLICATION |