From: | Tomas Vondra <tomas(dot)vondra(at)2ndquadrant(dot)com> |
---|---|
To: | James Coleman <jtc331(at)gmail(dot)com> |
Cc: | Alvaro Herrera <alvherre(at)2ndquadrant(dot)com>, Michael Paquier <michael(at)paquier(dot)xyz>, Rafia Sabih <rafia(dot)pghackers(at)gmail(dot)com>, Peter Geoghegan <pg(at)bowt(dot)ie>, Simon Riggs <simon(at)2ndquadrant(dot)com>, Shaun Thomas <shaun(dot)thomas(at)2ndquadrant(dot)com>, Dmitry Dolgov <9erthalion6(at)gmail(dot)com>, Alexander Korotkov <a(dot)korotkov(at)postgrespro(dot)ru>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, PostgreSQL Developers <pgsql-hackers(at)lists(dot)postgresql(dot)org>, Andreas Karlsson <andreas(at)proxel(dot)se> |
Subject: | Re: [PATCH] Incremental sort (was: PoC: Partial sort) |
Date: | 2020-03-30 01:44:42 |
Message-ID: | 20200330014442.chqnwfmelm3ejsgr@development |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
Hi,
Attached is a slightly reorganized patch series. I've merged the fixes
into the appropriate matches, and I've also combined the two patches
adding incremental sort paths to additional places in planner.
A couple more comments:
1) I think the GUC documentation in src/sgml/config.sgml is a bit too
detailed, compared to the other enable_* GUCs. I wonder if there's a
better place where to move the details. What about adding some examples
and explanation to perform.sgml?
2) Looking at the explain output, the verbose mode looks like this:
test=# explain (verbose, analyze) select a from t order by a, b, c;
QUERY PLAN
--------------------------------------------------------------------------------------------------------------------------------------------------------------
Gather Merge (cost=66.31..816072.71 rows=8333226 width=24) (actual time=4.787..20092.555 rows=10000000 loops=1)
Output: a, b, c
Workers Planned: 2
Workers Launched: 2
-> Incremental Sort (cost=66.28..729200.36 rows=4166613 width=24) (actual time=1.308..14021.575 rows=3333333 loops=3)
Output: a, b, c
Sort Key: t.a, t.b, t.c
Presorted Key: t.a, t.b
Full-sort Groups: 4169 Sort Method: quicksort Memory: avg=30kB peak=30kB
Presorted Groups: 4144 Sort Method: quicksort Memory: avg=128kB peak=138kB
Worker 0: actual time=0.766..16122.368 rows=3841573 loops=1
Full-sort Groups: 6871 Sort Method: quicksort Memory: avg=30kB peak=30kB
Presorted Groups: 6823 Sort Method: quicksort Memory: avg=132kB peak=141kB
Worker 1: actual time=1.986..16189.831 rows=3845490 loops=1
Full-sort Groups: 6874 Sort Method: quicksort Memory: avg=30kB peak=30kB
Presorted Groups: 6847 Sort Method: quicksort Memory: avg=130kB peak=139kB
-> Parallel Index Scan using t_a_b_idx on public.t (cost=0.43..382365.92 rows=4166613 width=24) (actual time=0.040..9808.449 rows=3333333 loops=3)
Output: a, b, c
Worker 0: actual time=0.048..11275.178 rows=3841573 loops=1
Worker 1: actual time=0.041..11314.133 rows=3845490 loops=1
Planning Time: 0.166 ms
Execution Time: 25135.029 ms
(22 rows)
There seems to be missing indentation for the first line of worker info.
I'm still not quite convinced we should be printing two lines - I know
you mentioned the lines might be too long, but see how long the other
lines may get ...
3) I see the new nodes (plan state, ...) have "presortedCols" which does
not indicate it's a "number of". I think we usually prefix names of such
fields "n" or "num". What about "nPresortedCols"? (Nitpicking, I know.)
My TODO for this patch is this:
- review the costing (I think the estimates are OK, but I recall I
haven't been entirely happy with how it's broken into functions.)
- review the tuplesort changes (the memory contexts etc.)
- do more testing of performance impact on planning
regards
--
Tomas Vondra http://www.2ndQuadrant.com
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services
Attachment | Content-Type | Size |
---|---|---|
v46-0001-Consider-low-startup-cost-when-adding-partial-path.patch | text/plain | 4.6 KB |
v46-0002-Implement-incremental-sort.patch | text/plain | 164.5 KB |
v46-0003-explain-fixes.patch | text/plain | 15.8 KB |
v46-0004-Consider-incremental-sort-paths-in-additional-places.patch | text/plain | 24.2 KB |
From | Date | Subject | |
---|---|---|---|
Next Message | Michael Paquier | 2020-03-30 01:56:48 | Re: [Patch] pg_rewind: options to use restore_command from recovery.conf or command line |
Previous Message | David Steele | 2020-03-30 01:23:06 | Re: backup manifests |