Quick Links

Re: Why scan all columns when we select distinct c1?

From:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To:	Ron Johnson <ronljohnsonjr(at)gmail(dot)com>
Cc:	Yongtao Huang <yongtaoh2022(at)gmail(dot)com>, pgsql-general(at)postgresql(dot)org
Subject:	Re: Why scan all columns when we select distinct c1?
Date:	2024-01-14 15:46:03
Message-ID:	1239078.1705247163@sss.pgh.pa.us
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

Ron Johnson <ronljohnsonjr(at)gmail(dot)com> writes:
> On Sun, Jan 14, 2024 at 6:18 AM Yongtao Huang <yongtaoh2022(at)gmail(dot)com>
> wrote:
>> gpadmin=# create table t1 (c1 int, c2 text);
>> CREATE TABLE
>> gpadmin=# explain (costs off, verbose) select distinct c1 from t1;
>> QUERY PLAN
>> -----------------------------
>> HashAggregate
>> Output: c1
>> Group Key: t1.c1
>> -> Seq Scan on public.t1
>> Output: c1, c2 <---- pay attention <---- !!!
>> (5 rows)
>>
>> My question is why scan all columns in PG 16.01?

> You can't scan just one column of a row-oriented table.
> The real question is why it mentions c2.

The planner did that so that the SeqScan step doesn't have to
perform a projection: it can just return (a pointer to)
the physical tuple it found in the table, without doing extra
work to form a tuple containing only c1. The upper HashAgg
step won't really care. See use_physical_tlist() in createplan.c.

What I'm confused about is why 9.4 didn't do the same.
That optimization heuristic is very old, and certainly
would be applied by 9.4 in some circumstances. Testing
says the behavior in this specific case changed at 9.6.
I'm not quite interested enough to drill down further...

regards, tom lane

In response to

Re: Why scan all columns when we select distinct c1? at 2024-01-14 14:51:28 from Ron Johnson

Responses

Re: Why scan all columns when we select distinct c1? at 2024-01-15 01:32:56 from Yongtao Huang
Re: Why scan all columns when we select distinct c1? at 2024-01-16 19:48:33 from Jim Nasby

Browse pgsql-general by date

	From	Date	Subject
Next Message	Adrian Klaver	2024-01-14 16:50:05	Re: Help with "Create Extension unaccent"
Previous Message	Andreas Kretschmer	2024-01-14 15:31:50	Re: Why scan all columns when we select distinct c1?