Re: DISTINCT vs. GROUP BY

From: "Jim C(dot) Nasby" <jnasby(at)pervasive(dot)com>
To: Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>
Cc: Greg Stark <gsstark(at)mit(dot)edu>, Neil Conway <neilc(at)samurai(dot)com>, Hans-Jürgen Schönig <postgres(at)cybertec(dot)at>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: DISTINCT vs. GROUP BY
Date: 2005-09-20 22:07:50
Message-ID: 20050920220750.GT7630@pervasive.com
Lists: pgsql-hackers

On Tue, Sep 20, 2005 at 05:05:05PM -0400, Bruce Momjian wrote:
> Jim C. Nasby wrote:
> > On Mon, Sep 19, 2005 at 10:16:36PM -0400, Bruce Momjian wrote:
> > >
> > > Added to TODO:
> > >
> > > * Allow DISTINCT to use hashing like GROUP BY
> >
> > 3 lines above we have...
> > Consider using hash buckets to do DISTINCT, rather than sorting
> > This would be beneficial when there are few distinct values.
>
> OK, I have merged these items into one.
>
> >
> > Can you add
> > http://archives.postgresql.org/pgsql-hackers/2005-09/msg00810.php? All I
> > could find on the other TODO was
> > http://archives.postgresql.org/pgsql-committers/2004-09/msg00028.php,
> > which doesn't help much...
>
> What do these URL's have that the current TODO does not?
>
> * Consider using hash buckets to do DISTINCT, rather than sorting
>
> This would be beneficial when there are few distinct values. This is
> already used by GROUP BY.

Maybe it's just me, but the recent run-through of the TODO list
indicated that there's a fair number of items that people look at and
don't really know what they mean. Providing the context (i.e., the email
thread) that spawned an idea seems extremely valuable for explaining
the idea behind a TODO item. Those threads also usually contain valuable
tips about how a TODO could be implemented. In this example, having a
quick reference to the discussion about hashagg and first()/last() would
probably prove useful.
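
For anyone reading the TODO item without the thread at hand, the
difference between the two strategies can be sketched roughly as below.
This is only an illustration in Python, not PostgreSQL executor code;
the function names distinct_by_sorting and distinct_by_hashing are made
up for the example.

```python
def distinct_by_sorting(rows):
    """Sort-based DISTINCT: sort all rows, then drop adjacent duplicates."""
    result = []
    for row in sorted(rows):
        # After sorting, equal values are neighbours, so comparing with
        # the last emitted value is enough to detect a duplicate.
        if not result or result[-1] != row:
            result.append(row)
    return result


def distinct_by_hashing(rows):
    """Hash-based DISTINCT: keep one hash-table entry per distinct value."""
    seen = set()
    result = []
    for row in rows:
        # One hash lookup per input row; no sort of the full input.
        if row not in seen:
            seen.add(row)
            result.append(row)
    return result


# Hypothetical input: many rows, few distinct values.
rows = [3, 1, 3, 2, 1, 3, 3, 2]
sorted_distinct = distinct_by_sorting(rows)
hashed_distinct = distinct_by_hashing(rows)
```

Both functions return the same set of values. The sorting version has
to order every input row, while the hashing version only holds one
entry per distinct value, which is why hashing looks attractive when
there are few distinct values. The hashed output is not sorted, so a
query that also needs ordering would still have to sort the (smaller)
result.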
--
Jim C. Nasby, Sr. Engineering Consultant jnasby(at)pervasive(dot)com
Pervasive Software http://pervasive.com work: 512-231-6117
vcard: http://jim.nasby.net/pervasive.vcf cell: 512-569-9461
