Quick Links

Re: difference when using 'distinct on'

From:	Stephan Szabo <sszabo(at)megazone(dot)bigpanda(dot)com>
To:	"Johnson, Shaunn" <SJohnson6(at)bcbsm(dot)com>
Cc:	"pg-general (E-mail)" <pgsql-general(at)postgresql(dot)org>
Subject:	Re: difference when using 'distinct on'
Date:	2003-09-13 02:14:09
Message-ID:	20030912190759.F4046@megazone.bigpanda.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

On Fri, 12 Sep 2003, Johnson, Shaunn wrote:

> Howdy:
>
> Can someone tell what the difference (and why
> you would use it) is between the following:
>
> [snip]
> select distinct on (col_1, col_2),
> col_1,
> col_2,
> col_3
> from t_table
>
> --
>
> select distinct
> col_1,
> col_2,
> col_3
> from t_table
> [/snip]
>
> In the first example, is it just getting
> the unique rows for the first two columns?

In the first, for each set of rows that have a distinct col1,col2
value it's taking one of those rows and using its col3 value.
It's like group by, but less restrictive since you don't need
to use a set function on col_3.

In general distinct on in that fashion is most usable when
combined with an order by so that you can get a particular row
from each set. For example, you might say do something like:
select distinct on (col1, col2) col1, col2, col3 from t_table
order by col1, col2, col4;
In this case you should get the col3 value for each col1,col2
distinct group that corresponds to the row having the lowest col4
value.

In response to

difference when using 'distinct on' at 2003-09-12 13:29:39 from Johnson, Shaunn

Browse pgsql-general by date

	From	Date	Subject
Next Message	Jason Alan Smith	2003-09-13 02:43:04
Previous Message	Ron Johnson	2003-09-12 23:54:00	need for in-place upgrades (was Re: State of Beta 2)