From: | Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us> |
---|---|
To: | Martijn van Oosterhout <kleptog(at)cupid(dot)suninternet(dot)com> |
Cc: | Ian Harding <iharding(at)pakrat(dot)com>, pgsql-general(at)postgresql(dot)org |
Subject: | Re: Re: Loading optimization |
Date: | 2001-01-09 15:51:35 |
Message-ID: | 200101091551.KAA07785@candle.pha.pa.us |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-general |
> Ian Harding wrote:
> >
> > Gary Wesley wrote:
> >
> > > Is there any advantage to having data sorted before populating it into a
> > > table?
> > > (In 6.5)
> > >
> > > Gary Wesley
> >
> > Yes. You have effectively loaded it with a clustered index. If you
> > cluster an index on the sort column after loading it, the sort will remain
> > in effect and will speed queries/joins that use that column.
>
> But does postgres actually use the fact that the data is clustered? I
> keep thinking that here I could cluster all our data such that a
> sequential
> search is almost always a bad idea but I have no idea how to relate that
> fact to postgres...
Well, clustering certainly speeds up index access to multiple heap
values because duplicate values are all on the same heap page. One
thing that is missing is that there is no preference for index scans for
clustered indexes.
Because the clustering is not permanent, but becomes unclustered as data
is added/modified, there is no easy way to know if the clustering is
still valid.
--
Bruce Momjian | http://candle.pha.pa.us
pgman(at)candle(dot)pha(dot)pa(dot)us | (610) 853-3000
+ If your life is a hard drive, | 830 Blythe Avenue
+ Christ can be your backup. | Drexel Hill, Pennsylvania 19026
From | Date | Subject | |
---|---|---|---|
Next Message | Bruce Momjian | 2001-01-09 16:12:06 | Re: ECPG could not connect to the database. |
Previous Message | admin | 2001-01-09 14:48:54 | odbc |