Re: again on index usage

From: Daniel Kalchev <daniel(at)digsys(dot)bg>
To: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc: mlw <markw(at)mohawksoft(dot)com>, Zeugswetter Andreas SB SD <ZeugswetterA(at)spardat(dot)at>, Bruce Momjian <pgman(at)candle(dot)pha(dot)pa(dot)us>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: again on index usage
Date: 2002-01-12 11:51:23
Message-ID: 200201121151.NAA02845@dcave.digsys.bg
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

>>>Tom Lane said:
> mlw <markw(at)mohawksoft(dot)com> writes:
> > ... Storage
> > devices are now black boxes. The only predictable advantage a
> > "sequential scan" can have on a modern computer is OS level caching.
>
> You mean read-ahead. True enough, but that "only advantage" is very
> significant. The 4.0 number did not come out of the air, it came
> from actual measurements.

On what OS? Linux? Windows? BSD? OSF/1? System V? All these differ
significantly in how buffer cache is managed. For example, the BSD 'soft
updates' will not penalize large directory updates, but not do any good for
sequential reads (considering what was said already about modern disks). SCSI
tag queueing will significantly improve raw disk reads ('sequential' again)
because of the low overhead of host<->SCSI subsystem communication - any
decent SCSI host adapter will do bus-master DMA, without the interference of
the processor (simplified as much as to illustrate it :). Todays IDE drives on
PC hardware don't do that! Which is not to say that only SCSI drive
controllers are intelligent enough - I still remember an older Motorola VME
based UNIX system (that now can only server the purpose of coffee table :),
where an MFM controller board had all the intelligence of the SCSI subsystem,
although it operated with 'dump' MFM disks. So many examples can be given here.

> I think the real point in this thread is that measurements on an idle
> system might not extrapolate very well to measurements on a heavily
> loaded system. I can see the point, but I don't really have time to
> investigate it right now. I'd be willing to reduce the default value of
> random_page_cost to something around 2, if someone can come up with
> experimental evidence justifying it ...

Agreed. My preference would be, that if you have reasonable enough test data,
that can be shared, many people on different platforms can run performance
tests and come up with an array of recommended values for their particular
OS/hardware configuration. I believe these two items are most significant for
the tuning of an installation.

Daniel Kalchev

Browse pgsql-hackers by date

  From Date Subject
Next Message Karl DeBisschop 2002-01-12 14:03:44 Re: Postgres in bash-mode
Previous Message David Terrell 2002-01-12 08:32:05 Re: Postgres in bash-mode