Re: Startup cost of sequential scan

From: Konstantin Knizhnik <k(dot)knizhnik(at)postgrespro(dot)ru>
To: pgsql-hackers(at)lists(dot)postgresql(dot)org
Subject: Re: Startup cost of sequential scan
Date: 2018-08-30 15:08:00
Message-ID: 850fd7e3-3331-937b-c2b8-3f7ce2d7c257@postgrespro.ru
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On 30.08.2018 17:58, Tom Lane wrote:
> Alexander Korotkov <a(dot)korotkov(at)postgrespro(dot)ru> writes:
>> On Thu, Aug 30, 2018 at 5:05 PM Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> wrote:
>>> Because it's what the mental model of startup cost says it should be.
>> From this model we make a conclusion that we're starting getting rows
>> from sequential scan sooner than from index scan. And this conclusion
>> doesn't reflect reality.
> No, startup cost is not the "time to find the first row". It's overhead
> paid before you even get to start examining rows.
But it seems to me that calculation of cost in LIMIT node contradicts
with this statement:

            pathnode->path.startup_cost +=
                (subpath->total_cost - subpath->startup_cost)
                * offset_rows / subpath->rows;

>
> I'm disinclined to consider fundamental changes to our costing model
> on the basis of this example. The fact that the rowcount estimates are
> so far off reality means that you're basically looking at "garbage in,
> garbage out" for the cost calculations --- and applying a small LIMIT
> just magnifies that.
>
> It'd be more useful to think first about how to make the selectivity
> estimates better; after that, we might or might not still think there's
> a costing issue.
>
> regards, tom lane
>

--
Konstantin Knizhnik
Postgres Professional: http://www.postgrespro.com
The Russian Postgres Company

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Alexander Korotkov 2018-08-30 15:23:30 Re: Startup cost of sequential scan
Previous Message Tom Lane 2018-08-30 14:58:38 Re: Startup cost of sequential scan