Re: Mistake in documentation for CREATE STATISTICS

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: theolassonder(at)hotmail(dot)com
Cc: pgsql-docs(at)lists(dot)postgresql(dot)org
Subject: Re: Mistake in documentation for CREATE STATISTICS
Date: 2019-06-14 16:01:06
Message-ID: 29901.1560528066@sss.pgh.pa.us
Lists: pgsql-docs

PG Doc comments form <noreply(at)postgresql(dot)org> writes:
> The example on https://www.postgresql.org/docs/11/sql-createstatistics.html
> seems wrong: instead of "EXPLAIN ANALYZE SELECT ..." it should say "EXPLAIN
> SELECT ...". If changed that way, the query optimizer estimates the number
> of rows at 1 and then 100 after statistics are created which is indeed how
> many the query returns. In the current wording, with ANALYZE, the query
> optimizer estimates the number of rows at 100 for both the first and second
> select.

I think you're mistaking actual rows for estimated rows in the output
of EXPLAIN ANALYZE.

The point of using EXPLAIN ANALYZE here is just to make it easier to see
that the estimate is indeed wrong, by providing the actual count alongside
the estimate. I don't think that taking out ANALYZE would be an
improvement. For instance, in the first EXPLAIN I get

Gather (cost=1000.00..11675.10 rows=1 width=8) (actual time=1.215..45.218 rows=100 loops=1)

which without ANALYZE would just be

Gather (cost=1000.00..11675.10 rows=1 width=8)

The rows=1 estimate is equally wrong either way, but you don't get to
see the correct value without ANALYZE.
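
To make the two numbers easier to tell apart, here is a minimal Python sketch that pulls the estimated and actual row counts out of a plan line. It assumes the text-format node lines quoted above; the helper name plan_rows and the regular expression are illustrative only and are not part of PostgreSQL or any client library.

```python
import re

# Assumed line shape (text-format EXPLAIN output, as quoted in this message):
#   Node (cost=S..T rows=EST width=W) (actual time=A..B rows=ACT loops=N)
# The second parenthesized group only appears when ANALYZE is used.
PLAN_ROWS = re.compile(
    r"\(cost=[\d.]+\.\.[\d.]+ rows=(?P<est>\d+) width=\d+\)"
    r"(?: \(actual time=[\d.]+\.\.[\d.]+ rows=(?P<actual>\d+) loops=\d+\))?"
)


def plan_rows(line):
    """Return (estimated_rows, actual_rows); actual_rows is None without ANALYZE."""
    m = PLAN_ROWS.search(line)
    if m is None:
        raise ValueError("not a plan node line: %r" % line)
    actual = m.group("actual")
    return int(m.group("est")), (int(actual) if actual is not None else None)


with_analyze = (
    "Gather (cost=1000.00..11675.10 rows=1 width=8) "
    "(actual time=1.215..45.218 rows=100 loops=1)"
)
without_analyze = "Gather (cost=1000.00..11675.10 rows=1 width=8)"

print(plan_rows(with_analyze))     # (1, 100)
print(plan_rows(without_analyze))  # (1, None)
```

The first rows= value (in the cost group) is the planner's estimate; the second (in the actual group) is what the executor measured, and it is simply absent when ANALYZE is omitted.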

regards, tom lane
