Re: Feature request for count_estimate(samplesize) aggregate or SAMPLE keyword

From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Torge <torgato(at)posteo(dot)de>
Cc: pgsql-general(at)lists(dot)postgresql(dot)org
Subject: Re: Feature request for count_estimate(samplesize) aggregate or SAMPLE keyword
Date: 2022-09-01 01:47:32
Message-ID: 243115.1661996852@sss.pgh.pa.us
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

Torge <torgato(at)posteo(dot)de> writes:
> Now I would like to humbly propose a feature that gives an easy way to
> get a quick count estimate for any condition - index based or not -
> based on a random sample of rows, that does not require a custom
> function creation or complex SQL statement

Can't you do that already using TABLESAMPLE? For example, to
use a 1% sample:

select count(*) * 100 from mytab tablesample system(1) where <condition>;

You do have to remember to multiply by the factor corresponding
to your sample rate, but aside from that annoyance this (a)
already exists, (b) is SQL-standard, and (c) can be adapted to
a lot of other kinds of analysis besides plain count(*).

regards, tom lane

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Torge 2022-09-01 01:49:40 Re: Feature request for count_estimate(samplesize) aggregate or SAMPLE keyword
Previous Message Torge 2022-09-01 01:07:09 Feature request for count_estimate(samplesize) aggregate or SAMPLE keyword