
Re: PoC: Using Count-Min Sketch for join cardinality estimation

From:	John Naylor <john(dot)naylor(at)enterprisedb(dot)com>
To:	Tomas Vondra <tomas(dot)vondra(at)enterprisedb(dot)com>
Cc:	PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: PoC: Using Count-Min Sketch for join cardinality estimation
Date:	2021-06-18 19:54:40
Message-ID:	CAFBsxsHw-Z9vFSi7j1Yzxw7u_KcjA6c79n4QZiWwD=j4=LR_=Q@mail.gmail.com
Lists:	pgsql-hackers

On Fri, Jun 18, 2021 at 3:43 PM Tomas Vondra <tomas(dot)vondra(at)enterprisedb(dot)com>
wrote:

> Sorry, I'm not sure what you mean by "we set the number of MCVs to the
> number of histograms" :-(
>
> When you say "MCV limit" you mean that we limit the number of items to
> statistics target, right? I agree plan time is one concern - but it's
> also about analyze, as we need larger sample to build a larger MCV or
> histogram (as the paper you referenced shows).

Ah, I didn't realize the theoretical limit applied to the MCVs too, but
that makes sense since they're basically singleton histogram buckets.

--
John Naylor
EDB: http://www.enterprisedb.com

In response to

Re: PoC: Using Count-Min Sketch for join cardinality estimation at 2021-06-18 19:43:24 from Tomas Vondra

Responses

Re: PoC: Using Count-Min Sketch for join cardinality estimation at 2021-06-18 20:24:45 from Tomas Vondra
