Quick Links

Re: Hook for Selectivity Estimation in Query Planning

From:	Andrei Lepikhov <lepihov(at)gmail(dot)com>
To:	Aleksander Alekseev <aleksander(at)timescale(dot)com>, PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>
Subject:	Re: Hook for Selectivity Estimation in Query Planning
Date:	2025-03-05 14:40:22
Message-ID:	478ee7e5-6ab9-4fe5-b782-e1210e0e3db5@gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On 5/3/2025 14:29, Aleksander Alekseev wrote:
> Hi,
>
>> I would like to discuss the introduction of a hook for evaluating the
>> selectivity of an expression when searching for an optimal query plan.
>> This topic has been brought up in various discussions, for example, in [1].
>>
>> [...]
>
> As I vaguely recall recent proposals like this ("Pluggable TOASTer" to
> name one) this approach was criticised. Hooks per se don't add value
> for the end user. They only put the burden of maintaining them on the
> community while all the real features are implemented in proprietar
> extensions. If you believe something is missing in Postgres,
> contribute it to the upstream so that anyone will benefit from it.
At first, I didn't find the reason for hooks' current existence in the
core. However, it's clear that hooks speed up the development of
extensions, which in turn enhances usability and popularity of the
project. This leads to a greater number of use cases and tests,
fostering community growth. I'm not sure what the purpose of the project
is except curiosity, but even then, extensions speed up the idea
validation process, don't they?
It's important to remember that not all extensions are proprietary. Does
TimescaleDB not provide value to both end users and the community?

Furthermore, extensions are necessary to address gaps that the community
may not work on by definition; for example, consider pg_hint_plan.

As I mentioned, the primary purpose of the hook is clear: to advance the
development of alternative statistics and estimation methods. For
instance, I've already come across proposals for multidimensional
histograms. Personally, I want to use this hook to implement zonal
ndistinct statistic extension to address the intra-column data skew issue.

Overall, I see that new hooks allow new [sometimes] open-source projects
and startups to emerge - not sure about enterprises' benefits.
Therefore, I'm not convinced by your current justification. Are there
any technical objections?

--
regards, Andrei Lepikhov

In response to

Re: Hook for Selectivity Estimation in Query Planning at 2025-03-05 13:29:51 from Aleksander Alekseev

Responses

Re: Hook for Selectivity Estimation in Query Planning at 2025-03-05 18:50:52 from Aleksander Alekseev

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Israel Barth Rubio	2025-03-05 14:47:33	Re: Add -k/--link option to pg_combinebackup
Previous Message	Matthias van de Meent	2025-03-05 14:37:37	Re: Hook for Selectivity Estimation in Query Planning