pgsql: Ignore nullingrels when looking up statistics

From: Richard Guo <rguo(at)postgresql(dot)org>
To: pgsql-committers(at)lists(dot)postgresql(dot)org
Subject: pgsql: Ignore nullingrels when looking up statistics
Date: 2025-01-02 09:07:01
Message-ID: E1tTHAS-002cNz-I9@gemulon.postgresql.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-committers

Ignore nullingrels when looking up statistics

When looking up statistical data about an expression, we do not need
to concern ourselves with the outer joins that could null the
Vars/PHVs contained in the expression. Accounting for nullingrels in
the expression could cause estimate_num_groups to count the same Var
multiple times if it's marked with different nullingrels. This is
incorrect, and could lead to "ERROR: corrupt MVNDistinct entry" when
searching for multivariate n-distinct.

Furthermore, the nullingrels could prevent us from matching an
expression to expressional index columns or to the expressions in
extended statistics, leading to inaccurate estimates.

To fix, strip out all the nullingrels from the expression before we
look up statistical data about it. There is one ensuing plan change
in the regression tests, but it looks reasonable and does not
compromise its original purpose.

This patch could result in plan changes, but it fixes an actual bug,
so back-patch to v16 where the outer-join-aware-Var infrastructure was
introduced.

Author: Richard Guo
Discussion: https://postgr.es/m/CAMbWs4-2Z4k+nFTiZe0Qbu5n8juUWenDAtMzi98bAZQtwHx0-w@mail.gmail.com

Branch
------
master

Details
-------
https://git.postgresql.org/pg/commitdiff/e28033fe1af8037e0fec8bb3a32fabbe18ac06b1

Modified Files
--------------
src/backend/utils/adt/selfuncs.c | 29 ++++++++++++++++++++++++++---
src/test/regress/expected/join.out | 26 ++++++++++++++++++++++++--
src/test/regress/sql/join.sql | 13 +++++++++++++
3 files changed, 63 insertions(+), 5 deletions(-)

Browse pgsql-committers by date

  From Date Subject
Next Message Richard Guo 2025-01-02 09:08:09 pgsql: Ignore nullingrels when looking up statistics
Previous Message David Rowley 2025-01-02 09:04:26 pgsql: Fix outdated CHUNKHDRSZ value in nodeAgg.c