Re: checking for a NULL date in a partitioned table kills performance

From: "Wetmore, Matthew (CTR)" <Matthew(dot)Wetmore(at)mdlive(dot)com>
To: Scott Ribe <scott_ribe(at)elevated-dev(dot)com>, Pgsql-admin <pgsql-admin(at)lists(dot)postgresql(dot)org>
Subject: Re: checking for a NULL date in a partitioned table kills performance
Date: 2024-08-23 15:49:27
Message-ID: 17C7D941-73CB-41BF-BE8F-BB2788049F1F@glbcore.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-admin pgsql-performance

I have had this issue in the past.

The real admin fix to this is to have a NULL replacement character that prevents this.
This does a few things:

1. An index will index on a replacement character ( I use <->)
2. A join is easier on a replacement character than NULL (Nulls last/first avoided)
3. Stops all evil NULL rules.

We strive to fix things, but the real solution, IMHO, is better arch design and better fundamental understanding of how NULL works.

Partitioned tables under 500M-750M rows will always have these performance issues.

Great ideas on the workaround’s though, I do understand sometimes you inherit a bad db.

From: Scott Ribe <scott_ribe(at)elevated-dev(dot)com>
Date: Friday, August 23, 2024 at 8:17 AM
To: Pgsql-admin <pgsql-admin(at)lists(dot)postgresql(dot)org>
Subject: Re: checking for a NULL date in a partitioned table kills performance

> On Aug 23, 2024, at 5: 39 AM, Doug Reynolds <mav@ wastegate. net> wrote: > > You could try something really ugly where you make a function index that COALESCEs the nulls to 1-1-1900 and use the COALESCE in the query. I don't see

> On Aug 23, 2024, at 5:39 AM, Doug Reynolds <mav(at)wastegate(dot)net> wrote:

>

> You could try something really ugly where you make a function index that COALESCEs the nulls to 1-1-1900 and use the COALESCE in the query.

I don't see how that could be better than just creating a partial index on it WHERE contract_date IS NULL--and anyway I'm sure you're right that no index would help. No matter what, it seems that sequential scans of all partitions will be required since most rows have it null, and it's not even related to the partition key.

In response to

Browse pgsql-admin by date

  From Date Subject
Next Message Scott Ribe 2024-08-23 16:08:49 Re: checking for a NULL date in a partitioned table kills performance
Previous Message Doug Reynolds 2024-08-23 15:42:16 Re: checking for a NULL date in a partitioned table kills performance

Browse pgsql-performance by date

  From Date Subject
Next Message Scott Ribe 2024-08-23 16:08:49 Re: checking for a NULL date in a partitioned table kills performance
Previous Message Doug Reynolds 2024-08-23 15:42:16 Re: checking for a NULL date in a partitioned table kills performance