Re: SQL:2011 application time

From: Paul Jungwirth <pj(at)illuminatedcomputing(dot)com>
To: Matthias van de Meent <boekewurm+postgres(at)gmail(dot)com>
Cc: Peter Eisentraut <peter(at)eisentraut(dot)org>, Robert Haas <robertmhaas(at)gmail(dot)com>, jian he <jian(dot)universality(at)gmail(dot)com>, PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>
Subject: Re: SQL:2011 application time
Date: 2024-06-05 20:57:40
Message-ID: 3ad31f4c-8caf-4b9e-9f1f-b949c6330005@illuminatedcomputing.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Thu, May 9, 2024 at 5:44 PM Matthias van de Meent <boekewurm+postgres(at)gmail(dot)com> wrote:
> Additionally, because I can't create my own non-constraint-backing
> unique GIST indexes, I can't pre-create my unique constraints
> CONCURRENTLY as one could do for the non-temporal case

We talked about this a bit at pgconf.dev. I would like to implement it, since I agree it is an
important workflow to support. Here are some thoughts about what would need to be done.

First we could take a small step: allow non-temporal UNIQUE GiST indexes. This is possible according
to [1], but in the past we had no way of knowing which strategy number an opclass was using for
equality. With the stratnum support proc introduced by 6db4598fcb (reverted for v17), we could
change amcanunique to true for the GiST AM handler. If the index's opclasses had that sproc and it
gave non-zero for RTEqualStrategyNumber, we would have a reliable "definition of uniqueness". UNIQUE
GiST indexes would raise an error if they detected a duplicate record.

Incidentally, this would also let us correct the error message about GiST not supporting unique,
fixing the problem you raised here:

On Sun, May 12, 2024 at 8:51 AM Paul Jungwirth <pj(at)illuminatedcomputing(dot)com> wrote:
>
> On 5/12/24 05:55, Matthias van de Meent wrote:
> >> > pg=# CREATE UNIQUE INDEX ON temporal_testing USING gist (id, valid_during);
> >> > ERROR: access method "gist" does not support unique indexes
> >>
> >> To me that error message seems correct. The programmer hasn't said anything about the special
> >> temporal behavior they are looking for.
> >
> > But I showed that I had a GIST index that does have the indisunique
> > flag set, which shows that GIST does support indexes with unique
> > semantics.
> >
> > That I can't use CREATE UNIQUE INDEX to create such an index doesn't
> > mean the feature doesn't exist, which is what the error message
> > implies.
>
> True, the error message is not really telling the truth anymore.

But that is just regular non-temporal indexes. To avoid a long table lock you'd need a way to build
the index that is not just unique, but also does exclusion based on &&. We could borrow syntax from
SQL:2011 and allow `CREATE INDEX idx ON t (id, valid_at WITHOUT OVERLAPS)`. But since CREATE INDEX
is a lower-level concept than a constraint, it'd be better to do something more general. You can
already give opclasses for each indexed column. How about allowing operators as well? For instance
`CREATE UNIQUE INDEX idx ON t (id WITH =, valid_at WITH &&)`? Then the index would know to enforce
those rules. This is the same data we store today in pg_constraint.conexclops. So that would get
moved/copied to pg_index (probably moved).

Then when you add the constraint, what is the syntax? Today when you say PRIMARY KEY/UNIQUE USING
INDEX, you don't give the column names. So how do we know it's WITHOUT OVERLAPS? I guess if the
underlying index has (foo WITH = [, bar WITH =], baz WITH &&) we just assume the user wants WITHOUT
OVERLAPS, and otherwise they want a regular PK/UQ constraint?

In addition this workflow only works if you can CREATE INDEX CONCURRENTLY. I'm not sure yet if we'll
have problems there. I noticed that for REINDEX at least, there were plans in 2012 to support
exclusion-constraint indexes,[2] but when the patch was committed in 2019 they had been dropped,
with plans to add support eventually.[3] Today they are still not supported. Maybe whatever caused
problems for REINDEX isn't an issue for just INDEX, but it would take more research to find out.

[1] https://dsf.berkeley.edu/papers/sigmod97-gist.pdf
[2] Original patch thread from 2012:
https://www.postgresql.org/message-id/flat/CAB7nPqS%2BWYN021oQHd9GPe_5dSVcVXMvEBW_E2AV9OOEwggMHw%40mail.gmail.com#e1a372074cfdf37bf9e5b4e29ddf7b2d
[3] Revised patch thread, committed in 2019:
https://www.postgresql.org/message-id/flat/60052986-956b-4478-45ed-8bd119e9b9cf%402ndquadrant.com#74948a1044c56c5e817a5050f554ddee

Yours,

--
Paul ~{:-)
pj(at)illuminatedcomputing(dot)com

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Jeff Davis 2024-06-05 21:06:11 Re: Addressing SECURITY DEFINER Function Vulnerabilities in PostgreSQL Extensions
Previous Message Paul Jungwirth 2024-06-05 20:56:15 Re: SQL:2011 application time