Quick Links

Re: lifetime of the old CTID

From:	Matthias Apitz <guru(at)unixarea(dot)de>
To:	Karsten Hilbert <Karsten(dot)Hilbert(at)gmx(dot)net>
Cc:	pgsql-general(at)lists(dot)postgresql(dot)org
Subject:	Re: lifetime of the old CTID
Date:	2022-07-06 12:26:00
Message-ID:	20220706122600.GA24@sh4-5.1blu.de
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

El día Mittwoch, Juli 06, 2022 a las 11:45:14 +0200, Karsten Hilbert escribió:

> Am Wed, Jul 06, 2022 at 08:18:42AM +0200 schrieb Matthias Apitz:
>
> > > On first glance, it appears that you are using the ctid as a primary key for a row, and that's highly not-recommended. The ctid is never intended to be stable in the database, as you have discovered. There are really no particular guarantees about ctid values being retained.
> > >
> > > I'd suggest having a proper primary key column on the table, and using that instead.
> >
> > Ofc, each table has its own primary key(s), used for example for the
> > SELECT ctid, * FROM d01buch WHERE ...
> >
> > As I said, we came to PostgreSQL from Sybase (and Oracle) and Sybase has
> > for each table a so called SYB_IDENTITY_COLUMN which is static for the
> > table and its value does not change. When we would add now to some 400 tables an
> > additional INTEGER column (and triggers to fill this on INSERT) this
> > would be a big change in our DB layer and migration of databases in the
> > field. Your suggesting (thanks for it in any case) is not that easy to
> > implement, and no option at the moment.
>
> Christopher suggested to *use* the primary key, not to *add*
> one.
>
> You said that there *is* a primary key.
>
> So, more thought/explanation would need to go into why that
> cannot be used.

This is because of the generic, generated structure of the DB layer for
all the 400 tables. The provided pseudo-code example should only explain
from where our problem comes from. The big CURSOR is done in the DB layer by
the application calling something like

read_where(d01buch, "WHERE d01status=4");

i.e. the application provides the table name (d01buch) and a
WHERE-clause of its interest ("WHERE d01status=4"), here all books which
are lent to patrons. This gives in the DB layer a CURSOR of say 100.000
rows of the 3.000.000 in the table. Now the application fetches row
by row and see if something should be done with the row. If so, the
DB layer must LOCK the row for update. It does so using the CTID. Of
course there is a key in the row (d01gsi, the signature of the book),
but this is not uniqu and can't be used to lock exactly this row for update.

The same problem could occure in any or the other 400 tables.

Interestingly, I tested today morning how long the new CTID can be seen
with currtid2(). I did 10 or more updates of a row and the then new CTID
could always be seen with the old CTID from the moment before the 10 updates. I even
found no way to get this tuple broken. Only deletion of the row helped
to make currtid2() loosing the relation. This is with a 14.1 server. Why
the 13.1 behaves different?

matthias

--
Matthias Apitz, ✉ guru(at)unixarea(dot)de, http://www.unixarea.de/ +49-176-38902045
Public GnuPG key: http://www.unixarea.de/key.pub

In response to

Re: lifetime of the old CTID at 2022-07-06 09:45:14 from Karsten Hilbert

Responses

Re: lifetime of the old CTID at 2022-07-06 12:32:48 from Alvaro Herrera
Re: lifetime of the old CTID at 2022-07-06 13:53:54 from Peter J. Holzer
Re: lifetime of the old CTID at 2022-07-06 15:56:02 from Ilya Anfimov

Browse pgsql-general by date

	From	Date	Subject
Next Message	Alvaro Herrera	2022-07-06 12:32:48	Re: lifetime of the old CTID
Previous Message	Ludwig Isaac Lim	2022-07-06 12:24:00	Error when pasting function blocks into psql