Re: CREATE INDEX CONCURRENTLY does not index prepared xact's data

From: Noah Misch <noah(at)leadboat(dot)com>
To: Andrey Borodin <x4mmm(at)yandex-team(dot)ru>
Cc: Michael Paquier <michael(at)paquier(dot)xyz>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-bugs(at)lists(dot)postgresql(dot)org
Subject: Re: CREATE INDEX CONCURRENTLY does not index prepared xact's data
Date: 2021-07-20 21:49:36
Message-ID: 20210720214936.GA1040253@gust.leadboat.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

On Wed, Jul 21, 2021 at 12:38:25AM +0500, Andrey Borodin wrote:
> > 19 июля 2021 г., в 23:41, Andrey Borodin <x4mmm(at)yandex-team(dot)ru> написал(а):
> > > 19 июля 2021 г., в 23:10, Noah Misch <noah(at)leadboat(dot)com> написал(а):
> > > On Mon, Jul 19, 2021 at 12:10:52PM +0500, Andrey Borodin wrote:
> > >>>> 19 июля 2021 г., в 05:30, Noah Misch <noah(at)leadboat(dot)com> написал(а):
> > >>>
> > >>> To fix $SUBJECT, it sounds like we need a way to identify a transaction,
> > >>> usable as early as the transaction's first catalog access and remaining valid
> > >>> until COMMIT PREPARED finishes. We may initially see a transaction as having
> > >>> a VXID and no XID, then later need to wait for that transaction when it has
> > >>> entered prepared state, having an XID and no VXID. How might we achieve that?
> > >>
> > >> PFA draft with vxid->xid mapping and subsequent wait for it. The patch, obviously, lacks a ton of comments explaining what is going on.
> > >> We write actual VXID into dummy proc entries of prepared xact.
> > >> When we wait for vxid we try to convert it to xid through real proc entry. If we cannot do so - we lookup in shared 2pc state. If vxid is not there - it means it is already gone and there's nothing to wait.
> > >
> > > When the system reuses BackendId values, it reuses VXID values. In the
> > > general case, two prepared transactions could exist simultaneously with the
> > > same BackendId+LocalTransactionId. Hmm. It could be okay to have a small
> > > probability that CIC waits on more transactions than necessary. Suppose we
> > > have three PGPROC entries with the same VXID, two prepared transactions and
> > > one regular transaction. Waiting for all three could be tolerable, though
> > > avoiding that would be nice. Should we follow transactions differently to
> > > avoid that?
> >
> > We don’t have to wait for regular Xid in this case at all. Because it would be finished with VXID.

I don't understand those two sentences. Could you write more?

> > We are looking for transaction that was only VXID during GetLockConflicts(). In conflicts array we may have each VXID only once.
> > Other 2PCs with same VXID may be older or newer than target 2PC.
> > Older 2PCs must be with XID in conflicts array. So we might wait for all 2PC with known XIDs. Then for each ambiguous VXID->XID mapping choose oldest XID.

> Unfortunately, this is just wrong. Older 2PC with same VXID don't have to be in conflicts array. They might be of some other unrelated 2PC working with different relations.

I agree. Would it work to do the following sequence in WaitForLockers()?

1. In GetLockConflicts(), record a list of conflicting XIDs. Also record a
list of conflicting LXIDs having no XID.
2. Wait on all LXIDs recorded in (1). They have either ended or converted to
prepared transactions.
3. Inner-join the present list of prepared transactions with the list of
LXIDs from (1). Record the XIDs of matched entries.
4. Wait on all XIDs recorded in (1) and (3).

While that may wait on some prepared XIDs needlessly, it can't degrade into
persistent starvation. We could reduce the chance of a needless XID wait by
remembering the list of old prepared XIDs before (1) and not adding any of
those remembered, old XIDs in (3). That last bit probably would be
unjustified complexity, but maybe not.

In response to

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Tom Lane 2021-07-20 21:54:55 Re: BUG #17066: Cache lookup failed when null (unknown) is passed as anycompatiblemultirange
Previous Message PG Bug reporting form 2021-07-20 21:00:01 BUG #17116: Assert failed in SerialSetActiveSerXmin() on commit of parallelized serializable transaction