Re: CREATE INDEX CONCURRENTLY does not index prepared xact's data

From: Andrey Borodin <x4mmm(at)yandex-team(dot)ru>
To: Noah Misch <noah(at)leadboat(dot)com>
Cc: Michael Paquier <michael(at)paquier(dot)xyz>, Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>, pgsql-bugs(at)lists(dot)postgresql(dot)org
Subject: Re: CREATE INDEX CONCURRENTLY does not index prepared xact's data
Date: 2021-07-19 18:41:28
Message-ID: 1727EC55-0709-4E06-91C9-212682A134C7@yandex-team.ru
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs


> 19 июля 2021 г., в 23:10, Noah Misch <noah(at)leadboat(dot)com> написал(а):
>
> On Mon, Jul 19, 2021 at 12:10:52PM +0500, Andrey Borodin wrote:
>>>> 19 июля 2021 г., в 05:30, Noah Misch <noah(at)leadboat(dot)com> написал(а):
>>>
>>> To fix $SUBJECT, it sounds like we need a way to identify a transaction,
>>> usable as early as the transaction's first catalog access and remaining valid
>>> until COMMIT PREPARED finishes. We may initially see a transaction as having
>>> a VXID and no XID, then later need to wait for that transaction when it has
>>> entered prepared state, having an XID and no VXID. How might we achieve that?
>>
>> PFA draft with vxid->xid mapping and subsequent wait for it. The patch, obviously, lacks a ton of comments explaining what is going on.
>> We write actual VXID into dummy proc entries of prepared xact.
>> When we wait for vxid we try to convert it to xid through real proc entry. If we cannot do so - we lookup in shared 2pc state. If vxid is not there - it means it is already gone and there's nothing to wait.
>
> When the system reuses BackendId values, it reuses VXID values. In the
> general case, two prepared transactions could exist simultaneously with the
> same BackendId+LocalTransactionId. Hmm. It could be okay to have a small
> probability that CIC waits on more transactions than necessary. Suppose we
> have three PGPROC entries with the same VXID, two prepared transactions and
> one regular transaction. Waiting for all three could be tolerable, though
> avoiding that would be nice. Should we follow transactions differently to
> avoid that?

We don’t have to wait for regular Xid in this case at all. Because it would be finished with VXID. But I think that we have to wait for all 2PCs with the same VXID.

We are looking for transaction that was only VXID during GetLockConflicts(). In conflicts array we may have each VXID only once.
Other 2PCs with same VXID may be older or newer than target 2PC.
Older 2PCs must be with XID in conflicts array. So we might wait for all 2PC with known XIDs. Then for each ambiguous VXID->XID mapping choose oldest XID.

But this logic seem to me overly complicated. Or isn’t it?

Best regards, Andrey Borodin.

In response to

Responses

Browse pgsql-bugs by date

  From Date Subject
Next Message Alvaro Herrera 2021-07-19 21:24:48 Re: BUG #17103: WAL segments are not removed after exceeding max_slot_wal_keep_size
Previous Message Noah Misch 2021-07-19 18:10:05 Re: CREATE INDEX CONCURRENTLY does not index prepared xact's data