Re: Fix slot synchronization with two_phase decoding enabled

From: Masahiko Sawada <sawada(dot)mshk(at)gmail(dot)com>
To: "Zhijie Hou (Fujitsu)" <houzj(dot)fnst(at)fujitsu(dot)com>
Cc: Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>, PostgreSQL Hackers <pgsql-hackers(at)lists(dot)postgresql(dot)org>
Subject: Re: Fix slot synchronization with two_phase decoding enabled
Date: 2025-04-02 07:45:07
Message-ID: CAD21AoAbzjU=ixB-Qg77tdiyJtwkx_WR0xBtYVdUhp_BzzzPoQ@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Mon, Mar 31, 2025 at 4:34 AM Zhijie Hou (Fujitsu)
<houzj(dot)fnst(at)fujitsu(dot)com> wrote:
>
> On Thu, Mar 27, 2025 at 2:29 PM Amit Kapila wrote:
>
> >
> > On Tue, Mar 25, 2025 at 12:1 PM Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>
> > wrote:
> > >
> > > On Tue, Mar 25, 2025 at 11:05 AM Zhijie Hou (Fujitsu)
> > > <houzj(dot)fnst(at)fujitsu(dot)com> wrote:
> > > >
> > > > Hi,
> > > >
> > > > When testing the slot synchronization with logical replication slots that
> > > > enabled two_phase decoding, I found that transactions prepared before
> > two-phase
> > > > decoding is enabled may fail to replicate to the subscriber after being
> > > > committed on a promoted standby following a failover.
> > > >
> > > > To reproduce this issue, please follow these steps (also detailed in the
> > > > attached TAP test, v1-0001):
> > > >
> > > > 1. sub: create a subscription with (two_phase = false)
> > > > 2. primary (pub): prepare a txn A.
> > > > 3. sub: alter subscription set (two_phase = true) and wait for the logical
> > slot to
> > > > be synced to standby.
> > > > 4. primary (pub): stop primary, promote the standby and let the subscriber
> > use
> > > > the promoted standby as publisher.
> > > > 5. promoted standby (pub): COMMIT PREPARED A;
> > > > 6. sub: the apply worker will report the following ERROR because it didn't
> > > > receive the PREPARE.
> > > > ERROR: prepared transaction with identifier "pg_gid_16387_752"
> > does not exist
> > > >
> > > > I think the root cause of this issue is that the two_phase_at field of the
> > > > slot, which indicates the LSN from which two-phase decoding is enabled
> > (used to
> > > > prevent duplicate data transmission for prepared transactions), is not
> > > > synchronized to the standby server.
> > > >
> > > > In step 3, transaction A is not immediately replicated because it occurred
> > > > before enabling two-phase decoding. Thus, the prepared transaction
> > should only
> > > > be replicated after decoding the final COMMIT PREPARED, as referenced
> > in
> > > > ReorderBufferFinishPrepared(). However, due to the invalid two_phase_at
> > on the
> > > > standby, the prepared transaction fails to send at that time.
> > > >
> > > > This problem arises after the support for altering the two-phase option
> > > > (1462aad).
> > > >
> >
> > I suspect that this can happen in PG17 as well, but I need to think
> > more about it to make a reproducible test case.
>
> After further analysis, I was able to reproduce the same issue [1] in
> PG 17.
>
> [1]
> - pub: created a slot 'sub' with two_phase=false, then prepared a transaction
> - pub: after some activity, advanced the confirmed_flush_lsn of 'sub', so it is
> greater than prepared txn lsn.
> - sub: create subscription with (slot_name='sub', create_slot=false, failover =
> true, two_phase=true, copy_data=false); two_phase_at will be set to the same
> as confirmed_flush_lsn which is greater than the prepared transaction.
> - stop the primary and promote the standby.
> - commit the prepared transaction on standby, the following error will be
> reported on subscriber:

It seems to require elaborate steps to reproduce this issue in v17. I
wonder if we could somehow narrow down the cases that we want to
prohibit. The patch for v17 disallows CREATE SUBSCRIPTION to enable
both two_phase and failover, but I guess that it's still safe if it
also creates the replication slot (e.g., create_slot is true). If my
understanding is right, we can allow users to specify both fields if
CRETE SUBSCRIPTION creates the slot, and we don't need to disallow
that in ReplicationSlotCreate().

Regards,

--
Masahiko Sawada
Amazon Web Services: https://aws.amazon.com

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Ashutosh Bapat 2025-04-02 08:19:03 Re: Test to dump and restore objects left behind by regression
Previous Message Daniel Gustafsson 2025-04-02 07:32:55 Re: Making sslrootcert=system work on Windows psql