Re: BUG #18789: logical replication slots are deleted after failovers

From: Masahiko Sawada <sawada(dot)mshk(at)gmail(dot)com>
To: sachinkonde3(at)gmail(dot)com, pgsql-bugs(at)lists(dot)postgresql(dot)org
Subject: Re: BUG #18789: logical replication slots are deleted after failovers
Date: 2025-01-29 19:13:26
Message-ID: CAD21AoBk-gsJ8z4czP4wmNFhsE=Rgmg7-hf0sVPFY+u8Joo8JA@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

On Wed, Jan 29, 2025 at 7:01 AM PG Bug reporting form
<noreply(at)postgresql(dot)org> wrote:
>
> The following bug has been logged on the website:
>
> Bug reference: 18789
> Logged by: Sachin Konde-Deshmukh
> Email address: sachinkonde3(at)gmail(dot)com
> PostgreSQL version: 17.2
> Operating system: Oracle Linux 8.9
> Description:
>
> We are using 2 node PostgreSQL 17 HA setup using Patroni 4.0.4.
> When I do failover 2nd or third time or more than once, it fails to transfer
> or move logical replication slot to new Primary.
> postgres=# select slot_name,slot_type, failover,
> synced,confirmed_flush_lsn,active from pg_replication_slots;
> slot_name | slot_type | failover | synced | confirmed_flush_lsn |
> active
> --------------------+-----------+----------+--------+---------------------+--------
> psoel89pgcluster01 | physical | f | f | |
> t
> mysub | logical | t | t | 0/4000AB8 |
> t
> (2 rows)

I guess that this is the list of slots on the primary.

> After First Failover -->
> postgres=# select slot_name,slot_type, failover,
> synced,confirmed_flush_lsn,active from pg_replication_slots;
> slot_name | slot_type | failover | synced | confirmed_flush_lsn |
> active
> --------------------+-----------+----------+--------+---------------------+--------
> psoel89pgcluster02 | physical | f | f | |
> t
> mysub | logical | f | f | 0/50001E0 |
> t
> (2 rows)

I guess that this is the list of slots on the new primary after a
failover. It seems that a subscriber is receiving logical replication
changes from the new primary by using the 'mysub' slot, which makes
sense. However, a problem I can see is that its 'failover' and
'synced' fields were false. Was the slot sync worker running on the
standby before the first failover?

Regards,

--
Masahiko Sawada
Amazon Web Services: https://aws.amazon.com

In response to

Browse pgsql-bugs by date

  From Date Subject
Next Message Tom Lane 2025-01-29 19:27:42 Re: BUG #18735: Specific multibyte character in psql file path command parameter for Windows
Previous Message PG Bug reporting form 2025-01-29 19:10:33 BUG #18790: Pg_stat_statements doesn't track schema.