Temporary disabling a replica in a Patroni cluster

From: Victor Sudakov <vas(at)sibptus(dot)ru>
To: pgsql-admin(at)lists(dot)postgresql(dot)org
Subject: Temporary disabling a replica in a Patroni cluster
Date: 2023-08-25 11:18:42
Message-ID: ZOiOEgeQ0HywK9sN@admin.sibptus.ru
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-admin

Dear Colleagues,

Do you perchance know what is the correct procedure of temporarily
taking down a replica in a Patroni cluster, e.g. for 5-10 minutes of
hardware maintenance?

The problem is that after stopping the patroni process (service) on a
replica, patroni removes the corresponding physical replication slot
from the leader, and unless the wal_keep_size value is unsanely high,
the replica, when up again, cannot restart streaming because the WAL
segments are already gone from the leader.

Well, you all know:
<%%%>LOG: started streaming WAL from primary at B4A0/E2000000 on timeline 8
<%%%>FATAL: could not receive data from WAL stream: ERROR: requested WAL segment 000000080000B4A0000000E2 has already been removed
<%%%>LOG: waiting for WAL to become available at B4A0/E2002000

Do you think there is a way to tell Patroni that a replica is down
temporarily and its replication slot should not be removed?

Or, what am I missing?

--
Victor Sudakov VAS4-RIPE
http://vas.tomsk.ru/
2:5005/49(at)fidonet

Responses

Browse pgsql-admin by date

  From Date Subject
Next Message Georg H. 2023-08-25 11:24:12 Re: Temporary disabling a replica in a Patroni cluster
Previous Message ROHIT SACHDEVA 2023-08-24 16:25:48 Re: Related To Hash Partition