Quick Links

Re: pg_rewind with cascade standby doesn't work well

From:	Kuwamura Masaki <kuwamura(at)db(dot)is(dot)i(dot)nagoya-u(dot)ac(dot)jp>
To:	pgsql-hackers(at)lists(dot)postgresql(dot)org
Subject:	Re: pg_rewind with cascade standby doesn't work well
Date:	2023-09-11 08:49:46
Message-ID:	CAMyC8qpq9dkh79oHsJ9-DjVovWDOQSk7Q+bFCq_a2nXQqJe9ig@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

> Consider a scenario like this,
>
> Server A: primary
> Server B :replica of A
> Server C :replica of B
>
> and somehow A down ,so B gets promoted.
> Server A: down
> Server B :new primary
> Server C :replica of B
>
> In this case, pg_rewind can be used to reconstruct the cascade; the
source is C and the target is A.
> However, we get error as belows by running pg_rewind.
>
> ```
> pg_rewind: fetched file "global/pg_control", length 8192
> pg_rewind: source and target cluster are on the same timeline
> pg_rewind: no rewind required
> ```

To fix the above mentioned behavior of pg_rewind, I suggest to change the
cascade standby's (i.e. server C's) minRecoveryPointTLI when it receives
the new timeline information from the new primary (i.e. server B).

When server B is promoted, it creates an end-of-recovery record by calling
CreateEndOfRecoveryRecord(). (in xlog.c)
And also updates B's minRecoveryPoint and minRecoveryPointTLI.
```
/*
* Update the control file so that crash recovery can follow the
timeline
* changes to this point.
*/
LWLockAcquire(ControlFileLock, LW_EXCLUSIVE);
ControlFile->minRecoveryPoint = recptr;
ControlFile->minRecoveryPointTLI = xlrec.ThisTimeLineID;
UpdateControlFile();
LWLockRelease(ControlFileLock);
```
Since C is a replica of B, the end-of-recovery record is replicated from B
to C, so the record is replayed in C by xlog_redo().
The attached patch updates minRecoveryPoint and minRecoveryPointTLI at this
point by mimicking CreateEndOfRecoveryRecord().
With this patch, you can run pg_rewind with cascade standby immediately.
(without waiting for checkpointing)

Thoughts?

Masaki Kuwamura

Attachment	Content-Type	Size
v1-0001-pg_rewind-Fix-bug-using-cascade-standby-as-source.patch	application/octet-stream	789 bytes

In response to

pg_rewind with cascade standby doesn't work well at 2023-09-07 06:33:45 from Kuwamura Masaki

Responses

Re: pg_rewind with cascade standby doesn't work well at 2023-09-11 16:04:30 from Aleksander Alekseev

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Amit Kapila	2023-09-11 09:19:49	Re: persist logical slots to disk during shutdown checkpoint
Previous Message	Lepikhov Andrei	2023-09-11 08:04:22	Re: MergeJoin beats HashJoin in the case of multiple hash clauses