From: | Kyotaro HORIGUCHI <horiguchi(dot)kyotaro(at)lab(dot)ntt(dot)co(dot)jp> |
---|---|
To: | andres(at)anarazel(dot)de |
Cc: | pgsql-hackers(at)postgresql(dot)org |
Subject: | Re: WAL recycle retading based on active sync rep. |
Date: | 2016-11-21 03:22:35 |
Message-ID: | 20161121.122235.167372624.horiguchi.kyotaro@lab.ntt.co.jp |
Lists: | pgsql-hackers |
Hello,
At Fri, 18 Nov 2016 10:16:22 -0800, Andres Freund <andres(at)anarazel(dot)de> wrote in <20161118181622(dot)hklschaizwaxocl7(at)alap3(dot)anarazel(dot)de>
> Hi,
>
> On 2016-11-18 14:12:42 +0900, Kyotaro HORIGUCHI wrote:
> > We saw too-early WAL recycling during a test on a sync
> > replication set. This is not a bug, and it is a somewhat extreme
> > case, but it is contrary to expectation on synchronous replication.
>
> I don't think you can expect anything else.
My sentence was inaccurate. It should have read "is contrary to
*naive* expectation on synchronous replication." But I agree with you.
> > This is because sync replication doesn't wait for non-commit WAL
> > records to be replicated. This situation can be reproduced
> > artificially with the first patch attached and the following steps.
>
> You could get that situation even if we waited for syncrep. The
> SyncRepWaitForLSN happens after delayChkpt is unset.
>
> Additionally a syncrep connection could break for a short while, and
> you'd lose all guarantees anyway.
I know. Replication slots are for such cases.
> > - Does this situation need to be saved? It is caused by a
> > large transaction that spans over two max_wal_size segments, or
> > by a replication stall that lasts for a checkpoint period.
>
> I very strongly think not.
>
>
> > - Is the measure acceptable? In the worst case, the master
> > crashes from WAL space exhaustion. (But such a large transaction
> > won't/shouldn't exist?)
>
> No, imo not.
Thanks for clarifying that.
regards,
--
Kyotaro Horiguchi
NTT Open Source Software Center