WAL replay issue from 9.6.8 to 9.6.10

From: Dave Peticolas <dave(at)krondo(dot)com>
To: pgsql-general <pgsql-general(at)postgresql(dot)org>
Subject: WAL replay issue from 9.6.8 to 9.6.10
Date: 2018-08-29 04:02:45
Message-ID: CAPRbp06PWuV=NETrb_fDXatbTNJvS-v4L2MQwAn-u38Eipvg6g@mail.gmail.com
Lists: pgsql-general

Hello, I'm seeing some issues with WAL replay on a test server running
9.6.10 using WAL archived from a 9.6.8 primary server. It reliably PANICs
during replay with messages like so:

WARNING: page 1209270272 of relation base/16422/47496599 does not exist
CONTEXT: xlog redo at 4810/C84F8A0 for Btree/DELETE: 88 items
PANIC: WAL contains references to invalid pages
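
For anyone who wants to sanity-check the page number in a warning like the one above, here is a minimal sketch that pulls the fields out of the WARNING line and converts the page number to a byte offset. It assumes the default 8 kB block size (a custom build may differ), and `parse_invalid_page` is a hypothetical helper written for illustration, not part of PostgreSQL.

```python
import re

# Assumption: default PostgreSQL block size (BLCKSZ) of 8 kB.
BLOCK_SIZE = 8192

# Matches the "page N of relation PATH does not exist" warning text.
WARNING_RE = re.compile(r"page (\d+) of relation (\S+) does not exist")


def parse_invalid_page(line):
    """Hypothetical helper: extract page number and relation path from a warning line.

    Returns None if the line is not an invalid-page warning.
    """
    match = WARNING_RE.search(line)
    if match is None:
        return None
    page = int(match.group(1))
    path = match.group(2)
    parts = path.split("/")
    return {
        "page": page,
        "path": path,
        # For paths like base/<database OID>/<relfilenode>, these are the last two parts.
        "database_oid": parts[-2],
        "relfilenode": parts[-1],
        # Offset of the page from the start of the relation, in bytes.
        "byte_offset": page * BLOCK_SIZE,
    }


if __name__ == "__main__":
    line = "WARNING: page 1209270272 of relation base/16422/47496599 does not exist"
    info = parse_invalid_page(line)
    print(info)
    print("offset in TiB: %.2f" % (info["byte_offset"] / 2**40))
```

Under that block-size assumption, page 1209270272 would sit roughly 9 TiB into the relation, which may help in judging whether the page number is plausible for that index.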

This is after restoring from a snapshot that completed successfully as far
as I can tell.

I downgraded the test server to 9.6.8 and I was able to replay WAL past the
point where 9.6.10 would PANIC and crash. Curiously, 9.6.8 PANICed the
first time, but then when I restarted it was able to replay through.

Should I expect this to just not work until the primary database is
upgraded to 9.6.10? I'm a little wary of upgrading the primary at this point
until I understand what might be going on.

thanks,
dave
