9.4 checksum errors in recovery with gin index

From: Jeff Janes <jeff(dot)janes(at)gmail(dot)com>
To: pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: 9.4 checksum errors in recovery with gin index
Date: 2014-05-07 07:35:35
Message-ID: CAMkU=1y-9WvAeHR5dxTA8z8Ema5ZnT4Y-V643a1ZTO=x_eV13Q@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

When recovering from a crash (with injection of a partial page write at
time of crash) against 7c7b1f4ae5ea3b1b113682d4d I get a checksum
verification failure.

16396 is a gin index.

If I have it ignore checksum failures, there is no apparent misbehavior.
I'm trying to bisect it, but it could take a while and I thought someone
might have some theories based on the log:

29075 2014-05-06 23:29:51.411 PDT:LOG: 00000: database system was not
properly shut down; automatic recovery in progress
29075 2014-05-06 23:29:51.411 PDT:LOCATION: StartupXLOG, xlog.c:6361
29075 2014-05-06 23:29:51.412 PDT:LOG: 00000: redo starts at 11/323FE1C0
29075 2014-05-06 23:29:51.412 PDT:LOCATION: StartupXLOG, xlog.c:6600
29075 2014-05-06 23:29:51.471 PDT:WARNING: 01000: page verification
failed, calculated checksum 35967 but expected 7881
29075 2014-05-06 23:29:51.471 PDT:CONTEXT: xlog redo Delete list pages
(16), node: 1663/16384/16396 blkno: 0
29075 2014-05-06 23:29:51.471 PDT:LOCATION: PageIsVerified, bufpage.c:145
29075 2014-05-06 23:29:51.471 PDT:FATAL: XX001: invalid page in block
28486 of relation base/16384/16396
29075 2014-05-06 23:29:51.471 PDT:CONTEXT: xlog redo Delete list pages
(16), node: 1663/16384/16396 blkno: 0
29075 2014-05-06 23:29:51.471 PDT:LOCATION: ReadBuffer_common,
bufmgr.c:483
27799 2014-05-06 23:29:51.473 PDT:LOG: 00000: startup process (PID 29075)
exited with exit code 1
27799 2014-05-06 23:29:51.473 PDT:LOCATION: LogChildExit,
postmaster.c:3281
27799 2014-05-06 23:29:51.473 PDT:LOG: 00000: aborting startup due to
startup process failure

Cheers,

Jeff

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Andres Freund 2014-05-07 07:48:56 Re: 9.4 checksum errors in recovery with gin index
Previous Message Simon Riggs 2014-05-07 07:18:12 Re: proposal: Set effective_cache_size to greater of .conf value, shared_buffers