Quick Links

Hot standby queries see transient all-zeros pages

From:	Noah Misch <noah(at)leadboat(dot)com>
To:	pgsql-hackers(at)postgresql(dot)org
Subject:	Hot standby queries see transient all-zeros pages
Date:	2024-05-12 17:16:58
Message-ID:	20240512171658.7e.nmisch@google.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

XLogReadBufferForRedoExtended() precedes RestoreBlockImage() with
RBM_ZERO_AND_LOCK. Per src/backend/storage/buffer/README:

Once one has determined that a tuple is interesting (visible to the current
transaction) one may drop the content lock, yet continue to access the
tuple's data for as long as one holds the buffer pin.

The use of RBM_ZERO_AND_LOCK is incompatible with that. See a similar
argument at https://postgr.es/m/flat/5101(dot)1328219790(at)sss(dot)pgh(dot)pa(dot)us that led me
to the cause. Adding a 10ms sleep just after RBM_ZERO_AND_LOCK, I got 2
failures in 7 runs of 027_stream_regress.pl, at Assert(ItemIdIsNormal(lpp)) in
heapgettup_pagemode(). In the core file, lpp pointed into an all-zeros page.
RestoreBkpBlocks() had been doing RBM_ZERO years before hot standby existed,
but it wasn't a bug until queries could run concurrently.

I suspect the fix is to add a ReadBufferMode specified as, "If the block is
already in shared_buffers, do RBM_NORMAL and exclusive-lock the buffer.
Otherwise, do RBM_ZERO_AND_LOCK." That avoids RBM_NORMAL for a block past the
current end of the file. Like RBM_ZERO_AND_LOCK, it avoids wasting disk reads
on data we discard. Are there other strategies to consider?

I got here from a Windows CI failure,
https://cirrus-ci.com/task/6247605141766144. That involved patched code, but
adding the sleep suffices on Linux, with today's git master:

--- a/src/backend/access/transam/xlogutils.c
+++ b/src/backend/access/transam/xlogutils.c
@@ -388,6 +388,8 @@ XLogReadBufferForRedoExtended(XLogReaderState *record,
*buf = XLogReadBufferExtended(rlocator, forknum, blkno,
get_cleanup_lock ? RBM_ZERO_AND_CLEANUP_LOCK : RBM_ZERO_AND_LOCK,
prefetch_buffer);
+ if (!get_cleanup_lock)
+ pg_usleep(10 * 1000);
page = BufferGetPage(*buf);
if (!RestoreBlockImage(record, block_id, page))
ereport(ERROR,

Responses

Re: Hot standby queries see transient all-zeros pages at 2024-12-13 22:41:15 from Andres Freund

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Noah Misch	2024-05-12 17:48:51	Re: Weird test mixup
Previous Message	Andrew Dunstan	2024-05-12 16:20:48	Re: Why is citext/regress failing on hamerkop?