Quick Links

Re: odd buildfarm failure - "pg_ctl: control file appears to be corrupt"

From:	Thomas Munro <thomas(dot)munro(at)gmail(dot)com>
To:	Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Cc:	Andres Freund <andres(at)anarazel(dot)de>, pgsql-hackers(at)postgresql(dot)org
Subject:	Re: odd buildfarm failure - "pg_ctl: control file appears to be corrupt"
Date:	2022-11-24 01:02:54
Message-ID:	CA+hUKGK-BEe38aKNqHJDQ86LUW-CMwF5F9bo1JtJVg71FoDv_w@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Thu, Nov 24, 2022 at 11:05 AM Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> wrote:
> Thomas Munro <thomas(dot)munro(at)gmail(dot)com> writes:
> > On Wed, Nov 23, 2022 at 11:03 PM Thomas Munro <thomas(dot)munro(at)gmail(dot)com> wrote:
> > As for what to do about it, some ideas:
> > 2. Retry after a short time on checksum failure. The probability is
> > already miniscule, and becomes pretty close to 0 if we read thrice
> > 100ms apart.
>
> > First thought is that 2 is appropriate level of complexity for this
> > rare and stupid problem.
>
> Yeah, I was thinking the same. A variant could be "repeat until
> we see the same calculated checksum twice".

Hmm. While writing a comment to explain why that's good enough, I
realised it's not really true for a standby that control file writes
are always expected to be far apart in time. XLogFlush->
UpdateMinRecoveryPoint() could coincide badly with our N attempts for
any small N and for any nap time, which I think makes your idea better
than mine.

With some cartoon-level understanding of what's going on (to wit: I
think the kernel just pins the page but doesn't use a page-level
content lock or range lock, so what you're seeing is raw racing memcpy
calls and unsynchronised cache line effects), I guess you'd be fairly
likely to make "progress" in seeing more new data even if you didn't
sleep in between, but who knows. So I have a 10ms sleep to make
progress very likely; given your algorithm it doesn't matter if you
didn't make all the progress, just some. Since this is reachable from
SQL, I think we also need a CFI call so you can't get uninterruptibly
stuck here?

I wrote a stupid throw-away function to force a write. If you have an
ext4 system to hand (xfs, zfs, apfs, ufs, others don't suffer from
this) you can do:

do $$ begin for i in 1..100000000 do loop perform
pg_update_control_file(); end loop; end; $$;

... while you also do:

select pg_control_system();
\watch 0.001

... and you'll soon see:

ERROR: calculated CRC checksum does not match value stored in file

The attached draft patch fixes it.

Attachment	Content-Type	Size
0001-XXX-Dirty-hack-to-clobber-control-file-for-testing.patch	text/x-patch	1.8 KB
0002-Try-to-tolerate-concurrent-reads-and-writes-of-contr.patch	text/x-patch	2.3 KB

In response to

Re: odd buildfarm failure - "pg_ctl: control file appears to be corrupt" at 2022-11-23 22:05:03 from Tom Lane

Responses

Re: odd buildfarm failure - "pg_ctl: control file appears to be corrupt" at 2022-11-24 01:59:04 from Thomas Munro
Re: odd buildfarm failure - "pg_ctl: control file appears to be corrupt" at 2023-01-31 01:09:58 from Anton A. Melnikov

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Ted Yu	2022-11-24 01:09:22	Re: cleanup in open_auth_file
Previous Message	Ted Yu	2022-11-24 00:54:58	cleanup in open_auth_file