Quick Links

Re: BUG #15641: Autoprewarm worker fails to start on Windows with huge pages in use Old PostgreSQL community/pgsql-bugs x

From:	Thomas Munro <thomas(dot)munro(at)gmail(dot)com>
To:	Hans Buschmann <buschmann(at)nidsa(dot)net>
Cc:	PostgreSQL mailing lists <pgsql-bugs(at)lists(dot)postgresql(dot)org>, Robert Haas <robertmhaas(at)gmail(dot)com>, mithun(dot)cy(at)enterprisedb(dot)com
Subject:	Re: BUG #15641: Autoprewarm worker fails to start on Windows with huge pages in use Old PostgreSQL community/pgsql-bugs x
Date:	2019-02-20 21:41:11
Message-ID:	CA+hUKG+29N_CXU3sF3t2f_4V+JDUqs=2dxxn8DpmcQnyEmBgqw@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-bugs pgsql-hackers

On Thu, Feb 21, 2019 at 4:36 AM Hans Buschmann <buschmann(at)nidsa(dot)net> wrote:
> I encountered this problem after switching the production system and then found it also on the new created replica.
>
> I have no knowledge of the shared memory areas involved.
>
> I did some further investigation and tried to reproduce it on the old System (WS2016, PG 11.2) but there it worked fine (without and with huge pages activated!).
>
> Even on a developer machine under WS2019, PG 11.2 the error did not occur (both cases running on different generation of intel machines, Haswell and Nehalem, under different Hypervisors, WS2012R2 and WS2019).
>
> I am really confused to not being able to reproduce the error outside of production and replica instances...
>
> The error caused a massive flood of the logs (about 800 MB in about 1 day, on SSD)
>
> I'll try to investigate further by configuring a second replica tomorrow, using the configuration of the production system as done per pg_basebackup.

Just to confirm: on the machines where it happens, does it happen on
every restart, and does it never happen if you set huge_pages = off?

CC'ing the authors of the auto-prewarm feature to see if they have ideas.

There is a known bug (fixed in commit 6c0fb941 for the next release)
that would cause spurious dsm_attach() failure that would look just
this this (dsm_attach() returns NULL), but that should be very rare
and couldn't cause the behaviour described here, because here the
background worker is repeatedly failing to attach in a loop (hence the
800MB of logs).

--
Thomas Munro
https://enterprisedb.com

In response to

AW: BUG #15641: Autoprewarm worker fails to start on Windows with huge pages in use Old PostgreSQL community/pgsql-bugs x at 2019-02-20 16:17:08 from Hans Buschmann

Responses

AW: BUG #15641: Autoprewarm worker fails to start on Windows with huge pages in use Old PostgreSQL community/pgsql-bugs x at 2019-02-21 09:21:55 from Hans Buschmann

Browse pgsql-bugs by date

	From	Date	Subject
Next Message	Thomas Munro	2019-02-20 23:21:14	Re: BUG #15644: not able to up the database
Previous Message	Tom Lane	2019-02-20 20:36:39	Re: BUG #15572: Misleading message reported by "Drop function operation" on DB with functions having same name

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Stephen Frost	2019-02-20 21:43:35	Re: WAL insert delay settings
Previous Message	Robert Haas	2019-02-20 21:32:15	Re: Delay locking partitions during INSERT and UPDATE