Re: BUG #17345: pg_basebackup stucked for 2 hours before timeout

From: Masahiko Sawada <sawada(dot)mshk(at)gmail(dot)com>
To: bchen90(at)163(dot)com, pgsql-bugs(at)lists(dot)postgresql(dot)org
Subject: Re: BUG #17345: pg_basebackup stucked for 2 hours before timeout
Date: 2021-12-27 04:53:33
Message-ID: CAD21AoDdQKkpdLMcmY3NqO2xKJpf=nx-=OwNQzzcLuKvB1fXdg@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-bugs

Hi,

On Mon, Dec 27, 2021 at 1:23 PM PG Bug reporting form
<noreply(at)postgresql(dot)org> wrote:
>
> The following bug has been logged on the website:
>
> Bug reference: 17345
> Logged by: Bo Chen
> Email address: bchen90(at)163(dot)com
> PostgreSQL version: 11.13
> Operating system: euleros v2r9 x86_64
> Description:
>
> Hello experts,
> I am facing an issue for pg_basebackup in docker env. when the primary
> VM restarted while pg_basebackup is running on the standby docker in VM. It
> takes 2 hours before pg_basebackup times out.
> After analysis and reproduce the problem, I think the reason is the
> parent process for fetching data files is blocking for tcp keeplive, and it
> ignore or block SIGCHLD when running poll API. So we add signaling the
> parent when fetching wal exit not zero.

This seems to be addressed by the patch discussed here[1]. I'm not
sure it's going to be backpatched but is there any chance you could
test this patch?

Regards,

[1] https://www.postgresql.org/message-id/0F69E282-97F9-4DB7-8D6D-F927AA6340C8%40yesql.se

--
Masahiko Sawada
EDB: https://www.enterprisedb.com/

In response to

Browse pgsql-bugs by date

  From Date Subject
Next Message PG Bug reporting form 2021-12-27 08:38:17 BUG #17346: pg_upgrade fails with role granted by other role
Previous Message PG Bug reporting form 2021-12-27 03:53:30 BUG #17345: pg_basebackup stucked for 2 hours before timeout