Re: Timeout failure in 019_replslot_limit.pl

From: Alvaro Herrera <alvherre(at)alvh(dot)no-ip(dot)org>
To: Michael Paquier <michael(at)paquier(dot)xyz>
Cc: Noah Misch <noah(at)leadboat(dot)com>, Kyotaro Horiguchi <horikyota(dot)ntt(at)gmail(dot)com>, tgl(at)sss(dot)pgh(dot)pa(dot)us, pgsql-hackers(at)lists(dot)postgresql(dot)org
Subject: Re: Timeout failure in 019_replslot_limit.pl
Date: 2021-09-20 12:38:29
Message-ID: 202109201238.7vfk4pxt6mgj@alvherre.pgsql
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On 2021-Sep-20, Michael Paquier wrote:

> > Can you please first test if the idea of sending the signal twice is
> > enough?
>
> This idea does not work. I got one failure after 5 tries.

OK, thanks for taking the time to test it.

> > If that doesn't work, let's try Horiguchi-san's idea of using some
> > `ps` flags to find the process.
>
> Tried this one as well, to see the same failure.

Hmm, do you mean that you used Horiguchi-san's patch in [1] and the
failure still occurred?
[1] https://postgr.es/m/20210907.120106.1483239898065111540.horikyota.ntt@gmail.com

> I was just looking at the state of the test while it was querying
> pg_replication_slots and that was the expected state after the WAL
> sender received SIGCONT:
> USER PID %CPU %MEM VSZ RSS TT STAT STARTED TIME COMMAND
> toto 12663 0.0 0.0 5014468 3384 ?? Ss 8:30PM 0:00.00 postgres: primary3: walsender toto [local] streaming 0/720000
> toto 12662 0.0 0.0 4753092 3936 ?? Ts 8:30PM 0:00.01 postgres: standby_3: walreceiver streaming 0/7000D8
>
> The test gets the right PIDs, as the logs showed:
> ok 17 - have walsender pid 12663
> ok 18 - have walreceiver pid 12662

As I understood, Horiguchi-san's point isn't that the PIDs might be
wrong -- the point is to make sure that the process is in state T before
moving on to the next step in the test.

--
Álvaro Herrera PostgreSQL Developer — https://www.EnterpriseDB.com/

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Daniel Gustafsson 2021-09-20 13:06:17 psql: tab completion differs on semicolon placement
Previous Message Amit Kapila 2021-09-20 12:34:21 Re: Logical replication timeout problem