Re: Help troubleshooting SubtransControlLock problems

From: Scott Frazer <sfrazer(at)couponcabin(dot)com>
To: pgsql-admin(at)lists(dot)postgresql(dot)org
Subject: Re: Help troubleshooting SubtransControlLock problems
Date: 2018-03-07 03:25:03
Message-ID: CA+ey=ann3_M43RAPdx7N=SWHB+U8b8uUOMwMTKyieax_ccfPPw@mail.gmail.com
Lists: pgsql-admin

Apologies! I thought pgsql-admin meant something like "systems admin," so
I've reposted the question to pgsql-general.

On Tue, Mar 6, 2018 at 8:57 PM, Scott Frazer <sfrazer(at)couponcabin(dot)com>
wrote:

>
> Hi, we have a Postgres 9.6 setup using replication that has recently
> started seeing a lot of processes stuck with "SubtransControlLock" as the
> wait_event on the read replicas. The output looks like this, except that
> there are usually about 300-800 such rows:
>
> 179706 | LWLockNamed | SubtransControlLock
> 186602 | LWLockNamed | SubtransControlLock
> 186606 | LWLockNamed | SubtransControlLock
> 180947 | LWLockNamed | SubtransControlLock
> 186621 | LWLockNamed | SubtransControlLock
>
> The server then begins to crawl, with some queries just never finishing
> until I finally shut the server down.
>
> Searching for that particular combo of wait_event_type and wait_event only
> seems to turn up the page about statistics collection, but no helpful
> information on troubleshooting this lock.
>
> Restarting the replica server clears the locks and allows us to start
> working again, but it's happened twice now in 12 hours and I'm worried it
> will happen again.
>
> Does anyone have any advice on where to start looking?
>
> Thanks,
> Scott
>
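
For anyone comparing incidents, a small sketch for tallying rows shaped like the sample quoted above may help. It assumes each row is "pid | wait_event_type | wait_event", which is inferred from the pasted output rather than stated in the message. The function name tally_wait_events and the SAMPLE constant are made up for illustration.

```python
from collections import Counter


def tally_wait_events(text):
    """Count rows per (wait_event_type, wait_event) pair.

    Assumes each data row looks like 'pid | wait_event_type | wait_event',
    as in the sample pasted in the message. Other lines are skipped.
    """
    counts = Counter()
    for line in text.splitlines():
        # Drop a leading mail-quote marker ("> ") and surrounding whitespace.
        line = line.lstrip("> ").strip()
        parts = [p.strip() for p in line.split("|")]
        # Keep only rows with three fields and a numeric pid in the first one.
        if len(parts) != 3 or not parts[0].isdigit():
            continue
        counts[(parts[1], parts[2])] += 1
    return counts


# The five rows from the message, used here as example input.
SAMPLE = """\
179706 | LWLockNamed | SubtransControlLock
186602 | LWLockNamed | SubtransControlLock
186606 | LWLockNamed | SubtransControlLock
180947 | LWLockNamed | SubtransControlLock
186621 | LWLockNamed | SubtransControlLock
"""

if __name__ == "__main__":
    # Print the most common wait events first.
    for (event_type, event), n in tally_wait_events(SAMPLE).most_common():
        print(f"{n:5d}  {event_type}  {event}")
```

Saving one such tally per incident makes it easier to see whether SubtransControlLock is the only wait event piling up or whether others rise with it.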
