Quick Links

Re: Undetected Deadlock

From:	Michael Harris <harmic(at)gmail(dot)com>
To:	Alvaro Herrera <alvherre(at)alvh(dot)no-ip(dot)org>
Cc:	pgsql-general(at)lists(dot)postgresql(dot)org
Subject:	Re: Undetected Deadlock
Date:	2022-01-27 00:20:59
Message-ID:	CADofcAXSn_oS3MOxS=1epiXirwETOB_BpBmM8w4=F3GaQe+cwA@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

Hi Alvaro

Thanks for the feedback!

> What version were you using previously?

We were previously on 11.4. Another difference is that we were using
inheritance based partitioning before, whereas now we are using
declarative partitioning.

> Maybe the lock is already taken before the DELETE is run; do you have
> any triggers, rules, constraints, or anything?

There are no triggers, rules or constraints on the table involved in
the DELETE (either the partition or the table that the partition is
part of).

Even more confusingly - when I reproduce the SQL statements that
should be triggering the deadlock, it does not happen: the DELETE does
not attempt to take an AccessShareLock on the parent table, so it does
not deadlock.

Is there any state associated with a transaction or a database
connection that would alter the lock(s) that gets taken out for a
DELETE on a partition? Or can other concurrent transactions in other
processes cause more locks to be needed somehow?

> If you have seen this several times already, maybe a way to investigate deeper is an
> exhaustive log capture of everything that these transactions do

So far it has happened at least twice. There were a couple of other
incidents that may well also have been caused by this, but not enough
data was collected at the time to be sure.

A bit more detail: the two processes that are deadlocked here are one
that is ingesting new data, while the other is removing old data by
dropping partitions. Even before we shifted to 14.1 and native
partitioning, we did get deadlocks between these two processes every
so often which we could not really prevent, so we adopted a retry
approach when it does occur. However we never had an undetected
deadlock in the database.

Since going to 14.1 & native partitioning, we are seeing a lot more
frequent deadlocks. It looks like the increase in frequency might be
related to the extra lock taken by the DELETE that I mentioned above.
However most of the time the deadlock is detected and released by
Postgres and the impact is minimal. Of course it is another story if
it is not detected!

I have enabled `log_statement=all`, but the undetected deadlock hasn't
happened again since. I can easily reproduce the deadlock itself, but
not the undetected case.

Thanks again.

Cheers
Mike

In response to

Re: Undetected Deadlock at 2022-01-25 23:11:39 from Alvaro Herrera

Responses

Re: Undetected Deadlock at 2022-01-27 01:14:26 from Michael Lewis
Re: Undetected Deadlock at 2022-02-01 06:50:06 from Michael Harris

Browse pgsql-general by date

	From	Date	Subject
Next Message	Michael Lewis	2022-01-27 01:14:26	Re: Undetected Deadlock
Previous Message	Merlin Moncure	2022-01-26 23:23:26	Re: Counting the number of repeated phrases in a column