Re: Conflict detection for update_deleted in logical replication

From: Masahiko Sawada <sawada(dot)mshk(at)gmail(dot)com>
To: Amit Kapila <amit(dot)kapila16(at)gmail(dot)com>
Cc: "Zhijie Hou (Fujitsu)" <houzj(dot)fnst(at)fujitsu(dot)com>, Nisha Moond <nisha(dot)moond412(at)gmail(dot)com>, "Hayato Kuroda (Fujitsu)" <kuroda(dot)hayato(at)fujitsu(dot)com>, shveta malik <shveta(dot)malik(at)gmail(dot)com>, pgsql-hackers <pgsql-hackers(at)postgresql(dot)org>
Subject: Re: Conflict detection for update_deleted in logical replication
Date: 2025-01-31 21:23:30
Message-ID: CAD21AoAxRNwU5ZCa6bPvDjQxKOA+kUeMqWPBZ1kV0OsK3AfBWg@mail.gmail.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Thu, Jan 30, 2025 at 10:39 PM Amit Kapila <amit(dot)kapila16(at)gmail(dot)com> wrote:
>
> On Fri, Jan 31, 2025 at 4:10 AM Masahiko Sawada <sawada(dot)mshk(at)gmail(dot)com> wrote:
> >
> > I have one question about the 0004 patch; it implemented
> > max_conflict_retntion_duration as a subscription parameter. But the
> > launcher invalidates the pg_conflict_detection slot only if all
> > subscriptions with retain_conflict_info stopped retaining dead tuples
> > due to the max_conflict_retention_duration parameter. Therefore, even
> > if users set the parameter to a low value to avoid table bloats, it
> > would not make sense if other subscriptions set it to a larger value.
> > Is my understanding correct?
> >
>
> Yes, your understanding is correct. I think this could be helpful
> during resolution because the worker for which the duration has
> exceeded cannot detect conflicts reliably but others can. So, this
> info can be useful while performing resolutions. Do you have an
> opinion/suggestion on this matter?

I imagined a scenario like where two apply workers are running and
have different max_conflict_retention_duration values (say '5 min' and
'15 min'). Suppose both workers are roughly the same behind the
publisher(s), when both workers cannot advance the workers' xmin
values for 5 min or longer, one worker stops retaining dead tuples.
However, the pg_conflict_detection slot is not invalidated yet since
another worker is still using it, so both workers would continue to be
getting slower. The subscriber would end up retaining dead tuples
until both workers are behind for 15 min or longer, before
invalidating the slot. In this case, stopping dead tuple retention on
the first worker would help neither advance the slot's xmin nor
improve another worker's performance. I was not sure of the point of
making the max_conflict_retention_duration a per-subscription
parameter.

Regards,

--
Masahiko Sawada
Amazon Web Services: https://aws.amazon.com

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Melanie Plageman 2025-01-31 21:54:10 Re: pgbench with partitioned tables
Previous Message Antonin Houska 2025-01-31 20:02:47 Re: why there is not VACUUM FULL CONCURRENTLY?