From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Greg Smith <greg(at)2ndquadrant(dot)com>
Cc: Markus Wanner <markus(at)bluegap(dot)ch>, Dimitri Fontaine <dimitri(at)2ndQuadrant(dot)fr>, Heikki Linnakangas <heikki(dot)linnakangas(at)enterprisedb(dot)com>, Simon Riggs <simon(at)2ndquadrant(dot)com>, Jeff Davis <pgsql(at)j-davis(dot)com>, Josh Berkus <josh(at)agliodbs(dot)com>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: Issues with Quorum Commit
Date: 2010-10-08 14:11:58
Message-ID: 21937.1286547118@sss.pgh.pa.us
Lists: pgsql-hackers
Greg Smith <greg(at)2ndquadrant(dot)com> writes:
> I don't see this as needing any implementation any more complicated than
> the usual way such timeouts are handled. Note how long you've been
> trying to reach the standby. Default to -1 for forever. And if you hit
> the timeout, mark the standby as degraded and force them to do a proper
> resync when they disconnect. Once that's done, then they can re-enter
> sync rep mode again, via the same process a new node would have done so.
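A minimal standalone sketch of that per-standby timeout logic, using made-up
names rather than anything that exists in the tree, might look roughly like
this: track when we last heard from the standby, treat -1 as wait-forever,
and flag the standby as needing a full resync once the timeout expires.

    /* Hypothetical sketch only; no such code or GUC exists today. */
    #include <stdbool.h>
    #include <stdio.h>
    #include <time.h>

    typedef struct StandbyState
    {
        const char *name;          /* identifier for the standby */
        time_t      last_contact;  /* last time we heard from it */
        bool        degraded;      /* needs a full resync before rejoining */
    } StandbyState;

    /* timeout in seconds; -1 means wait forever */
    static const int sync_standby_timeout = 30;

    static void
    check_standby_timeout(StandbyState *standby, time_t now)
    {
        if (sync_standby_timeout < 0 || standby->degraded)
            return;                /* waiting forever, or already marked */

        if (difftime(now, standby->last_contact) > sync_standby_timeout)
        {
            standby->degraded = true;
            printf("standby \"%s\" marked degraded; full resync required\n",
                   standby->name);
        }
    }

    int
    main(void)
    {
        StandbyState s = { "standby1", time(NULL) - 60, false };

        check_standby_timeout(&s, time(NULL));
        return 0;
    }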
Well, actually, that's *considerably* more complicated than just a
timeout. How are you going to "mark the standby as degraded"? The
standby can't keep that information, because it's not even connected
when the master makes the decision. ISTM that this requires
1. a unique identifier for each standby (not just role names that
multiple standbys might share);
2. state on the master associated with each possible standby -- not just
the ones currently connected.
Both of those are perhaps possible, but the sense I have of the
discussion is that people want to avoid them.
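To make #1 and #2 concrete, here is roughly the sort of per-standby
bookkeeping the master would have to grow: a table keyed by a unique standby
identifier, with entries that persist whether or not the standby is currently
connected. All names are invented for illustration; none of this exists.

    #include <stdbool.h>
    #include <stdio.h>
    #include <string.h>

    #define MAX_STANDBYS 16

    /* One entry per known standby, connected or not. */
    typedef struct StandbyEntry
    {
        char standby_id[64];   /* unique identifier, not a shareable role name */
        bool connected;        /* currently streaming? */
        bool degraded;         /* missed the timeout; must do a full resync */
    } StandbyEntry;

    static StandbyEntry standby_table[MAX_STANDBYS];
    static int          num_standbys = 0;

    /* Look up an entry by identifier, registering it if it's new. */
    static StandbyEntry *
    lookup_standby(const char *standby_id)
    {
        for (int i = 0; i < num_standbys; i++)
        {
            if (strcmp(standby_table[i].standby_id, standby_id) == 0)
                return &standby_table[i];
        }
        if (num_standbys >= MAX_STANDBYS)
            return NULL;

        StandbyEntry *entry = &standby_table[num_standbys++];
        snprintf(entry->standby_id, sizeof(entry->standby_id), "%s", standby_id);
        entry->connected = false;
        entry->degraded = false;
        return entry;
    }

    int
    main(void)
    {
        /* unique id, not a role name like "sync_slave" shared by several nodes */
        StandbyEntry *s = lookup_standby("standby-7f3a");

        if (s != NULL)
            s->degraded = true;   /* the master's decision lives only here */
        printf("%s degraded=%d\n", s ? s->standby_id : "(none)",
               s ? (int) s->degraded : 0);
        return 0;
    }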
Actually, #2 seems rather difficult even if you want it. Presumably
you'd like to keep that state in reliable storage, so it survives master
crashes. But how are you going to commit a change to that state if you've
just lost every standby (suppose the master's ethernet cable got unplugged)?
Looks to me like it has to be reliable non-replicated storage. Leaving
aside the question of how reliable it can really be if not replicated,
it's still the case that we have no place to put such information given
the WAL-is-across-the-whole-cluster design.
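If we did want such storage, the mechanics would have to look something like
the sketch below (file name and format invented for illustration): write the
degraded-standby state to a local file, fsync it, and rename it into place,
entirely outside WAL and outside replication.

    #include <fcntl.h>
    #include <stdio.h>
    #include <string.h>
    #include <unistd.h>

    /* Persist a standby's degraded flag in local, non-replicated storage. */
    static int
    persist_degraded_flag(const char *standby_id)
    {
        char buf[128];
        int  fd;

        snprintf(buf, sizeof(buf), "%s degraded\n", standby_id);

        fd = open("standby_state.tmp", O_WRONLY | O_CREAT | O_TRUNC, 0600);
        if (fd < 0)
            return -1;

        if (write(fd, buf, strlen(buf)) < 0 || fsync(fd) != 0)
        {
            close(fd);
            return -1;
        }
        close(fd);

        /* atomic rename so a crash leaves either the old or the new state */
        return rename("standby_state.tmp", "standby_state");
    }

    int
    main(void)
    {
        return persist_degraded_flag("standby1") == 0 ? 0 : 1;
    }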
regards, tom lane