| From: | Jeff Frost <jeff(at)frostconsultingllc(dot)com> |
|---|---|
| To: | Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us> |
| Cc: | Simon Riggs <simon(at)2ndquadrant(dot)com>, pgsql-admin(at)postgresql(dot)org, pgsql-hackers(at)postgresql(dot)org |
| Subject: | Re: does wal archiving block the current client connection? |
| Date: | 2006-05-19 16:08:19 |
| Message-ID: | Pine.LNX.4.64.0605190901140.7012@discord.home.frostconsultingllc.com |
| Views: | Whole Thread | Raw Message | Download mbox | Resend email |
| Thread: | |
| Lists: | pgsql-admin pgsql-hackers |
On Fri, 19 May 2006, Tom Lane wrote:
> Well, there's our smoking gun. IIRC, all the failures you showed us are
> consistent with race conditions caused by multiple archiver processes
> all trying to do the same tasks concurrently.
>
> Do you frequently stop and restart the postmaster? Because I don't see
> how you could get into this state without having done so.
>
> I've just been looking at the code, and the archiver does commit
> hara-kiri when it notices its parent postmaster is dead; but it only
> checks that in the outer loop. Given sufficiently long delays in the
> archive_command, that could be a long time after the postmaster died;
> and in the meantime, successive executions of the archive_command could
> be conflicting with those launched by a later archiver incarnation.
Hurray! Unfortunately, the postmaster on the original troubled server almost
never gets restarted, and in fact only has only one archiver process running
right now. Drat!
I guess I'll have to try and catch it in the act again the next time the NAS
gets wedged so I can debug a little more (it was caught by one of the windows
folks last time) and gather some useful data.
Let me know if you want me to test a patch since I've already got this test
case setup.
--
Jeff Frost, Owner <jeff(at)frostconsultingllc(dot)com>
Frost Consulting, LLC http://www.frostconsultingllc.com/
Phone: 650-780-7908 FAX: 650-649-1954
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Tom Lane | 2006-05-19 16:17:29 | Re: does wal archiving block the current client connection? |
| Previous Message | Tom Lane | 2006-05-19 16:03:17 | Re: [ADMIN] does wal archiving block the current client connection? |
| From | Date | Subject | |
|---|---|---|---|
| Next Message | Joshua D. Drake | 2006-05-19 16:11:57 | Re: [OT] MySQL is bad, but THIS bad? |
| Previous Message | Rafael Martinez | 2006-05-19 16:08:05 | Re: [GENERAL] Weird ..... (a=1 or a=2) <> (a=2 or a=1) |