Re: how to tell if a replication server has stopped replicating

From: "mark" <dvlhntr(at)gmail(dot)com>
To: "'Bill MacArthur'" <webmaster(at)dhs-club(dot)com>
Cc: <pgsql-admin(at)postgresql(dot)org>
Subject: Re: how to tell if a replication server has stopped replicating
Date: 2011-08-27 03:05:01
Message-ID: 005601cc6466$1d7e73d0$587b5b70$@com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-admin

> -----Original Message-----
> From: pgsql-admin-owner(at)postgresql(dot)org [mailto:pgsql-admin-
> owner(at)postgresql(dot)org] On Behalf Of Bill MacArthur
> Sent: Friday, August 26, 2011 10:21 AM
> To: pgsql-admin(at)postgresql(dot)org
> Subject: [ADMIN] how to tell if a replication server has stopped
> replicating
>
> Hello,
>
> We recently discovered, quite by accident, that our streaming
> replication server was no longer replicating. We noticed this in our
> master server log file:
> 2011-08-26 00:00:05 PDT postgres 192.168.17.4 [unknown]LOG:
> replication connection authorized: user=postgres host=192.168.17.4
> port=53542
> 2011-08-26 00:00:05 PDT postgres 192.168.17.4 [unknown]FATAL:
> requested WAL segment 00000001000001D10000006B has already been removed
>
> As it turned out this has been going on for at least a week as
> everyday's log files were crammed with these messages. Whatever caused
> the replication server to end up needing the WAL file is a mystery for
> another day. What I would like to do is setup a simple method of
> alerting us if replication stops. We could do a simple grep of log
> files on the replication side, but I am guessing that there is some SQL
> command that could be run against the postgres internals that would be
> cleaner. Is there such an animal?
>
> Thank you,
> Bill MacArthur
>

* http://archives.postgresql.org/pgsql-hackers/2010-11/msg00198.php

* http://archives.postgresql.org/pgsql-hackers/2010-11/msg00252.php

Those two posts should cover the basics. There are other ways some people use to do it, but this seems to be the generally accepted way.

I think 9.1 has some stuff in the works that should make it far easier to monitor.

-Mark

In response to

Browse pgsql-admin by date

  From Date Subject
Next Message Andrej 2011-08-28 04:43:22 Re: Using Postgresql as application server
Previous Message gais 2011-08-26 23:27:40 Re: Point-In-Time Recovery not working