Re: PITR - warm standby switchover question

From: Dan Hayes <dhayes501(at)gmail(dot)com>
To: Erik Jones <ejones(at)engineyard(dot)com>
Cc: pgsql-general(at)postgresql(dot)org
Subject: Re: PITR - warm standby switchover question
Date: 2009-04-15 00:23:47
Message-ID: 55bfc63d0904141723r56083225ve135f35250578663@mail.gmail.com
Lists: pgsql-general

Excellent! Thanks. One other quick question... What would happen if I
didn't delete the recovery.conf file? Is that step just to prevent
accidentally restarting the server with it there?

On Tue, Apr 14, 2009 at 6:26 PM, Erik Jones <ejones(at)engineyard(dot)com> wrote:

>
> On Apr 14, 2009, at 3:47 PM, Dan Hayes wrote:
>
>> I've followed the implementation instructions at 24.4.2:
>> http://www.postgresql.org/docs/current/static/warm-standby.html
>>
>> And I've used the archive/restore commands from the example in F23.2:
>> http://www.postgresql.org/docs/current/static/pgstandby.html
>>
>> This all works great. The primary backs up the WAL archives to a mutually
>> accessible folder and the backup system is continuously reading in the logs.
>> Monitoring the "standby.log" file shows that the system is working as it
>> should.
>>
>> My question is, if the main system goes down, what do I do to the backup
>> machine to make it come alive? Currently, when I attempt to log in to the
>> database, I get the error: "psql: FATAL: the database is starting up". I'm
>> assuming this means it can't be accessed while it's in continuous recovery
>> mode. I attempted to turn off postgresql on the main server and try
>> connecting again, same error (was obvious, but I do things one step at a
>> time...) So then I removed the "recovery.conf" file on the backup server
>> and attempted to restart postgresql. It stopped fine, but the restart
>> failed.
>>
>> The pg_log/postgresql-XXXX-XX-XX.log file says:
>> LOG: database system was interrupted while in recovery at log time
>> 2009-04-14 17:36:14 CDT
>> HINT: If this has occurred more than once some data might be corrupted
>> and you might need to choose an earlier recovery target.
>> LOG: could not open file "pg_xlog/0000000100000002000000DD" (log file 2,
>> segment 221): No such file or directory
>> LOG: invalid primary checkpoint record
>> LOG: could not open file "pg_xlog/0000000100000002000000DC" (log file 2,
>> segment 220): No such file or directory
>> LOG: invalid secondary checkpoint record
>> PANIC: could not locate a valid checkpoint record
>> LOG: startup process (PID 3756) was terminated by signal 6: Aborted
>> LOG: aborting startup due to startup process failure
>>
>> Any ideas what I'm doing wrong? How do I bring the standby server out of
>> standby to act as the primary?
>>
>
> You should use pg_standby's -t flag to specify a stop file. Then all you
> need to do is touch (create) that file and pg_standby will let the server
> come out of recovery mode into normal operation mode. Be sure to rm or mv
> the recovery.conf once that is complete.
>
> Erik Jones, Database Administrator
> Engine Yard
> Support, Scalability, Reliability
> 866.518.9273 x 260
> Location: US/Pacific
> IRC: mage2k
>
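
A minimal sketch of the switchover described above, assuming the standby's restore_command already passes a trigger file to pg_standby with -t. The paths and the two function names (trigger_failover, retire_recovery_conf) are hypothetical placeholders, not something from the thread or the PostgreSQL docs; adjust them to your own layout.

```shell
#!/bin/sh
# Assumed recovery.conf entry on the standby (paths are placeholders):
#   restore_command = 'pg_standby -t /tmp/pgsql.trigger /mnt/archive %f %p %r'

# trigger_failover TRIGGER_FILE
# Create the file pg_standby is watching (its -t argument). pg_standby then
# stops waiting for new WAL and lets the server finish recovery.
trigger_failover() {
    touch "$1"
}

# retire_recovery_conf DATA_DIR
# Run this only after the server accepts connections. If a recovery.conf is
# still present in the data directory, move it aside so that a later restart
# cannot put the server back into recovery. Does nothing if it is absent.
retire_recovery_conf() {
    if [ -f "$1/recovery.conf" ]; then
        mv "$1/recovery.conf" "$1/recovery.conf.old"
    fi
}

# Example use (placeholders again):
#   trigger_failover /tmp/pgsql.trigger
#   ... wait until psql can connect to the standby ...
#   retire_recovery_conf /var/lib/pgsql/data
```

Note that the server normally renames recovery.conf to recovery.done by itself when recovery ends, so the second step is mostly a safety check. Removing recovery.conf while the standby is still in recovery, as in the original attempt, is what leads to the "could not locate a valid checkpoint record" failure, because the server then has no restore_command to fetch the WAL segments it needs.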
