Re: Missing files under pg_data following power failure

From: Brad Nicholson <bnichols(at)ca(dot)afilias(dot)info>
To: Scott Marlowe <scott(dot)marlowe(at)gmail(dot)com>
Cc: Jack Orenstein <jack(dot)orenstein(at)hds(dot)com>, pgsql-general(at)postgresql(dot)org
Subject: Re: Missing files under pg_data following power failure
Date: 2007-10-12 16:08:40
Message-ID: 1192205320.6131.2.camel@bnicholson-desktop
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general

On Fri, 2007-10-12 at 10:54 -0500, Scott Marlowe wrote:
> On 10/12/07, Jack Orenstein <jack(dot)orenstein(at)hds(dot)com> wrote:
> > Our testing involves cutting power to machines running postgres 7.4,
> > while under load (lots of reads and writes). When we do this, we often
> > lose some files under pg_data storing table content. I.e., the file
> > named for a table's pg_class.oid value is simply gone. This can affect
> > many tables following a power outage. We know this problem has
> > occurred when we are unable to access a table, e.g.
> >
> > ERROR: relation "xxx" does not exist
> >
> > The table is present in the system tables, but the file storing table
> > content under pg_data is missing.
> >
> > Can anyone provide insight on this phenomenon? Why are these files
> > lost? Are they really lost, or have they simply moved somewhere? What
> > happens to the disk blocks formerly occupied by the files?
> >
> > Getting back in service following this file loss is not a problem; I'm
> > just trying to understand how postgres gets into this state.
>
> First of all, this should not happen on a machine with proper
> fsyncing. The possible causes are generally either fsync is off in
> postgresql.conf or the drive array <--> OS layer is lying about fsync
> operations.

What filesystem are you using? I've seen similar problems on JFS2, it
was something with logredo or fsck being busted on a Big Blue OS.

--
Brad Nicholson 416-673-4106
Database Administrator, Afilias Canada Corp.

In response to

Browse pgsql-general by date

  From Date Subject
Next Message Richard Huxton 2007-10-12 16:16:33 Re: Out of memory with ODBC
Previous Message Merlin Moncure 2007-10-12 16:00:46 Re: Coercing compound types to use generic ROW comparison operators