From: | "Gauthier, Dave" <dave(dot)gauthier(at)intel(dot)com> |
---|---|
To: | Merlin Moncure <mmoncure(at)gmail(dot)com> |
Cc: | Bill Moran <wmoran(at)potentialtech(dot)com>, "pgsql-general(at)postgresql(dot)org" <pgsql-general(at)postgresql(dot)org> |
Subject: | Re: DB files, sizes and cleanup |
Date: | 2010-12-20 02:48:59 |
Message-ID: | 482E80323A35A54498B8B70FF2B87980047E266D13@azsmsx504.amr.corp.intel.com |
Lists: pgsql-general
I recreated the 2 "bad" DBs (the ones with millions of files in them). So far, they're behaving.
Thanks for the help.
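A quick way to keep an eye on them, as a sketch: assuming the stock layout where each database's files live under $PGDATA/base/<oid>, something like the loop below prints a file count per database, so growth back toward millions stands out.

    # map each base/<oid> directory to its database name and count its files;
    # non-numeric entries (e.g. pgsql_tmp) just fall back to the raw dir name
    for d in "$PGDATA"/base/*/; do
        oid=$(basename "$d")
        name=$(psql -At -d postgres -c "SELECT datname FROM pg_database WHERE oid = $oid")
        echo "${name:-$oid}: $(ls -1 "$d" | wc -l) files"
    done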
-----Original Message-----
From: Merlin Moncure [mailto:mmoncure(at)gmail(dot)com]
Sent: Saturday, December 18, 2010 1:29 PM
To: Gauthier, Dave
Cc: Bill Moran; pgsql-general(at)postgresql(dot)org
Subject: Re: [GENERAL] DB files, sizes and cleanup
On Fri, Dec 17, 2010 at 5:22 PM, Gauthier, Dave <dave(dot)gauthier(at)intel(dot)com> wrote:
> max_fsm_pages = 200000
> max_fsm_relations = 12000
>
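(Those max_fsm_* settings only exist before 8.4. A quick sanity check, as a sketch: on a pre-8.4 server, the tail of a database-wide VACUUM VERBOSE summarizes free-space-map usage and warns when max_fsm_pages is too low; "foo" here stands in for one of the 12 databases.)

    # check the free-space-map summary at the end of VACUUM VERBOSE;
    # psql emits the INFO lines on stderr, hence the 2>&1
    psql -d foo -c 'VACUUM VERBOSE;' 2>&1 | tail -n 5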
> There are 12 DBs with roughly 30 tables+indexes each.
>
> There are apparently 2 "bad" DBs. Both identical in terms of data models (clones with different data). I've pg_dummped one of them to a file, dropped the DB (took a long time as millions of files were deleted) and recreated it. It now has 186 files.
>
> ls -1 | wc took a while for the other bad one but eventually came up with exactly 7,949,911 files, so yes, millions. The other one had millions too before I dropped it. Something is clearly wrong. But, since the DB recreate worked for the other one, I'll do the same thing to fix this one too.
>
> What I'll need to know, then, is how to prevent this in the future. It's very odd, because the worse of the 2 bad DBs was a sister DB to one that's no problem at all. Here's the picture...
>
> I have a DB, call it "foo", that gets loaded with a ton of data at night. The users query it read-only all day. At midnight, an empty DB called "foo_standby", which is identical to "foo" in terms of data model, is reloaded from scratch. It takes hours, but when it's done, I do a few database renames to swap "foo" with "foo_standby" (really just a name swap). "foo_standby" then serves as a live backup of yesterday's data. Come the next midnight, I truncate all its tables and start the process all over again.
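A minimal sketch of that midnight swap, assuming no sessions are still connected (ALTER DATABASE ... RENAME fails if the database being renamed is in use, so it has to run from a maintenance database such as postgres); "foo_old" is just a hypothetical scratch name:

    # three renames rotate foo and foo_standby
    psql -d postgres -c 'ALTER DATABASE foo RENAME TO foo_old;'
    psql -d postgres -c 'ALTER DATABASE foo_standby RENAME TO foo;'
    psql -d postgres -c 'ALTER DATABASE foo_old RENAME TO foo_standby;'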
Maybe something in this process is leaking files. If I were in your
shoes, I'd recreate the database from scratch, then watch the file
count carefully and look for unusual growth. This is probably not the
case, but if it is in fact a backend bug, it will turn up again right
away.
Does anything else interesting jump out about these files? For example, are
there a lot of 0-byte files?
merlin
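(For the record, that zero-byte check is a one-liner; a sketch, with 16384 as a placeholder for the suspect database's OID under $PGDATA/base:)

    # count zero-length files in one database's directory
    find "$PGDATA"/base/16384 -type f -size 0 | wc -l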