Re: pg_dump -Fd must create directory

From: David Salisbury <salisbury(at)globe(dot)gov>
To: Postgres List <pgsql-general(at)postgresql(dot)org>
Subject: Re: pg_dump -Fd must create directory
Date: 2012-09-13 20:51:57
Message-ID: 5052476D.6050103@globe.gov
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general


It looks to me like you're misusing git..

You should only git init once, and always use that directory.
Then pg_dump, which should create one file per database
with the file name you've specified.
Not sure of the flags but I'd recommend plain text format.

I'm also unsure what you mean by network traffic, as you don't
mention a remote repository, but there nice visual tools
for you to see the changes to files between you're committed
objects. git init.. will more than likely lose all changes
to files.

-ds

On 9/12/12 5:12 AM, François Beausoleil wrote:
> Hi all!
>
> Why must pg_dump create a fresh new directory everytime? I'm running some tests where I dump a database to a directory, git init and git add --all, then dump again. When I did that after doing some modifications (specifically creating a new table and adding a few hundred thousand records), git status told me some files had been renamed and others were removed. I worked around this by dumping to a new directory, then moving .git manually.
>
> My real-world use case is I have a largish database (46 GiB gzip'd dump) that I dump every few hours, and some tables are essentially static. I was thinking of saving some network traffic by transferring only the delta.
>
> Any thoughts on this? Is this something that can or can be made to change? Where are the IDs used in the dump coming from? Can they be made stable?
>
> Thanks!
> François Beausoleil
>
> $ cat a.sh
> PGDATABASE=${USER}
> rm -rf thedump thedump2
> psql -c "select * into a from generate_series(1, 100000, 1) as t1(a)"
> pg_dump -Fd --file=thedump
> cd thedump&& git init&& git add --all .&& git commit -m "initial"&& cd ..
> psql -c "select * into b from generate_series(1, 100000, 1) as t1(b)"
> pg_dump -Fd --file=thedump2
> mv thedump/.git thedump2/
> cd thedump2
> git status --short
>
> $ sh a.sh
> SELECT 100000
> Initialized empty Git repository in /root/tmp/thedump/.git/
> [master (root-commit) 9f8dc9f] initial
> 2 files changed, 0 insertions(+), 0 deletions(-)
> create mode 100644 1882.dat.gz
> create mode 100644 toc.dat
> SELECT 100000
> D 1882.dat.gz
> M toc.dat
> ?? 1886.dat.gz
> ?? 1887.dat.gz
>

In response to

Responses

Browse pgsql-general by date

  From Date Subject
Next Message Sébastien Lorion 2012-09-13 21:08:39 Re: Amazon High I/O instances
Previous Message Igor Neyman 2012-09-13 20:24:17 Re: Best free tool for relationship extraction