From: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
To: Josh Berkus <josh(at)agliodbs(dot)com>
Cc: marcin mank <marcin(dot)mank(at)gmail(dot)com>, Greg Smith <greg(at)2ndquadrant(dot)com>, Joachim Wieland <joe(at)mcknight(dot)de>, Andrew Dunstan <andrew(at)dunslane(dot)net>, Robert Haas <robertmhaas(at)gmail(dot)com>, Heikki Linnakangas <heikki(dot)linnakangas(at)enterprisedb(dot)com>, pgsql-hackers(at)postgresql(dot)org
Subject: Re: WIP patch for parallel pg_dump
Date: 2010-12-07 00:22:09
Message-ID: 4881.1291681329@sss.pgh.pa.us
Lists: pgsql-hackers

Josh Berkus <josh(at)agliodbs(dot)com> writes:
>> However, if you were doing something like parallel pg_dump you could
>> just run the parent and child instances all against the slave, so the
>> pg_dump scenario doesn't seem to offer much of a supporting use-case for
>> worrying about this. When would you really need to be able to do it?
> If you had several standbys, you could distribute the work of the
> pg_dump among them. This would be a huge speedup for a large database,
> potentially, thanks to parallelization of I/O and network. Imagine
> doing a pg_dump of a 300GB database in 10min.

That does sound kind of attractive. But to do that I think we'd have to
go with the pass-the-snapshot-through-the-client approach. Shipping
internal snapshot files through the WAL stream doesn't seem attractive
to me.
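
[For reference: the pass-the-snapshot-through-the-client approach is essentially what later shipped as snapshot synchronization at the SQL level (pg_export_snapshot() and SET TRANSACTION SNAPSHOT, added in PostgreSQL 9.2, after this thread). The coordinator exports a snapshot identifier, hands it to each worker out of band, and each worker adopts it before dumping. Note that this only synchronizes sessions on the same server; the cross-standby case discussed above is exactly what it does not cover. A minimal sketch of the handshake, with a made-up snapshot identifier as a placeholder:

```sql
-- Session 1 (coordinator): open a transaction at REPEATABLE READ or
-- higher so its snapshot stays stable, then export the snapshot.
BEGIN TRANSACTION ISOLATION LEVEL REPEATABLE READ;
SELECT pg_export_snapshot();
-- returns an identifier such as '00000003-0000001B-1' (placeholder)

-- Session 2 (worker, same server): adopt that snapshot so both
-- sessions see exactly the same database state.
BEGIN TRANSACTION ISOLATION LEVEL REPEATABLE READ;
SET TRANSACTION SNAPSHOT '00000003-0000001B-1';
```

The coordinator must keep its transaction open until every worker has imported the snapshot, since the identifier is only valid while the exporting transaction lives.]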

While I see Robert's point about preferring not to expose the snapshot
contents to clients, I don't think it outweighs all the other considerations
here; and every other consideration points to doing it the other way.

			regards, tom lane