Quick Links

Re: Hash partitioning.

From:	Jeff Janes <jeff(dot)janes(at)gmail(dot)com>
To:	"ktm(at)rice(dot)edu" <ktm(at)rice(dot)edu>
Cc:	Markus Wanner <markus(at)bluegap(dot)ch>, Kevin Grittner <kgrittn(at)ymail(dot)com>, Claudio Freire <klaussfreire(at)gmail(dot)com>, Robert Haas <robertmhaas(at)gmail(dot)com>, Bruce Momjian <bruce(at)momjian(dot)us>, Yuri Levinsky <yuril(at)celltick(dot)com>, PostgreSQL-Dev <pgsql-hackers(at)postgresql(dot)org>
Subject:	Re: Hash partitioning.
Date:	2013-06-26 19:32:24
Message-ID:	CAMkU=1xgu2T7VK-Poows+9vMr+nTxnwFKBYQ2LtgWCv=rm6KPw@mail.gmail.com
Views:	Raw Message \| Whole Thread \| Download mbox \| Resend email
Thread:
Lists:	pgsql-hackers

On Wed, Jun 26, 2013 at 7:01 AM, ktm(at)rice(dot)edu <ktm(at)rice(dot)edu> wrote:

> On Wed, Jun 26, 2013 at 03:47:43PM +0200, Markus Wanner wrote:
> > On 06/25/2013 11:52 PM, Kevin Grittner wrote:
> > > At least until we have parallel
> > > query execution. At *that* point this all changes.
> >
> > Can you elaborate on that, please? I currently have a hard time
> > imagining how partitions can help performance in that case, either. At
> > least compared to modern RAID and read-ahead capabilities.
> >
> > After all, RAID can be thought of as hash partitioning with a very weird
> > hash function. Or maybe rather range partitioning on an internal key.
> >
> > Put another way: ideally, the system should take care of optimally
> > distributing data across its physical storage itself. If you need to do
> > partitioning manually for performance reasons, that's actually a
> > deficiency of it, not a feature.
>

+1, except I'm looking at it from a CPU perspective not a disk perspective.

I would hope not to need to partition my data at all in order to enable
parallel execution. I certainly would hope not to redo that partitioning
just because I got new hardware with a different number of CPUs.

> Hi Markus,
>
> I think he is referring to the fact that with parallel query execution,
> multiple partitions can be processed simultaneously instead of serially
> as they are now with the resulting speed increase.
>

Hopefully parallel execution can divide the query into multiple "chunks" on
its own, without me needing to micromanage it.

Cheers,

Jeff

In response to

Re: Hash partitioning. at 2013-06-26 14:01:13 from ktm@rice.edu

Browse pgsql-hackers by date

	From	Date	Subject
Next Message	Szymon Guz	2013-06-26 19:52:15	Re: Add more regression tests for CREATE OPERATOR
Previous Message	Josh Berkus	2013-06-26 19:19:01	Re: Add more regression tests for dbcommands