Re: When to use PARTITION BY HASH?

From: MichaelDBA <MichaelDBA(at)sqlexec(dot)com>
To: Oleksandr Shulgin <oleksandr(dot)shulgin(at)zalando(dot)de>
Cc: pgsql-general(at)lists(dot)postgresql(dot)org, pgsql-performance(at)lists(dot)postgresql(dot)org
Subject: Re: When to use PARTITION BY HASH?
Date: 2020-06-02 17:39:40
Message-ID: 2835080c-b6ac-d13a-c788-4fd03b2c7a23@sqlexec.com
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-general pgsql-performance

Hi,

I use it quite often, since I'm dealing with partitioning keys that have
high cardinality, ie, high number of different values.  If your
cardinality is very high, but your spacing between values is not
uniform, HASH will balance your partitioned tables naturally.  If your
spacing between values is consistent, perhaps RANGE partitioning would
be better.

Regards,
Michael Vitale

Oleksandr Shulgin wrote on 6/2/2020 1:17 PM:
> Hi!
>
> I was reading up on declarative partitioning[1] and I'm not sure what
> could be a possible application of Hash partitioning.
>
> Is anyone actually using it? What are typical use cases?  What
> benefits does such a partitioning scheme provide?
>
> On its face, it seems that it can only give you a number of tables
> which are smaller than the un-partitioned one, but I fail to see how
> it would provide any of the potential advantages listed in the
> documentation.
>
> With a reasonable hash function, the distribution of rows across
> partitions should be more or less equal, so I wouldn't expect any of
> the following to hold true:
> - "...most of the heavily accessed rows of the table are in a single
> partition or a small number of partitions."
> - "Bulk loads and deletes can be accomplished by adding or removing
> partitions...",
> etc.
>
> That *might* turn out to be the case with a small number of distinct
> values in the partitioning column(s), but then why rely on hash
> assignment instead of using PARTITION BY LIST in the first place?
>
> Regards,
> --
> Alex
>
> [1] https://www.postgresql.org/docs/12/ddl-partitioning.html
>

In response to

Browse pgsql-general by date

  From Date Subject
Next Message David G. Johnston 2020-06-02 17:43:02 Re: When to use PARTITION BY HASH?
Previous Message Justin Pryzby 2020-06-02 17:33:54 Re: When to use PARTITION BY HASH?

Browse pgsql-performance by date

  From Date Subject
Next Message David G. Johnston 2020-06-02 17:43:02 Re: When to use PARTITION BY HASH?
Previous Message Justin Pryzby 2020-06-02 17:33:54 Re: When to use PARTITION BY HASH?