Quick Links

Re: Partitioning options

From:	Greg Sabino Mullane <htamfids(at)gmail(dot)com>
To:	veem v <veema0000(at)gmail(dot)com>
Cc:	sud <suds1434(at)gmail(dot)com>, pgsql-general(at)lists(dot)postgresql(dot)org
Subject:	Re: Partitioning options
Date:	2024-02-08 20:50:34
Message-ID:	CAKAnmmLS6S0HvQdDh94u5hNXBVOFg8pqB3pY+1Z4SGg+MAFq1A@mail.gmail.com
Views:	Whole Thread \| Raw Message \| Download mbox \| Resend email
Thread:
Lists:	pgsql-general

>
> Out of curiosity, As OP mentioned that there will be Joins and also
> filters on column Customer_id column , so why don't you think that
> subpartition by customer_id will be a good option? I understand List
> subpartition may not be an option considering the new customer_ids gets
> added slowly in the future(and default list may not be allowed) and also OP
> mentioned, there is skewed distribution of data for customer_id column.
> However what is the problem if OP will opt for HASH subpartition on
> customer_id in this situation?
>

It doesn't really gain you much, given you would be hashing it, the
customers are unevenly distributed, and OP talked about filtering on the
customer_id column. A hash partition would just be a lot more work and
complexity for us humans and for Postgres. Partitioning for the sake of
partitioning is not a good thing. Yes, smaller tables are better, but they
have to be smaller targeted tables.

sud wrote:

130GB of storage space as we verified using the "pg_relation_size"
> function, for a sample data set.

You might also want to closely examine your schema. At that scale, every
byte saved per row can add up.

Cheers,
Greg

In response to

Re: Partitioning options at 2024-02-08 19:43:00 from veem v

Responses

Re: Partitioning options at 2024-02-08 21:25:11 from Justin

Browse pgsql-general by date

	From	Date	Subject
Next Message	Greg Sabino Mullane	2024-02-08 20:54:43	Re: Clarification regarding managing advisory locks in postgresql
Previous Message	Alpaslan AKDAĞ	2024-02-08 20:47:37	Re: archive command doesnt work