From: | Robert Haas <robertmhaas(at)gmail(dot)com> |
---|---|
To: | Jeevan Ladhe <jeevan(dot)ladhe(at)enterprisedb(dot)com> |
Cc: | Ashutosh Bapat <ashutosh(dot)bapat(at)enterprisedb(dot)com>, Beena Emerson <memissemerson(at)gmail(dot)com>, Amit Langote <Langote_Amit_f8(at)lab(dot)ntt(dot)co(dot)jp>, Rajkumar Raghuwanshi <rajkumar(dot)raghuwanshi(at)enterprisedb(dot)com>, Rahila Syed <rahilasyed90(at)gmail(dot)com>, PostgreSQL-development <pgsql-hackers(at)postgresql(dot)org> |
Subject: | Re: Adding support for Default partition in partitioning |
Date: | 2017-06-14 19:51:44 |
Message-ID: | CA+TgmoZ+54-z+VJrxeuMmSV8aDWmuEQcsE9iFfhh=ZybomcZnw@mail.gmail.com |
Views: | Raw Message | Whole Thread | Download mbox | Resend email |
Thread: | |
Lists: | pgsql-hackers |
On Wed, Jun 14, 2017 at 8:02 AM, Jeevan Ladhe
<jeevan(dot)ladhe(at)enterprisedb(dot)com> wrote:
> Here are the details of the patches in attached zip.
> 0001. refactoring existing ATExecAttachPartition code so that it can be
> used for
> default partitioning as well
> 0002. support for default partition with the restriction of preventing
> addition
> of any new partition after default partition.
> 0003. extend default partitioning support to allow addition of new
> partitions.
> 0004. extend default partitioning validation code to reuse the refactored
> code
> in patch 0001.
I think the core ideas of this patch are pretty solid now. It's come
a long way in the last month. High-level comments:
- Needs to be rebased over b08df9cab777427fdafe633ca7b8abf29817aa55.
- Still no documentation.
- Should probably be merged with the patch to add default partitioning
for ranges.
Other stuff I noticed:
- The regression tests don't seem to check that the scan-skipping
logic works as expected. We have regression tests for that case for
attaching regular partitions, and it seems like it would be worth
testing the default-partition case as well.
- check_default_allows_bound() assumes that if
canSkipPartConstraintValidation() fails for the default partition, it
will also fail for every subpartition of the default partition. That
is, once we commit to scanning one child partition, we're committed to
scanning them all. In practice, that's probably not a huge
limitation, but if it's not too much code, we could keep the top-level
check but also check each partitioning individually as we reach it,
and skip the scan for any individual partitions for which the
constraint can be proven. For example, suppose the top-level table is
list-partitioned with a partition for each of the most common values,
and then we range-partition the default partition.
- The changes to the regression test results in 0004 make the error
messages slightly worse. The old message names the default partition,
whereas the new one does not. Maybe that's worth avoiding.
Specific comments:
+ * Also, invalidate the parent's and a sibling default partition's relcache,
+ * so that the next rebuild will load the new partition's info into parent's
+ * partition descriptor and default partition constraints(which are dependent
+ * on other partition bounds) are built anew.
I find this a bit unclear, and it also repeats the comment further
down. Maybe something like: Also, invalidate the parent's relcache
entry, so that the next rebuild will load he new partition's info into
its partition descriptor. If there is a default partition, we must
invalidate its relcache entry as well.
+ /*
+ * The default partition constraints depend upon the partition bounds of
+ * other partitions. Adding a new(or even removing existing) partition
+ * would invalidate the default partition constraints. Invalidate the
+ * default partition's relcache so that the constraints are built anew and
+ * any plans dependent on those constraints are invalidated as well.
+ */
Here, I'd write: The partition constraint for the default partition
depends on the partition bounds of every other partition, so we must
invalidate the relcache entry for that partition every time a
partition is added or removed.
+ /*
+ * Default partition cannot be added if there already
+ * exists one.
+ */
+ if (spec->is_default)
+ {
+ overlap = partition_bound_has_default(boundinfo);
+ with = boundinfo->default_index;
+ break;
+ }
To support default partitioning for range, this is going to have to be
moved above the switch rather than done inside of it. And there's
really no downside to putting it there.
+ * constraint, by *proving* that the existing constraints of the table
+ * *imply* the given constraints. We include the table's check constraints and
Both the comma and the asterisks are unnecessary.
+ * Check whether all rows in the given table (scanRel) obey given partition
obey the given
I think the larger comment block could be tightened up a bit, like
this: Check whether all rows in the given table obey the given
partition constraint; if so, it can be attached as a partition. We do
this by scanning the table (or all of its leaf partitions) row by row,
except when the existing constraints are sufficient to prove that the
new partitioning constraint must already hold.
+ /* Check if we can do away with having to scan the table being attached. */
If possible, skip the validation scan.
+ * Set up to have the table be scanned to validate the partition
+ * constraint If it's a partitioned table, we instead schedule its leaf
+ * partitions to be scanned.
I suggest: Prepare to scan the default partition (or, if it is itself
partitioned, all of its leaf partitions).
+ int default_index; /* Index of the default partition if any; -1
+ * if there isn't one */
"if any" is a bit redundant with "if there isn't one"; note the
phrasing of the preceding entry.
+ /*
+ * Skip if it's a partitioned table. Only RELKIND_RELATION relations
+ * (ie, leaf partitions) need to be scanned.
+ */
+ if (part_rel->rd_rel->relkind == RELKIND_PARTITIONED_TABLE ||
+ part_rel->rd_rel->relkind == RELKIND_FOREIGN_TABLE)
The comment talks about what must be included in our list of things to
scan, but the code tests for the things that can be excluded. I
suspect the comment has the right idea and the code should be adjusted
to match, but anyway they should be consistent. Also, the correct way
to punctuate i.e. is like this: (i.e. leaf partitions) You should have
a period after each letter, but no following comma.
+ * The default partition must be already having an AccessExclusiveLock.
I think we should instead change DefineRelation to open (rather than
just lock) the default partition and pass the Relation as an argument
here so that we need not reopen it.
+ /* Construct const from datum */
+ val = makeConst(key->parttypid[0],
+ key->parttypmod[0],
+ key->parttypcoll[0],
+ key->parttyplen[0],
+ *boundinfo->datums[i],
+ false, /* isnull */
+ key->parttypbyval[0] /* byval */ );
The /* byval */ comment looks a bit redundant, but I think this could
use a comment along the lines of: /* Only single-column list
partitioning is supported, so we only need to worry about the
partition key with index 0. */ And I'd also add an Assert() verifying
the the partition key has exactly 1 column, so that this breaks a bit
more obviously if someone removes that restriction in the future.
+ * Handle NULL partition key here if there's a null-accepting list
+ * partition, else later it will be routed to the default partition if
+ * one exists.
This isn't a great update of the existing comment -- it's drifted from
explaining the code to which it is immediately attached to a more
general discussion of NULL handling. I'd just say something like: If
this is a NULL, send it to the null-accepting partition. Otherwise,
route by searching the array of partition bounds.
+ if (tab->is_default_partition)
+ ereport(ERROR,
+ (errcode(ERRCODE_INVALID_OBJECT_DEFINITION),
+ errmsg("updated partition constraint for
default partition would be violated by some row")));
+ else
+ ereport(ERROR,
+ (errcode(ERRCODE_CHECK_VIOLATION),
While there's room for debate about the correct error code here, it's
hard for me to believe that it's correct to use one error code for the
is_default_partition case and a different error code the rest of the
time.
+ * previously cached default partition constraints; those constraints
+ * won't stand correct after addition(or even removal) of a partition.
won't be correct after addition or removal
+ * allow any row that qualifies for this new partition. So, check if
+ * the existing data in the default partition satisfies this *would be*
+ * default partition constraint.
check that the existing data in the default partition satisfies the
constraint as it will exist after adding this partition
+ * Need to take a lock on the default partition, refer comment for locking
+ * the default partition in DefineRelation().
I'd say: We must also lock the default partition, for the same reasons
explained in DefineRelation().
And similarly in the other places that refer to that same comment.
+ /*
+ * In case of the default partition, the constraint is of the form
+ * "!(result)" i.e. one of the following two forms:
+ * 1. NOT ((keycol IS NULL) OR (keycol = ANY (arr)))
+ * 2. NOT ((keycol IS NOT NULL) AND (keycol = ANY (arr))), where arr is an
+ * array of datums in boundinfo->datums.
+ */
Does this survive pgindent? You might need to surround the comment
with dashes to preserve formatting.
I think it would be worth adding a little more text this comment,
something like this: Note that, in general, applying NOT to a
constraint expression doesn't necessarily invert the set of rows it
accepts, because NOT NULL is NULL. However, the partition constraints
we construct here never evaluate to NULL, so applying NOT works as
intended.
+ * Check whether default partition has a row that would fit the partition
+ * being attached by negating the partition constraint derived from the
+ * bounds. Since default partition is already part of the partitioned
+ * table, we don't need to validate the constraints on the partitioned
+ * table.
Here again, I'd add to the end of the first sentence a parenthetical
note, like this: ...from the bounds (the partition constraint never
evaluates to NULL, so negating it like this is safe).
I don't understand the second sentence. It seems to contradict the first one.
+extern List *get_default_part_validation_constraint(List *new_part_constaints);
#endif /* PARTITION_H */
There should be a blank line after the last prototype and before the #endif.
+-- default partition table when it is being used in cahced plan.
Typo.
--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company
From | Date | Subject | |
---|---|---|---|
Next Message | Robert Haas | 2017-06-14 20:03:23 | Re: intermittent failures in Cygwin from select_parallel tests |
Previous Message | Tom Lane | 2017-06-14 19:33:48 | Re: intermittent failures in Cygwin from select_parallel tests |