Home > mailing lists

Re: Declarative partitioning grammar - Mailing list pgsql-hackers

From	Markus Schiltknecht
Subject	Re: Declarative partitioning grammar
Date	January 15, 2008 04:43:05
Msg-id	478C7210.8020007@bluegap.ch Whole thread Raw
In response to	Re: Declarative partitioning grammar (Jeff Cohen <jcohen@greenplum.com>)
Responses	Re: Declarative partitioning grammar
List	pgsql-hackers

Tree view

Hi Jeff,

Jeff Cohen wrote:
> If you don't define a "default" partition to handle outliers,  the 
> insert should fail with an error.

IMO, you should always have a "default" partition, then, so as not to 
violate the constraints (by rejecting tuples which are correct according 
to the constraints).

> With the generic approach, you start with a single table, and start by 
> splitting it into two six-month partitions:
> 
> ALTER TABLE sales
>   SPLIT where sales_date > date '2007-06-01'
>    INTO
>     (
>      PARTITION first_half
>      PARTITION second_half
>      );
> 
> We could implement this approach using check constraints and table 
> inheritance: the partition second_half is a child table where sales_date 
>  > date '2007-06-01', and the partition first_half has the complementary 
> constraint NOT(sales_date > date '2007-06-01').
> 
> Next, you split each partition:
> 
> ALTER TABLE sales
>   SPLIT PARTITION first_half where sales_date > date '2007-03-01'
>    INTO
>     (
>      PARTITION first_quarter
>      PARTITION second_quarter
>      );
> 
> So now the child table for first_half itself has two children.  As you 
> continue this process you construct a binary tree of table inheritance 
> using 12 ALTER statements.

<nitpicking>There are just 11 splits between 12 months, otherwise 
correct, yes.</nitpicking>

> In the "long" grammar you can create and partition the table in one 
> statement:
> 
> CREATE TABLE sales
> ...
> PARTITION BY sales_date
> (
> start (date '2007-01-01') end (date '2008-01-01')
> every (interval '1 month')
> );

To be fair, you should add the 12 partition names here as well.

I can certainly see merit in letting the database system handle the 
binary tree.

> Thanks for your feedback.  Partitioning the table using series of splits 
> is a clever solution for situations where the partitioning operation 
> cannot be described using simple equality (like list,hash) or ordered 
> comparison (range).  But for many common business cases, the "long" 
> grammar is easier to specify.

Easier to specify initially, maybe, yes. But how about managing it 
afterwards? Having seen all the different options for merging, 
splitting, exchanging, coalescing and adding, all of them with small 
little differences for hash, range and list partitioning - let alone 
sub-partitioning - with all of that, the proposed grammar doesn't look 
particularly easy to me.

Let's at least drop the differences for list, hash and range 
partitioning, those are pretty unneeded, IMO.

Regards

Markus

pgsql-hackers by date:

From: Tom Lane
Date: 15 January 2008, 00:50:30
Subject: Re: Array behavior oddities

From: Peter Eisentraut
Date: 15 January 2008, 05:10:43
Subject: Re: SSL over Unix-domain sockets

Re: Declarative partitioning grammar - Mailing list pgsql-hackers

Previous

Next