Re: A problem about partitionwise join - Mailing list pgsql-hackers

From Richard Guo
Subject Re: A problem about partitionwise join
Date
Msg-id CAN_9JTyU8XK+SamftPx6s+Rz+3K6mR5PJv8gCwpx7PUSg+LkPQ@mail.gmail.com
Whole thread Raw
In response to Re: A problem about partitionwise join  (Etsuro Fujita <etsuro.fujita@gmail.com>)
Responses Re: A problem about partitionwise join
List pgsql-hackers

On Fri, Aug 30, 2019 at 2:08 AM Etsuro Fujita <etsuro.fujita@gmail.com> wrote:
On Thu, Aug 29, 2019 at 6:45 PM Richard Guo <riguo@pivotal.io> wrote:
> On Wed, Aug 28, 2019 at 6:49 PM Etsuro Fujita <etsuro.fujita@gmail.com> wrote:
>> On Tue, Aug 27, 2019 at 4:57 PM Richard Guo <riguo@pivotal.io> wrote:
>> > Check the query below as a more illustrative example:
>> >
>> > create table p (k int, val int) partition by range(k);
>> > create table p_1 partition of p for values from (1) to (10);
>> > create table p_2 partition of p for values from (10) to (100);
>> >
>> > If we use quals 'foo.k = bar.k and foo.k = bar.val', we can generate
>> > partitionwise join:
>> >
>> > # explain (costs off)
>> > select * from p as foo join p as bar on foo.k = bar.k and foo.k = bar.val;
>> >                QUERY PLAN
>> > -----------------------------------------
>> >  Append
>> >    ->  Hash Join
>> >          Hash Cond: (foo.k = bar.k)
>> >          ->  Seq Scan on p_1 foo
>> >          ->  Hash
>> >                ->  Seq Scan on p_1 bar
>> >                      Filter: (k = val)
>> >    ->  Hash Join
>> >          Hash Cond: (foo_1.k = bar_1.k)
>> >          ->  Seq Scan on p_2 foo_1
>> >          ->  Hash
>> >                ->  Seq Scan on p_2 bar_1
>> >                      Filter: (k = val)
>> > (13 rows)
>> >
>> > But if we exchange the order of the two quals to 'foo.k = bar.val and
>> > foo.k = bar.k', then partitionwise join cannot be generated any more,
>> > because we only have joinclause 'foo.k = bar.val' as it first reached
>> > score of 3. We have missed the joinclause on the partition key although
>> > it does exist.
>> >
>> > # explain (costs off)
>> > select * from p as foo join p as bar on foo.k = bar.val and foo.k = bar.k;
>> >                QUERY PLAN
>> > -----------------------------------------
>> >  Hash Join
>> >    Hash Cond: (foo.k = bar.val)
>> >    ->  Append
>> >          ->  Seq Scan on p_1 foo
>> >          ->  Seq Scan on p_2 foo_1
>> >    ->  Hash
>> >          ->  Append
>> >                ->  Seq Scan on p_1 bar
>> >                      Filter: (val = k)
>> >                ->  Seq Scan on p_2 bar_1
>> >                      Filter: (val = k)
>> > (11 rows)
>>
>> I think it would be nice if we can address this issue.

> Attached is a patch as an attempt to address this issue. The idea is
> quite straightforward. When building partition info for joinrel, we
> generate any possible EC-derived joinclauses of form 'outer_em =
> inner_em', which will be used together with the original restrictlist to
> check if there exists an equi-join condition for each pair of partition
> keys.

Thank you for the patch!  Will review.  Could you add the patch to the
upcoming CF so that it doesn’t get lost?


Thanks
Richard 

pgsql-hackers by date:

Previous
From: keisuke kuroda
Date:
Subject: Re: Wrong value in metapage of GIN INDEX.
Next
From: Alexander Korotkov
Date:
Subject: Re: Yet another fast GiST build