Home > mailing lists

Re: Getting sorted data from foreign server for merge join - Mailing list pgsql-hackers

From	Robert Haas
Subject	Re: Getting sorted data from foreign server for merge join
Date	November 19, 2015 02:00:29
Msg-id	CA+TgmoYiJmqozhBRkhVTp4q-wrngW3geAVowyh7WaAAvQqW+eA@mail.gmail.com Whole thread Raw
In response to	Re: Getting sorted data from foreign server for merge join (Ashutosh Bapat <ashutosh.bapat@enterprisedb.com>)
Responses	Re: Getting sorted data from foreign server for merge join
List	pgsql-hackers

Tree view

On Tue, Nov 17, 2015 at 8:33 AM, Ashutosh Bapat
<ashutosh.bapat@enterprisedb.com> wrote:
>> Although I'm usually on the side of marking things as extern whenever
>> we find it convenient, I'm nervous about doing that to
>> make_canonical_pathkey(), because it has side effects.  Searching the
>> list of canonical pathkeys for the one we need is reasonable, but is
>> it really right to ever think that we might create a new one at this
>> stage?  Maybe it is, but if so I'd like to hear a good explanation as
>> to why.
>
> For a foreign table no pathkeys are created before creating paths for
> individual foreign table since there are no indexes. For a regular table,
> the pathkeys get created for useful indexes.

OK.

Could some portion of the second foreach loop inside
get_useful_pathkeys_for_relation be replaced by a call to
eclass_useful_for_merging?  The logic looks very similar.

More broadly, I'm wondering about the asymmetries between this code
and pathkeys_useful_for_merging().  The latter has a
right_merge_direction() test and a case for non-EC-derivable join
clauses that aren't present in your code.  I wonder if we could just
generate pathkeys speculatively and then test which ones survive
truncate_useless_pathkeys(), or something like that.  This isn't an
enormously complicated patch, but it duplicates more of the logic in
pathkeys.c than I'd like.

I'm inclined to think that we shouldn't consider pathkeys that might
be useful for merge-joining unless we're using remote estimates.  It
seems more speculative than pushing down a toplevel sort.

This patch seems rather light on test cases.

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

pgsql-hackers by date:

From: Greg Stark
Date: 19 November 2015, 01:23:40
Subject: Re: Using quicksort for every external sort run

From: Marko Tiikkaja
Date: 19 November 2015, 02:04:53
Subject: Re: COPY (INSERT/UPDATE/DELETE .. RETURNING ..)

Re: Getting sorted data from foreign server for merge join - Mailing list pgsql-hackers

Previous

Next