Home > mailing lists

Re: Merge Join with an Index Optimization - Mailing list pgsql-hackers

From	Michael Malis
Subject	Re: Merge Join with an Index Optimization
Date	September 11, 2016 21:08:04
Msg-id	CAFQtOwraN_11L1XuaRrsJJhE-XkReX=WH+GHNyA5uoJaoe9Kvg@mail.gmail.com Whole thread
In response to	Re: Merge Join with an Index Optimization (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: Merge Join with an Index Optimization
List	pgsql-hackers

Tree view

On Sun, Sep 11, 2016 at 9:20 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:

Michael Malis <michaelmalis2@gmail.com> writes:
>> As I understand it, a merge join will currently read all tuples from both
>> subqueries (besides early termination). I believe it should be possible to
>> take advantages of the indexes on one or both of the tables being read from
>> to skip a large number of tuples that would currently be read.
>>
> IIUC, what you're proposing is to replace "read past some tuples in the
> index" with "re-descend the index btree". Yes, that would win if it

Roughly yes, although is it necessary to rescan the btree from the top? Is it not possible to "resume" the scan from the previous location?

> You might want to troll the archives for past discussions of index skip
> scan, which is more or less the same idea (could possibly be exactly the
> same idea, depending on how it's implemented).

After searching a bit, all I was able to find was this. It looks like someone had a rough patch of applying a similar idea to DISTINCT. I can't tell what happened to it, but in the patch they mention that it should be possible to do a "local traversal ie from the current page".

A different, but similar optimization, would be to first check the join condition with the data in the index. Then only fetch the full row only if the data in the index passes the join condition. I imagine that this would be easier to implement, although less efficent than what I proposed above because it has to scan the entire index.

Thanks,

Michael

pgsql-hackers by date:

From: Kevin Grittner
Date: 11 September 2016, 20:52:37
Subject: Re: [REVIEW] Tab Completion for CREATE DATABASE ... TEMPLATE ...

From: Peter Geoghegan
Date: 11 September 2016, 22:14:07
Subject: Re: Tuplesort merge pre-reading

Re: Merge Join with an Index Optimization - Mailing list pgsql-hackers

Previous

Next