Re: Retrieving unused tuple attributes in ExecScan - Mailing list pgsql-hackers

From Finnerty, Jim
Subject Re: Retrieving unused tuple attributes in ExecScan
Date
Msg-id 19452DD3-4DF1-4B85-8894-2A8131F04A4C@amazon.com
Whole thread Raw
In response to Re: Retrieving unused tuple attributes in ExecScan  (Andres Freund <andres@anarazel.de>)
List pgsql-hackers
Re: So I'm actually using the columns during merge join, basically I'm building a bloom filter on the outer relation
andfiltering out data on the inner relation of the join. I'm building the filter on the join keys
 

We had a whole implementation for Bloom filtering for hash inner join, complete with costing and pushdown of the Bloom
filterfrom the build side to the execution tree on the probe side (i.e. building a Bloom filter on the inner side of
thejoin at the conclusion of the build phase of the hash join, then pushing it down as a semi-join filter to the probe
sideof the join, where it could potentially be applied to multiple scans).  After a large change to that same area of
thecode by the community it got commented out and has been in that state ever since.  It's a good example of the sort
ofchange that really ought to be made with the community because there's too much merge burden otherwise.
 

It was a pretty effective optimization in some cases, though.  Most commercial systems have an optimization like this,
sometimeswith special optimizations when the number of distinct join keys is very small. If there is interest in
revivingthis functionality, we could probably extract some patches and work with the community to try to get it running
again. 
 

   /Jim



pgsql-hackers by date:

Previous
From: Andres Freund
Date:
Subject: Re: Retrieving unused tuple attributes in ExecScan
Next
From: Peter Geoghegan
Date:
Subject: Re: do only critical work during single-user vacuum?