Attached is a spreadsheet with results for various work_mem values, and also with a smaller data set (just 30M rows in the fact table), which easily fits into memory. Yet it shows similar gains, shaving off ~40% in the best case, suggesting the improvement is not just due to reduced I/O from forcing the temp files to disk.
A neat idea! Have you also tried collecting statistics on the actual false-positive rates and filter allocation sizes at each of the measured data points?
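For context, the *expected* false-positive rate of a standard Bloom filter follows directly from the allocation size, the number of inserted elements, and the hash count, so it can be compared against the measured rate at each data point. A quick sketch of the textbook formulas (function names here are illustrative, not from the patch):

```python
import math

def bloom_fpp(m_bits, n_items, k_hashes):
    """Expected false-positive probability of a standard Bloom filter:
    (1 - e^(-k*n/m))^k, with m bits, n inserted items, k hash functions."""
    return (1.0 - math.exp(-k_hashes * n_items / m_bits)) ** k_hashes

def optimal_k(m_bits, n_items):
    """Hash count minimizing the false-positive rate: (m/n) * ln 2."""
    return max(1, round(m_bits / n_items * math.log(2)))

# Example: 8 bits per element, optimal hash count, expected FPP ~2%
m, n = 8 * 1_000_000, 1_000_000
k = optimal_k(m, n)
print(k, bloom_fpp(m, n, k))
```

Comparing this expected rate against the observed one per data point would show how far the sizing heuristic drifts from the theoretical optimum.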