Home > mailing lists

Re: tweaking NTUP_PER_BUCKET - Mailing list pgsql-hackers

From	Tomas Vondra
Subject	Re: tweaking NTUP_PER_BUCKET
Date	July 8, 2014 21:17:09
Msg-id	53BC5FC2.6020806@fuzzy.cz Whole thread Raw
In response to	Re: tweaking NTUP_PER_BUCKET (Tomas Vondra <tv@fuzzy.cz>)
Responses	Re: tweaking NTUP_PER_BUCKET
List	pgsql-hackers

Tree view

Hi,

Thinking about this a bit more, do we really need to build the hash
table on the first pass? Why not to do this:

(1) batching   - read the tuples, stuff them into a simple list   - don't build the hash table yet

(2) building the hash table   - we have all the tuples in a simple list, batching is done   - we know exact row count,
cansize the table properly   - build the table
 

Also, maybe we could use a regular linear hash table [1], instead of
using the current implementation with NTUP_PER_BUCKET=1. (Although,
that'd be absolutely awful with duplicates.)

regards
Tomas

[1] http://en.wikipedia.org/wiki/Linear_probing

pgsql-hackers by date:

From: Tomas Vondra
Date: 08 July 2014, 21:04:38
Subject: Re: tweaking NTUP_PER_BUCKET

From: Tom Lane
Date: 08 July 2014, 21:28:18
Subject: Re: Allowing join removals for more join types

Re: tweaking NTUP_PER_BUCKET - Mailing list pgsql-hackers

Previous

Next