pgsql: Rotate instead of shifting hash join batch number. - Mailing list pgsql-committers

From Thomas Munro
Subject pgsql: Rotate instead of shifting hash join batch number.
Date
Msg-id E1ijXxN-0006Vm-Bm@gemulon.postgresql.org
Whole thread Raw
List pgsql-committers
Rotate instead of shifting hash join batch number.

Our algorithm for choosing batch numbers turned out not to work
effectively for multi-billion key inner relations.  We would use
more hash bits than we have, and effectively concentrate all tuples
into a smaller number of batches than we intended.  While ideally
we should switch to wider hashes, for now, change the algorithm to
one that effectively gives up bits from the bucket number when we
don't have enough bits.  That means we'll finish up with longer
bucket chains than would be ideal, but that's better than having
batches that don't fit in work_mem and can't be divided.

Batch-patch to all supported releases.

Author: Thomas Munro
Reviewed-by: Tom Lane, thanks also to Tomas Vondra, Alvaro Herrera, Andres Freund for testing and discussion
Reported-by: James Coleman
Discussion: https://postgr.es/m/16104-dc11ed911f1ab9df%40postgresql.org

Branch
------
master

Details
-------
https://git.postgresql.org/pg/commitdiff/e69d644547785cc9f079650d29118a3688bc5039

Modified Files
--------------
src/backend/executor/nodeHash.c | 13 +++++++++----
src/include/port/pg_bitutils.h  |  9 +++++++++
2 files changed, 18 insertions(+), 4 deletions(-)


pgsql-committers by date:

Previous
From: Joe Conway
Date:
Subject: pgsql: Disallow null category in crosstab_hash
Next
From: Thomas Munro
Date:
Subject: pgsql: Rotate instead of shifting hash join batch number.