Home > mailing lists

Re: [HACKERS] Hash Functions - Mailing list pgsql-hackers

From	Jeff Davis
Subject	Re: [HACKERS] Hash Functions
Date	May 14, 2017 05:01:17
Msg-id	CAMp0ube+TYwN4Uu1kF+cQWjcAPvozCr_=ChezFfZmL26DZKenQ@mail.gmail.com Whole thread Raw
In response to	Re: [HACKERS] Hash Functions (Robert Haas <robertmhaas@gmail.com>)
List	pgsql-hackers

Tree view

On Fri, May 12, 2017 at 12:38 PM, Robert Haas <robertmhaas@gmail.com> wrote:
> That is a good question.  I think it basically amounts to this
> question: is hash partitioning useful, and if so, for what?

Two words: parallel query. To get parallelism, one of the best
approaches is dividing the data, then doing as much work as possible
before combining it again. If you have hash partitions on some key,
then you can do partition-wise joins or partition-wise aggregation on
that key in parallel with no synchronization/communication overhead
(until the final result).

You've taken postgres pretty far in this direction already; hash
partitioning will take it one step further by allowing more pushdowns
and lower sync/communication costs.

Some of these things can be done with range partitioning, too, but see
my other message here:

https://www.postgresql.org/message-id/CAMp0ubfNMSGRvZh7N7TRzHHN5tz0ZeFP13Aq3sv6b0H37fdcPg%40mail.gmail.com

Regards,   Jeff Davis

pgsql-hackers by date:

From: Andres Freund
Date: 14 May 2017, 04:49:21
Subject: Re: [HACKERS] walsender & parallelism

From: Andres Freund
Date: 14 May 2017, 05:08:29
Subject: Re: [HACKERS] Hash Functions

Re: [HACKERS] Hash Functions - Mailing list pgsql-hackers

Previous

Next