Re: Hash Join cost estimates - Mailing list pgsql-hackers

From	Tom Lane
Subject	Re: Hash Join cost estimates
Date	April 4, 2013 18:19:40
Msg-id	5420.1365099576@sss.pgh.pa.us
In response to	Re: Hash Join cost estimates (Stephen Frost <sfrost@snowman.net>)
Responses	Re: Hash Join cost estimates Re: Hash Join cost estimates
List	pgsql-hackers

Stephen Frost <sfrost@snowman.net> writes:
> I've been fiddling with this on the very much larger overall database
> where this test case came from and have found that hashing the large
> table can actually be *faster* and appears to cause a more consistent
> and constant amount of disk i/o (which is good).

Interesting.

> What I'm trying to get at in this overall email is: why in the world is
> it so expensive to do hash lookups?

perf or oprofile reveal anything?

Also, I assume that the cases you are looking at are large enough that
even the "small" table doesn't fit in a single hash batch?  It could
well be that the answer has to do with some bogus or at least
unintuitive behavior of the batching process, and it isn't really at all
a matter of individual hash lookups being slow.
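
To make the batching question concrete, here is a rough back-of-the-envelope
sketch of how many batches a hash join might need for a given inner relation
and work_mem. It is not PostgreSQL's actual sizing logic: the function name
and parameters are hypothetical, and it assumes the batch count is simply the
smallest power of two that lets each batch fit in work_mem, ignoring per-tuple
overhead, bucket arrays, and skew handling. The number that was really used
shows up as "Batches" on the Hash node in EXPLAIN ANALYZE output.

```python
import math


def estimate_hash_batches(inner_rows, row_width_bytes, work_mem_kb):
    """Estimate the number of hash join batches (illustrative only).

    Assumption: the inner relation is split into the smallest power-of-two
    number of batches such that one batch fits in work_mem. Real sizing
    also accounts for tuple overhead and bucket memory, so treat the
    result as a lower bound rather than a prediction.
    """
    inner_bytes = inner_rows * row_width_bytes
    work_mem_bytes = work_mem_kb * 1024

    # Everything fits in memory: a single batch, nothing spills to disk.
    if inner_bytes <= work_mem_bytes:
        return 1

    # Round the size ratio up to the next power of two.
    ratio = math.ceil(inner_bytes / work_mem_bytes)
    return 1 << (ratio - 1).bit_length()


if __name__ == "__main__":
    # Hypothetical inputs: 10 million rows of 100 bytes with work_mem = 1MB.
    print(estimate_hash_batches(10_000_000, 100, 1024))
```

If the estimate for the "small" table comes out well above 1, the join is
multi-batch and the cost is likely dominated by writing and re-reading batch
files rather than by individual hash probes.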

(You never did mention what work_mem setting you're testing, anyway.)
        regards, tom lane
