Home > mailing lists

Re: Hash Join cost estimates - Mailing list pgsql-hackers

From	Stephen Frost
Subject	Re: Hash Join cost estimates
Date	April 4, 2013 18:36:26
Msg-id	20130404183622.GJ4361@tamriel.snowman.net Whole thread Raw
In response to	Re: Hash Join cost estimates (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: Hash Join cost estimates
List	pgsql-hackers

Tree view

* Tom Lane (tgl@sss.pgh.pa.us) wrote:
> > What I'm trying to get at in this overall email is: why in the world is
> > it so expensive to do hash lookups?
>
> perf or oprofile reveal anything?

Working on a test case actually- I've got one now:
http://snowman.net/~sfrost/test_case2.sql

In this example, hashing the large table is actually 2 seconds *faster*
than hashing the small table (again, all on my laptop).

> Also, I assume that the cases you are looking at are large enough that
> even the "small" table doesn't fit in a single hash batch?

No, quite the opposite, sorry for not mentioning that before.  Either
side fits completely into memory w/ a single batch.  The explain
analyze's that I posted before show that, either way, there's only one
batch involved.

> (You never did mention what work_mem setting you're testing, anyway.)

With the test case above (where I got a 2s faster run time by hashing
the big table) used a work_mem of 1GB.
Thanks!
    Stephen

pgsql-hackers by date:

From: Tom Lane
Date: 04 April 2013, 18:19:40
Subject: Re: Hash Join cost estimates

From: Dimitri Fontaine
Date: 04 April 2013, 18:53:47
Subject: Re: Multi-pass planner

Re: Hash Join cost estimates - Mailing list pgsql-hackers

Previous

Next