Home > mailing lists

Re: 7.4 vs 7.3 ( hash join issue ) - Mailing list pgsql-performance

From	Greg Stark
Subject	Re: 7.4 vs 7.3 ( hash join issue )
Date	September 22, 2004 18:38:20
Msg-id	87sm9acih3.fsf@stark.xeocode.com Whole thread Raw
In response to	Re: 7.4 vs 7.3 ( hash join issue ) (Dennis Bjorklund <db@zigo.dhs.org>)
Responses	Re: 7.4 vs 7.3 ( hash join issue )
List	pgsql-performance

Tree view

Dennis Bjorklund <db@zigo.dhs.org> writes:

> On 22 Sep 2004, Greg Stark wrote:
>
> > Actually this looks like it's arguably a bug to me. Why does the hash
> > join execute the sequential scan at all? Shouldn't it also like the
> > merge join recognize that the other hashed relation is empty and skip
> > the sequential scan entirely?
>
> I'm not sure you can classify that as a bug. It's just that he in one of
> the plans started with the empty scan and bacause of that didn't need
> the other, but with the hash join it started with the table that had 16
> rows and then got to the empty one.

No, postgres didn't do things in reverse order. It hashed the empty table and
then went ahead and checked every record of the non-empty table against the
empty hash table.

Reading the code there's no check for this, and it seems like it would be a
useful low-cost little optimization.

I think postgres normally hashes the table it thinks is smaller, so you do
join against an empty relation it should end up on the hash side of the hash
join and allow postgres to avoid the scan of the outer table.

--
greg

pgsql-performance by date:

From: Dennis Bjorklund
Date: 22 September 2004, 16:22:51
Subject: Re: 7.4 vs 7.3 ( hash join issue )

From: Tom Lane
Date: 22 September 2004, 18:56:52
Subject: Re: 7.4 vs 7.3 ( hash join issue )

Re: 7.4 vs 7.3 ( hash join issue ) - Mailing list pgsql-performance

Previous

Next