Home > mailing lists

Re: query optimization scenarios 17,701 times faster!!! - Mailing list pgsql-hackers

From	Tom Lane
Subject	Re: query optimization scenarios 17,701 times faster!!!
Date	April 24, 2003 23:30:41
Msg-id	3833.1051230640@sss.pgh.pa.us Whole thread Raw
In response to	Re: query optimization scenarios 17,701 times faster!!! (Stephan Szabo <sszabo@megazone23.bigpanda.com>)
List	pgsql-hackers

Tree view

Stephan Szabo <sszabo@megazone23.bigpanda.com> writes:
> The particular conversion you're doing for that query requires more than
> just that AFAICS. The outerness of the join makes it so you know there
> won't be 0 rows after the join, and the distinct cancels out the
> possibility of multiple rows. Both of those conditions seem necessary to
> make the conversion valid as well.

Yeah.  I think the optimization rule being proposed here is something
like "if a table is not used in the SELECT output list, *and* it's on
the nullable side of an outer join (so that it can't cause rows to be
omitted from the output), *and* there's a DISTINCT (so that generating
multiple matched rows isn't interesting either) then don't bother to
include the table in the join tree".  But I'm not convinced that's
a safe transformation --- are there any other conditions that would
have to be checked?  And of course the $64 question is whether the
combination would hit often enough to make it worth the cycles to check
for.  It seems like a fairly unusual case to me.

The variant where you know there are not multiple rows because you're
joining on a unique column, rather than depending on DISTINCT, might
be more useful in practice ... but I'm still not convinced it's worth
checking for.
        regards, tom lane

pgsql-hackers by date:

From: Rod Taylor
Date: 24 April 2003, 22:14:16
Subject: Re: Are we losing momentum?

From: Kevin Lo
Date: 25 April 2003, 01:01:01
Subject: Re: Postgresql Filesystem

Re: query optimization scenarios 17,701 times faster!!! - Mailing list pgsql-hackers

Previous

Next