Home > mailing lists

Re: full outer performance problem - Mailing list pgsql-performance

From	Tom Lane
Subject	Re: full outer performance problem
Date	June 8, 2005 11:44:25
Msg-id	1275.1118239415@sss.pgh.pa.us Whole thread Raw
In response to	full outer performance problem (Kim Bisgaard <kib+pg@dmi.dk>)
Responses	Re: full outer performance problem
List	pgsql-performance

Tree view

Kim Bisgaard <kib+pg@dmi.dk> writes:
> SELECT station_id, timeobs,temp_grass, temp_dry_at_2m
>         FROM temp_dry_at_2m a
>         FULL OUTER JOIN temp_grass b
>         USING (station_id, timeobs)
>         WHERE station_id = 52981
>           AND timeobs = '2004-1-1 0:0:0'

> explain analyse SELECT b.station_id, b.timeobs,temp_grass, temp_dry_at_2m
>         FROM temp_dry_at_2m a
>         FULL OUTER JOIN temp_grass b
>         USING (station_id, timeobs)
>         WHERE b.station_id = 52981
>           AND b.timeobs = '2004-1-1 0:0:0'

> Why will PostgreSQL not use the same plan for both these queries - they
> are virtually identical??

Because they're semantically completely different.  The second query is
effectively a RIGHT JOIN, because join rows in which b is all-null will
be thrown away by the WHERE.  The optimizer sees this (note your second
plan doesn't use a Full Join step anywhere) and is able to produce a
much better plan.  Full outer join is difficult to optimize, in part
because we have no choice but to use a merge join for it --- the other
join types don't support full join.

            regards, tom lane

pgsql-performance by date:

From: K C Lau
Date: 08 June 2005, 11:38:46
Subject: Re: SELECT DISTINCT Performance Issue

From: George Essig
Date: 08 June 2005, 11:55:06
Subject: Re: SELECT DISTINCT Performance Issue

Re: full outer performance problem - Mailing list pgsql-performance

Previous

Next