Re: Very slow left outer join - Mailing list pgsql-performance

From Tom Lane
Subject Re: Very slow left outer join
Date
Msg-id 26149.1180495108@sss.pgh.pa.us
Whole thread Raw
In response to Re: Very slow left outer join  (Klint Gore <kg@kgb.une.edu.au>)
Responses Re: Very slow left outer join
List pgsql-performance
Klint Gore <kg@kgb.une.edu.au> writes:
> On Tue, 29 May 2007 17:16:57 -0700, "Tyrrill, Ed" <tyrrill_ed@emc.com> wrote:
>> mdsdb=# explain analyze select backupobjects.record_id from
>> backupobjects left outer join backup_location using(record_id) where
>> backup_id = 1071;

> Why are you using left join?
> The where condition is going to force the row to exist.

Which indeed the planner figured out (note the lack of any mention of
left join in the EXPLAIN result).  Michael put his finger on the problem
though: there's something way off about the rowcount estimate here:

>    ->  Bitmap Heap Scan on backup_location  (cost=3831.20..360207.21
> rows=436915 width=8) (actual time=94.375..97.688 rows=2789 loops=1)
>          Recheck Cond: (backup_id = 1071)
>          ->  Bitmap Index Scan on backup_location_bid
> (cost=0.00..3831.20 rows=436915 width=0) (actual time=84.239..84.239
> rows=2789 loops=1)
>                Index Cond: (backup_id = 1071)

With such a simple index condition the planner really ought to be able
to come close to the right rowcount estimate.  Check for vacuuming
problems, check for lack of ANALYZE, consider whether you need to bump
up the statistics target ...

            regards, tom lane

pgsql-performance by date:

Previous
From: "Rajesh Kumar Mallah"
Date:
Subject: Re: setting up raid10 with more than 4 drives
Next
From: "Luke Lonergan"
Date:
Subject: Re: setting up raid10 with more than 4 drives