Re: Sequence vs. Index Scan - Mailing list pgsql-sql

From Tom Lane
Subject Re: Sequence vs. Index Scan
Date
Msg-id 5447.1178548679@sss.pgh.pa.us
Whole thread Raw
In response to Re: Sequence vs. Index Scan  (Andrew Sullivan <ajs@crankycanuck.ca>)
Responses Re: Sequence vs. Index Scan  ("Aaron Bono" <postgresql@aranya.com>)
List pgsql-sql
Andrew Sullivan <ajs@crankycanuck.ca> writes:
> On Sun, May 06, 2007 at 01:45:54PM -0500, Aaron Bono wrote:
>> So my conclusion is that the function is being treated as volatile even
>> though it is stable because the number of records is small. 

> I don't think that's the issue.  If this is dependent on the
> number of records, then for some reason the way the data is
> structured means that the planner thinks a seqscan's a better bet. 
> This is probably due to distribution of the values.  You could try
> increasing the stats sample, and see if that helps.

It's got nothing to do with distribution, just with numbers of pages to
fetch.  You'll nearly always get a seqscan plan if there are only a
couple of pages in the table, simply because it would take more I/O to
read the index too.

The reason this is a problem in this example is that the function is so
expensive to execute.  The planner should be avoiding the seqscan on the
basis of CPU cost not I/O cost, but it doesn't know that the function is
expensive enough to drive the decision that way.

In CVS HEAD (8.3-to-be) we've added a "cost" property to functions,
which provides a clean way to fix this issue, but there's no good way to
deal with it in existing releases :-(
        regards, tom lane


pgsql-sql by date:

Previous
From: Michael Fuhr
Date:
Subject: Re: How to use function PointN?
Next
From: Richard Broersma Jr
Date:
Subject: Temporal Table Relations and Referential Integrity