Home > mailing lists

Re: Sequence vs. Index Scan - Mailing list pgsql-sql

From	Tom Lane
Subject	Re: Sequence vs. Index Scan
Date	May 7, 2007 11:38:17
Msg-id	5447.1178548679@sss.pgh.pa.us Whole thread Raw
In response to	Re: Sequence vs. Index Scan (Andrew Sullivan <ajs@crankycanuck.ca>)
Responses	Re: Sequence vs. Index Scan
List	pgsql-sql

Tree view

Andrew Sullivan <ajs@crankycanuck.ca> writes:
> On Sun, May 06, 2007 at 01:45:54PM -0500, Aaron Bono wrote:
>> So my conclusion is that the function is being treated as volatile even
>> though it is stable because the number of records is small. 

> I don't think that's the issue.  If this is dependent on the
> number of records, then for some reason the way the data is
> structured means that the planner thinks a seqscan's a better bet. 
> This is probably due to distribution of the values.  You could try
> increasing the stats sample, and see if that helps.

It's got nothing to do with distribution, just with numbers of pages to
fetch.  You'll nearly always get a seqscan plan if there are only a
couple of pages in the table, simply because it would take more I/O to
read the index too.

The reason this is a problem in this example is that the function is so
expensive to execute.  The planner should be avoiding the seqscan on the
basis of CPU cost not I/O cost, but it doesn't know that the function is
expensive enough to drive the decision that way.

In CVS HEAD (8.3-to-be) we've added a "cost" property to functions,
which provides a clean way to fix this issue, but there's no good way to
deal with it in existing releases :-(
        regards, tom lane

pgsql-sql by date:

From: Michael Fuhr
Date: 07 May 2007, 10:13:55
Subject: Re: How to use function PointN?

From: Richard Broersma Jr
Date: 07 May 2007, 12:02:58
Subject: Temporal Table Relations and Referential Integrity

Re: Sequence vs. Index Scan - Mailing list pgsql-sql

Previous

Next