Home > mailing lists

Re: index / sequential scan problem - Mailing list pgsql-performance

From	Tom Lane
Subject	Re: index / sequential scan problem
Date	July 17, 2003 15:12:39
Msg-id	5586.1058465575@sss.pgh.pa.us Whole thread Raw
In response to	Re: index / sequential scan problem (Fabian Kreitner <fabian.kreitner@ainea-ag.de>)
Responses	Re: index / sequential scan problem
List	pgsql-performance

Tree view

Fabian Kreitner <fabian.kreitner@ainea-ag.de> writes:
> That is what I read too and is why Im confused that the index is indeed
> executing faster. Can this be a problem with the hardware and/or postgress
> installation?

I think the actual issue here is that you are executing the EXISTS
subplan over and over, once for each outer row.  The planner's cost
estimate for EXISTS is based on the assumption that you do it once
... in which scenario the seqscan very possibly is cheaper.  However,
when you do the EXISTS subplan over and over for many outer rows, you
get a savings from the fact that the index and table pages soon get
cached in memory.  The seqscan plan gets a savings too, since the table
is small enough to fit in memory, but once everything is in memory the
indexscan plan is faster.

There's been some discussion on pghackers about how to teach the planner
to account for repeated executions of subplans, but we have not come up
with a good solution yet.

For the moment, what people tend to do if they know their database is
small enough to mostly stay in memory is to reduce random_page_cost to
make the planner favor indexscans.  If you know the database is entirely
cached then the theoretically correct value of random_page_cost is 1.0
(since fetching any page will cost the same, if it's all in RAM).  I'd
recommend against adopting that as a default, but a lot of people find
that setting it to 2.0 or so seems to model their situation better than
the out-of-the-box 4.0.

            regards, tom lane

pgsql-performance by date:

From: Tom Lane
Date: 17 July 2003, 15:02:50
Subject: Re: index / sequential scan problem

From: Tom Lane
Date: 17 July 2003, 15:26:48
Subject: Re: Relation of indices to ANALYZE

Re: index / sequential scan problem - Mailing list pgsql-performance

Previous

Next