Re: bad estimates - Mailing list pgsql-performance

From Ken Geis
Subject Re: bad estimates
Date
Msg-id 3F4F0777.8090503@speakeasy.org
Whole thread Raw
In response to Re: bad estimates  (Bruno Wolff III <bruno@wolff.to>)
Responses Re: bad estimates
Re: bad estimates
List pgsql-performance
Sorry, all, to wipe out the context, but it was getting a little long.

Bruno Wolff III wrote:
> I am calling it quits for tonight, but will check back tomorrow
> to see how things turned out.

I went through the code (7.4 beta2) that estimates the cost of an index
scan path.  What I need to be sure of is that when running a query in
pgsql that uses only the columns that are in an index, the underlying
table need not be accessed.  I know that Oracle does this.

The cost_index function is assuming that after finding an entry in the
index it will be looking it up in the underlying table.  That table is
not well correlated to the index, so it is assuming (in the worst case)
a random page lookup for each of 17 million records!  In my case, if the
underlying table is indeed not touched, the estimated cost is 1000 times
the real cost.

63388.624000    to scan the index
67406506.915595 to scan the index and load a random page for each entry


Ken



pgsql-performance by date:

Previous
From: "Alexander Priem"
Date:
Subject: Re: Indexing question
Next
From: Tomasz Myrta
Date:
Subject: Re: Indexing question