Re: Proposal - improve eqsel estimates by including histogram bucket numdistinct statistics - Mailing list pgsql-hackers

From	Nathan Boley
Subject	Re: Proposal - improve eqsel estimates by including histogram bucket numdistinct statistics
Date	June 10, 2008 16:16:17
Msg-id	6fa3b6e20806101216n5dd675eak954f54701a6ce268@mail.gmail.com
In response to	Re: Proposal - improve eqsel estimates by including histogram bucket numdistinct statistics (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: Proposal - improve eqsel estimates by including histogram bucket numdistinct statistics
List	pgsql-hackers

>> If we query on values that aren't in the table, the planner will
>> always overestimate the expected number of returned rows because it (
>> implicitly ) assumes that every query will return at least 1 record.
>
> That's intentional and should not be changed.

Why? What if (somehow) we knew that there was a 90% chance that a
query would return an empty result set on a big table with 20 non-MCV
distinct values? Currently the planner would always choose a seq scan,
where an index scan might be better. Better yet, couldn't that be
optimized to *if record exists, execute seq scan*? That being said, I
think queries are generally searching for values that exist in the
table.
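
To put rough numbers on that scenario, here is a toy sketch in Python.
It is not planner code: the table size (1,000,000 rows), the assumption
that no rows belong to MCVs, and the helper names are all made up for
illustration. The only things taken from the discussion are the 20
non-MCV distinct values, the 90% chance of an empty result, and the
"at least 1 row" floor.

```python
def per_value_rows(total_rows, non_mcv_fraction, non_mcv_distinct):
    """Rows expected for one non-MCV value, assuming the non-MCV rows
    are spread evenly over the non-MCV distinct values."""
    return total_rows * non_mcv_fraction / non_mcv_distinct


def clamped_estimate(rows):
    """Toy model of an estimator that never reports fewer than 1 row."""
    return max(1.0, round(rows))


def expected_rows(rows_if_present, p_present):
    """Expected row count when the queried value is in the table only
    with probability p_present, and otherwise matches nothing."""
    return p_present * rows_if_present


# Hypothetical table: 1,000,000 rows, none in MCVs, 20 distinct values.
rows_if_present = per_value_rows(1_000_000, 1.0, 20)

# Estimate that implicitly assumes the value exists.
planner_like = clamped_estimate(rows_if_present)

# Estimate that knows there is a 90% chance of an empty result.
expected = expected_rows(rows_if_present, 0.10)

print(planner_like, expected)
```

Under these assumptions the "value exists" estimate is ten times the
expected row count, which is the kind of gap that could tip the choice
between a seq scan and an index scan.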

> I can't see the value of allowing fractional-row estimates anyway.

Neither can I, but I could probably think of cases where knowing the
SD (standard deviation) of the result set size could result in better
plans.
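
As a rough illustration of why the SD might matter, the sketch below
compares two made-up result-size distributions with the same mean row
count. The numbers and the `mean_and_sd` helper are hypothetical and
only meant to show that a single expected-rows figure hides the
difference between them.

```python
import math


def mean_and_sd(outcomes):
    """outcomes: list of (probability, row_count) pairs whose
    probabilities sum to 1. Returns (mean, standard deviation)."""
    mean = sum(p * n for p, n in outcomes)
    var = sum(p * (n - mean) ** 2 for p, n in outcomes)
    return mean, math.sqrt(var)


# Case A: the query always returns 5,000 rows.
certain = [(1.0, 5_000)]

# Case B: 90% of the time nothing matches, 10% of the time 50,000 rows do.
all_or_nothing = [(0.9, 0), (0.1, 50_000)]

mean_a, sd_a = mean_and_sd(certain)
mean_b, sd_b = mean_and_sd(all_or_nothing)

print(mean_a, sd_a)
print(mean_b, sd_b)
```

Both cases average 5,000 rows, but the second has a large spread, so a
plan that is cheap for the common empty result and tolerable for the
rare large one could be preferable there.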

-Nathan