Re: ANALYZE sampling is too good - Mailing list pgsql-hackers

From Claudio Freire
Subject Re: ANALYZE sampling is too good
Date
Msg-id CAGTBQpY+znTQOujv3yV38NzF06OnK_wakudNb+ZRRL8JsG6QBQ@mail.gmail.com
Whole thread Raw
In response to Re: ANALYZE sampling is too good  (Josh Berkus <josh@agliodbs.com>)
List pgsql-hackers
On Thu, Dec 12, 2013 at 3:56 PM, Josh Berkus <josh@agliodbs.com> wrote:
>
> Estimated grouping should, however, affect MCVs.  In cases where we
> estimate that grouping levels are high, the expected % of observed
> values should be "discounted" somehow.  That is, with total random
> distribution you have a 1:1 ratio between observed frequency of a value
> and assumed frequency.  However, with highly grouped values, you might
> have a 2:1 ratio.

Cross validation can help there. But it's costly.



pgsql-hackers by date:

Previous
From: Josh Berkus
Date:
Subject: Re: ANALYZE sampling is too good
Next
From: Jeff Janes
Date:
Subject: Re: ANALYZE sampling is too good