Home > mailing lists

Re: ANALYZE sampling is too good - Mailing list pgsql-hackers

From	Florian Pflug
Subject	Re: ANALYZE sampling is too good
Date	December 13, 2013 01:25:32
Msg-id	D1C450BF-DFBF-4330-8C7B-5F8A69BE49D1@phlo.org Whole thread Raw
In response to	Re: ANALYZE sampling is too good (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: ANALYZE sampling is too good
List	pgsql-hackers

Tree view

On Dec12, 2013, at 19:29 , Tom Lane <tgl@sss.pgh.pa.us> wrote:
> However ... where this thread started was not about trying to reduce
> the remaining statistical imperfections in our existing sampling method.
> It was about whether we could reduce the number of pages read for an
> acceptable cost in increased statistical imperfection.

True, but Jeff's case shows that even the imperfections of the current
sampling method are larger than what the n_distinct estimator expects.

Making it even more biased will thus require us to rethink how we
obtain a n_distinct estimate or something equivalent. I don't mean that
as an argument against changing the sampling method, just as something
to watch out for.

best regards,
Florian Pflug

pgsql-hackers by date:

From: Fabrízio de Royes Mello
Date: 13 December 2013, 00:59:01
Subject: Re: Time-Delayed Standbys

From: Andres Freund
Date: 13 December 2013, 01:41:09
Subject: Re: pgsql: Fix a couple of bugs in MultiXactId freezing

Re: ANALYZE sampling is too good - Mailing list pgsql-hackers

Previous

Next