Re: [GENERAL] how to get accurate values in pg_statistic (continued) - Mailing list pgsql-performance

From Tom Lane
Subject Re: [GENERAL] how to get accurate values in pg_statistic (continued)
Date
Msg-id 21962.1063254659@sss.pgh.pa.us
Whole thread Raw
In response to Re: [GENERAL] how to get accurate values in pg_statistic (continued)  (Christopher Browne <cbbrowne@libertyrms.info>)
Responses Re: [GENERAL] how to get accurate values in pg_statistic
List pgsql-performance
Christopher Browne <cbbrowne@libertyrms.info> writes:
> The "right answer" for most use seems likely to involve:
>  a) Getting an appropriate number of bins (I suspect 10 is a bit
>     small, but I can't justify that mathematically), and

I suspect that also, but I don't have real evidence for it either.
We've heard complaints from a number of people for whom it was indeed
too small ... but that doesn't prove it's not appropriate in the
majority of cases ...

> Does the sample size change if you increase the number of bins?

Yes, read the comments in backend/commands/analyze.c.

> Do we also need a parameter to control sample size?

Not if the paper I read before writing that code is correct.

            regards, tom lane

pgsql-performance by date:

Previous
From: Christopher Browne
Date:
Subject: Re: [osdldbt-general] Re: [GENERAL] how to get accurate
Next
From: "Christopher Kings-Lynne"
Date:
Subject: Re: Reading data in bulk - help?