Home > mailing lists

Re: statistic target and sample rate - Mailing list pgsql-general

From	Tom Lane
Subject	Re: statistic target and sample rate
Date	July 14, 2021 14:30:29
Msg-id	3484552.1626273029@sss.pgh.pa.us Whole thread Raw
In response to	statistic target and sample rate (Luca Ferrari <fluca1978@gmail.com>)
List	pgsql-general

Tree view

Luca Ferrari <fluca1978@gmail.com> writes:
> Therefore my question is about how the statistic collectore decides
> about the number of tuples to be sampled.

It's basically 300 times the largest statistics target:

https://git.postgresql.org/gitweb/?p=postgresql.git;a=blob;f=src/backend/commands/analyze.c;h=0c9591415e4b97dd5c5e693af1860294284a1575;hb=HEAD#l1919

Per that comment, there is good math backing this choice for the task
of making a histogram.  It's a little shakier for other sorts of
statistics --- notably, for n_distinct estimation, the error can still
be really bad.

            regards, tom lane

pgsql-general by date:

From: Laura Smith
Date: 14 July 2021, 13:18:51
Subject: Re: returning setof from insert ?

From: Sasha Aliashkevich
Date: 14 July 2021, 14:36:22
Subject: ERROR: cannot freeze committed xmax

Re: statistic target and sample rate - Mailing list pgsql-general

Previous

Next