Home > mailing lists

Re: Better default_statistics_target - Mailing list pgsql-patches

From	Simon Riggs
Subject	Re: Better default_statistics_target
Date	December 6, 2007 15:34:30
Msg-id	1196969682.4255.455.camel@ebony.site Whole thread Raw
In response to	Re: Better default_statistics_target (Chris Browne <cbbrowne@acm.org>)
List	pgsql-patches

Tree view

On Wed, 2007-12-05 at 15:13 -0500, Chris Browne wrote:

> I have the theory (thus far not borne out by any numbers) that it
> might be a useful approach to try to go through the DB schema and use
> what information is there to try to come up with better numbers on a
> per-column basis.

Yeh, agreed.

The difficulty is making this work for generic datatypes.

>  - Datestamps tend to imply temporal dispersion, ergo "somewhat fewer
>    bins."  Similar for floats.

Hmmm, not sure about that one. Some date/time columns can change very
quickly over time, so the stats are frequently out of date.

> Then could come a "second order" perspective, where data would
> actually get sampled from pg_statistics.
>
>  - If we look at the number of distinct histogram bins used, for a
>    particular column, and find that there are some not used, we might
>    drop bins.

The histograms are height balanced, so they are always all used.

--
  Simon Riggs
  2ndQuadrant  http://www.2ndQuadrant.com

pgsql-patches by date:

From: "Merlin Moncure"
Date: 06 December 2007, 15:12:32
Subject: Re: PQParam version 0.5

From: Tom Lane
Date: 06 December 2007, 20:19:42
Subject: Proposed patch to make mergejoin cost estimation more symmetric

Re: Better default_statistics_target - Mailing list pgsql-patches

Previous

Next