Re: estimating # of distinct values - Mailing list pgsql-hackers

From	Csaba Nagy
Subject	Re: estimating # of distinct values
Date	January 7, 2011 04:00:09
Msg-id	1294234986.3889.22.camel@clnt-sysecm-cnagy
In response to	Re: estimating # of distinct values (Tom Lane <tgl@sss.pgh.pa.us>)
Responses	Re: estimating # of distinct values
List	pgsql-hackers

On Thu, 2010-12-30 at 21:02 -0500, Tom Lane wrote:
> How is an incremental ANALYZE going to work at all?

How about a kind of continuous ANALYZE?

Instead of analyzing just once and then dropping the intermediate results,
keep them on disk for all tables, then piggyback on the background
writer (or use a dedicated process if that's not algorithmically
feasible) and, before writing out buffers, update the statistics based on
the values found in the modified buffers. It could probably take a random
sample of buffers to minimize overhead, but if it is done by a
background thread the overhead could be minimal anyway on multi-core
systems.
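
To make the idea a bit more concrete, here is a rough sketch in Python
rather than anything resembling backend code. Every name in it
(ColumnStats, flush_dirty_buffers, buffer_sample_rate) is made up for
illustration, a "buffer" is just a list of column values, and the
persistent statistics are reduced to a reservoir sample. It also ignores
deletes, updates, and the fact that a modified buffer still holds
unmodified tuples, all of which a real design would have to deal with.

```python
import random


class ColumnStats:
    """Hypothetical persistent per-column statistics: a reservoir sample."""

    def __init__(self, sample_size=100, seed=None):
        self.sample_size = sample_size
        self.sample = []   # values currently kept in the reservoir
        self.seen = 0      # total number of values fed in so far
        self.rng = random.Random(seed)

    def add(self, value):
        """Feed one value into the reservoir (classic reservoir sampling)."""
        self.seen += 1
        if len(self.sample) < self.sample_size:
            self.sample.append(value)
        else:
            # Replace a random slot with probability sample_size / seen.
            j = self.rng.randrange(self.seen)
            if j < self.sample_size:
                self.sample[j] = value

    def distinct_in_sample(self):
        """Number of distinct values in the sample (not a table-wide estimate)."""
        return len(set(self.sample))


def flush_dirty_buffers(dirty_buffers, stats, buffer_sample_rate=0.1, rng=random):
    """Stand-in for the background writer's loop over modified buffers.

    A random subset of buffers is inspected and its values are folded into
    the statistics before the buffer would be written out. Returns the
    number of buffers that were inspected.
    """
    inspected = 0
    for buf in dirty_buffers:
        if rng.random() < buffer_sample_rate:
            for value in buf:
                stats.add(value)
            inspected += 1
        # A real writer would write the buffer to disk at this point.
    return inspected
```

The point of sampling whole buffers instead of every value is that
skipped buffers cost nothing beyond one random draw, so the sampling
rate directly bounds the extra work done per write.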

I'm not sure this makes sense at all, but if it does, it would deliver the
most up-to-date statistics you can think of.

Cheers,
Csaba.
