Home > mailing lists

Re: Collect frequency statistics for arrays - Mailing list pgsql-hackers

From	Alexander Korotkov
Subject	Re: Collect frequency statistics for arrays
Date	March 1, 2012 10:57:56
Msg-id	CAPpHfdu3tSp0d=APy6swyG4H1d9JpkSEPYZ_JjqZh=V+-L9ZYA@mail.gmail.com Whole thread
In response to	Re: Collect frequency statistics for arrays (Alexander Korotkov <aekorotkov@gmail.com>)
Responses	Re: Collect frequency statistics for arrays Re: Collect frequency statistics for arrays
List	pgsql-hackers

Tree view

On Thu, Mar 1, 2012 at 1:19 AM, Alexander Korotkov <aekorotkov@gmail.com> wrote:

On Thu, Mar 1, 2012 at 1:09 AM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
That seems like a pretty narrow, uncommon use-case. Also, to get
accurate stats for such queries that way, you'd need really enormous
histograms. I doubt that the existing parameters for histogram size
will permit meaningful estimation of more than the first array entry
(since we don't make the histogram any larger than we do for a scalar
column).

The real point here is that the fact that we're storing btree-style
stats for arrays is an accident, backed into by having added btree
comparators for arrays plus analyze.c's habit of applying default
scalar-oriented analysis functions to any type without an explicit
typanalyze entry. I don't recall that we ever thought hard about
it or showed that those stats were worth anything.

OK. I don't object to removing btree stats from arrays.
What do you thinks about pg_stats view in this case? Should it combine values histogram and array length histogram in single column like do for MCV and MCELEM?

Btree statistics for arrays and additional statistics slot are removed from attached version of patch. pg_stats view is untouched for while.

------
With best regards,
Alexander Korotkov.

Attachment

arrayanalyze-0.13.patch.gz

pgsql-hackers by date:

From: Robert Haas
Date: 01 March 2012, 10:54:32
Subject: performance results on IBM POWER7

From: Robert Haas
Date: 01 March 2012, 11:00:27
Subject: Re: Collect frequency statistics for arrays

Re: Collect frequency statistics for arrays - Mailing list pgsql-hackers

Attachment

Previous

Next