Re: proposal : cross-column stats - Mailing list pgsql-hackers

From Tom Lane
Subject Re: proposal : cross-column stats
Date
Msg-id 7913.1292254732@sss.pgh.pa.us
In response to Re: proposal : cross-column stats  (Robert Haas <robertmhaas@gmail.com>)
Responses Re: proposal : cross-column stats  (Tomas Vondra <tv@fuzzy.cz>)
Re: proposal : cross-column stats  (Simon Riggs <simon@2ndQuadrant.com>)
List pgsql-hackers
Robert Haas <robertmhaas@gmail.com> writes:
> On Sun, Dec 12, 2010 at 9:16 PM, Tomas Vondra <tv@fuzzy.cz> wrote:
>> The proposed solution is based on contingency tables, built for selected
>> groups of columns (not for each possible group). And the contingency
>> table gives you the ability to estimate the probabilities needed to
>> compute the selectivity. Or am I missing something?

> Well, I'm not real familiar with contingency tables, but it seems like
> you could end up needing to store a huge amount of data to get any
> benefit out of it, in some cases.

The reason that this wasn't done years ago is precisely that nobody's
figured out how to do it with a tolerable amount of stats data and a
tolerable amount of processing time (both at ANALYZE time and during
query planning).  It's not hard to see what we'd ideally like to do;
it's getting from there to something useful in production that's hard.
        regards, tom lane
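
To illustrate the trade-off discussed above, here is a minimal Python sketch. It is not from the thread and is not how PostgreSQL implements anything. It assumes a two-column contingency table built as a plain counter over sampled (a, b) pairs; the function names and the sample data are hypothetical. It compares the selectivity of "a = 3 AND b = 3" read from the table against the estimate obtained by assuming the columns are independent, and shows how the worst-case number of cells grows as the product of the per-column distinct counts, which is the storage concern raised in the quoted text.

```python
from collections import Counter
from math import prod


def build_contingency(rows):
    """Count how often each (a, b) value pair occurs in the sample."""
    return Counter(rows)


def joint_selectivity(table, total, a, b):
    """Fraction of sampled rows matching a = ... AND b = ..., read from the table."""
    return table[(a, b)] / total


def independent_selectivity(rows, a, b):
    """Estimate that multiplies per-column selectivities (assumes independence)."""
    total = len(rows)
    sel_a = sum(1 for r in rows if r[0] == a) / total
    sel_b = sum(1 for r in rows if r[1] == b) / total
    return sel_a * sel_b


def worst_case_cells(distinct_counts):
    """Upper bound on table cells: product of the distinct counts per column."""
    return prod(distinct_counts)


# Hypothetical sample: two perfectly correlated columns with 10 distinct values.
rows = [(i % 10, i % 10) for i in range(1000)]
table = build_contingency(rows)

joint = joint_selectivity(table, len(rows), 3, 3)
indep = independent_selectivity(rows, 3, 3)

print(f"contingency-table estimate: {joint:.3f}")
print(f"independence estimate:      {indep:.3f}")
print(f"cells actually stored:      {len(table)}")
print(f"worst case, 3 columns x 1000 distinct values: {worst_case_cells([1000, 1000, 1000])}")
```

With correlated data the independence estimate is off by a factor of ten in this sample, while the worst-case cell count for even three moderately wide columns is far too large to store verbatim. Any practical scheme would need to compress or truncate the table in some way.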

