Home > mailing lists

Re: benchmarking the query planner - Mailing list pgsql-hackers

From	Tom Lane
Subject	Re: benchmarking the query planner
Date	December 12, 2008 14:19:44
Msg-id	8020.1229105929@sss.pgh.pa.us Whole thread Raw
In response to	Re: benchmarking the query planner (Simon Riggs <simon@2ndQuadrant.com>)
Responses	Re: benchmarking the query planner Re: benchmarking the query planner
List	pgsql-hackers

Tree view

Simon Riggs <simon@2ndQuadrant.com> writes:
> As I said, we would only increase sample for ndistinct, not for others.

How will you do that?  Keep in mind that one of the things we have to do
to compute ndistinct is to sort the sample.  ISTM that the majority of
the cost of a larger sample is going to get expended anyway ---
certainly we could form the histogram using the more accurate data at
precisely zero extra cost, and I think we have also pretty much done all
the work for MCV collection by the time we finish counting the number of
distinct values.

I seem to recall Greg suggesting that there were ways to estimate
ndistinct without sorting, but short of a fundamental algorithm change
there's not going to be a win here.

> Right now we may as well use a random number generator.

Could we skip the hyperbole please?
        regards, tom lane

pgsql-hackers by date:

From: Simon Riggs
Date: 12 December 2008, 14:19:05
Subject: Re: benchmarking the query planner

From: Tom Lane
Date: 12 December 2008, 14:21:57
Subject: Re: benchmarking the query planner

Re: benchmarking the query planner - Mailing list pgsql-hackers

Previous

Next