Home > mailing lists

Re: Re: Which qsort is used - Mailing list pgsql-hackers

From	Tom Lane
Subject	Re: Re: Which qsort is used
Date	December 17, 2005 05:41:06
Msg-id	3861.1134801659@sss.pgh.pa.us Whole thread Raw
In response to	Re: Re: Which qsort is used ("Dann Corbit" <DCorbit@connx.com>)
List	pgsql-hackers

Tree view

"Dann Corbit" <DCorbit@connx.com> writes:
>> I've still got a problem with these checks; I think they are a net
>> waste of cycles on average.

> The benchmarks say that they (order checks) are a good idea on average
> for ordered data, random data, and partly ordered data.

There are lies, damn lies, and benchmarks ;-)

The problem with citing a benchmark for this discussion is that a
benchmark can't tell you anything about real-world probabilities;
it only tells you about the probabilities occuring in the benchmark
case.  You need to make the case that the benchmark reflects the
real world, which you didn't.

> If you trace the algorithms in a debugger you will be surprised at how
> often the partitions are ordered, even with random sets as input.

Well, I do agree that checking for orderedness on small partitions would
succeed more often than on larger partitions or the whole file --- but
the code-as-given checks all the way down.  Moreover, the argument given
for spending these cycles is that insertion sort sucks on reverse-order
input ... where "sucks" means that it spends O(N^2) time.  But it spends
O(N^2) in the average case, too.
        regards, tom lane

pgsql-hackers by date:

From: "Dann Corbit"
Date: 17 December 2005, 05:15:03
Subject: Re: Re: Which qsort is used

From: "Dann Corbit"
Date: 17 December 2005, 05:44:01
Subject: Re: Re: Which qsort is used

Re: Re: Which qsort is used - Mailing list pgsql-hackers

Previous

Next