Re: gaussian distribution pgbench - Mailing list pgsql-hackers

From Gavin Flower
Subject Re: gaussian distribution pgbench
Date
Msg-id 53B4649D.2020403@archidevsys.co.nz
In response to Re: gaussian distribution pgbench  (Fabien COELHO <coelho@cri.ensmp.fr>)
Responses Re: gaussian distribution pgbench  (Fabien COELHO <coelho@cri.ensmp.fr>)
List pgsql-hackers
<div class="moz-cite-prefix">On 02/07/14 21:05, Fabien COELHO wrote:<br /></div><blockquote
cite="mid:alpine.DEB.2.10.1407021046080.22369@sto" type="cite"><br /> Hello Mitsumasa-san, <br /><br /><blockquote
type="cite">And I'm also interested in your "decile percents" output like under <br /> followings, <br /> decile
percents: 39.6% 24.0% 14.6% 8.8% 5.4% 3.3% 2.0% 1.2% 0.7% 0.4% <br /></blockquote><br /> Sure, I'm really fine with
that. <br /><br /><blockquote type="cite">I think that it is easier than before. Sum of decile percents is just 100%.
<br /></blockquote><br /> That's a good property :-) <br /><br /><blockquote type="cite">However, I don't prefer
"highest/lowest percentage" because it will be confused with decile percentage for users, and anyone cannot understand
this digits. I cannot understand "4.9%, 0.0%" when I see the first time. Then, I checked the source code, I understood
it:( It's not good design... #Why this parameter use 100? <br /></blockquote><br /> What else? People have ten fingers
and like powers of 10, and are used to percents? <br /><br /><blockquote type="cite">So I'd like to remove it if you
like. It will be more simple. <br /></blockquote><br /> I think that for the exponential distribution it helps,
especially for high threshold, to have the lowest/highest percent density. For low thresholds, the decile is also
definitely useful. So I'm fine with both outputs as you have put them. <br /><br /> I have just updated the wording so
that it may be clearer: <br /><br />  decile percents: 69.9% 21.0% 6.3% 1.9% 0.6% 0.2% 0.1% 0.0% 0.0% 0.0% <br />
 probability of first/last percent of the range: 11.3% 0.0% <br /><br /><blockquote type="cite">Attached patch is fixed
version, please confirm it. <br /></blockquote><br /> Attached a v15 which just fixes a typo and the above wording
update. I'm validating it for committers. <br /><br /><blockquote type="cite">#Of course, World Cup is being held now.
I'm not hurry at all. <br /></blockquote><br /> I'm not a soccer kind of person, so it does not influence my
availability. :-) <br /><br /><br /> Suggested commit message: <br /><br /> Add drawing random integers with a Gaussian
or truncated exponential distributions to pgbench. <br /><br /> Test variants with these distributions are also
provided and triggered <br /> with options "--gaussian=..." and "--exponential=...". <br /><br /><br /> Have a nice
day/night, <br /><br /><br /><fieldset class="mimeAttachmentHeader"></fieldset><br /><pre wrap="">
 
</pre></blockquote> I would suggest that probabilities should NEVER be expressed as percentages! Expressed as a percentage, a
probability looks weird, and that form is never used for serious statistical work - in my experience at least.<br /><br /> I think
probabilities should be expressed in the range 0 ... 1 - i.e. 0.35 rather than 35%.<br /><br /><br /> Cheers,<br />
Gavin<br />
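
To make the two notations concrete, here is a minimal Python sketch (not code from the patch) that recomputes the quoted figures both as fractions in the range 0 ... 1 and as percents. It assumes the truncated exponential distribution has density proportional to exp(-threshold * x) on [0, 1); that model, the threshold value 12.0, and the helper names `interval_probability`, `decile_probabilities` and `edge_probabilities` are assumptions made for illustration. Under that assumption a threshold of 12 appears to match the quoted "69.9% 21.0% 6.3% ..." line, and a threshold of 5 the earlier "39.6% 24.0% ..." line.

```python
import math


def interval_probability(threshold, lo, hi):
    """Probability mass of [lo, hi) within [0, 1) for a density
    proportional to exp(-threshold * x) (assumed model, see lead-in)."""
    norm = 1.0 - math.exp(-threshold)  # total unnormalised mass on [0, 1)
    return (math.exp(-threshold * lo) - math.exp(-threshold * hi)) / norm


def decile_probabilities(threshold):
    """Probabilities of the ten deciles, as fractions in the range 0 ... 1."""
    return [interval_probability(threshold, i / 10, (i + 1) / 10)
            for i in range(10)]


def edge_probabilities(threshold):
    """Probabilities of the first and the last percent of the range."""
    return (interval_probability(threshold, 0.0, 0.01),
            interval_probability(threshold, 0.99, 1.0))


if __name__ == "__main__":
    threshold = 12.0  # hypothetical value chosen to match the quoted output
    deciles = decile_probabilities(threshold)
    first, last = edge_probabilities(threshold)

    # Fractions in 0 ... 1, as suggested in the message above.
    print("decile probabilities:", " ".join(f"{p:.3f}" for p in deciles))
    print("first/last percent:  ", f"{first:.3f} {last:.3f}")

    # The same numbers in the percent notation used by the patch output.
    print("decile percents:     ", " ".join(f"{100 * p:.1f}%" for p in deciles))
    print("first/last percent:  ", f"{100 * first:.1f}% {100 * last:.1f}%")
```

Both notations carry the same information; the fraction form sums to 1 across the deciles, where the percent form sums to 100%.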
