Thread: [PATCH] pgbench --throttle (submission 4)

[PATCH] pgbench --throttle (submission 4)

From
Fabien COELHO
Date:
Minor changes with respect to the previous submission, so as to avoid running some 
stuff twice under some conditions. This is for reference to the next 
commit fest.

-- 
Fabien.

Re: [PATCH] pgbench --throttle (submission 4)

From
Michael Paquier
Date:
Hi,

It would be better to submit updated versions of a patch on the email thread
it is dedicated to, rather than creating a new thread, so that people can
easily follow the progress you are making.
Thanks,
--
Michael

Re: [PATCH] pgbench --throttle (submission 5)

From
Fabien COELHO
Date:
Simpler version of 'pgbench --throttle' which handles throttling at the 
beginning of the transaction instead of doing it at the end.

This is for reference to the next commitfest.

-- 
Fabien.

Re: [PATCH] pgbench --throttle (submission 6)

From
Fabien COELHO
Date:
New submission which puts the option help in alphabetical order, as 
per Peter Eisentraut f0ed3a8a99b052d2d5e0b6153a8907b90c486636

This is for reference to the next commitfest.

-- 
Fabien.

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
New submission for the next commit fest.

This new version also reports the average lag time, i.e. the delay between 
scheduled and actual transaction start times. This may help detect whether 
things went smoothly, or if at some point some delay was introduced because 
of the load and some catchup was done afterwards.

Question 1: should it report the maximum lag encountered?

Question 2: the next step would be to have the current lag shown under 
option --progress, but that would mean having a combined --throttle 
--progress patch submission, or maybe dependencies between patches.

-- 
Fabien.

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
On 6/1/13 5:00 AM, Fabien COELHO wrote:
> Question 1: should it report the maximum lag encountered?

I haven't found the lag measurement to be very useful yet, outside of
debugging the feature itself.  Accordingly I don't see a reason to add
even more statistics about the number outside of testing the code.  I'm
seeing some weird lag problems that this will be useful for though right
now, more on that a few places below.

> Question 2: the next step would be to have the current lag shown under
> option --progress, but that would mean having a combined --throttle
> --progress patch submission, or maybe dependencies between patches.

This is getting too far ahead.  Let's get the throttle part nailed down
before introducing even more moving parts into this.  I've attached an
updated patch that changes a few things around already.  I'm not done
with this yet and it needs some more review before commit, but it's not
too far away from being ready.

This feature works quite well.  On a system that will run at 25K TPS
without any limit, I did a run with 25 clients and a rate of 400/second,
aiming at 10,000 TPS, and that's what I got:

number of clients: 25
number of threads: 1
duration: 60 s
number of transactions actually processed: 599620
average transaction lag: 0.307 ms
tps = 9954.779317 (including connections establishing)
tps = 9964.947522 (excluding connections establishing)

I never thought of implementing the throttle like this before, but it
seems to work out well so far.  Check out tps.png to see the smoothness
of the TPS curve (the graphs came out of pgbench-tools).  There's a
little more play outside of the target than ideal for this case.  Maybe
it's worth tightening the Poisson curve a bit around its center?

The main implementation issue I haven't looked into yet is why things
can get weird at the end of the run.  See the latency.png graph attached
and you can see what I mean.

I didn't like the naming on this option or all of the ways you could
specify the delay.  None of those really added anything, since you can
get every other behavior by specifying a non-integer TPS.  And using the
word "throttle" inside the code is fine, but I didn't like exposing that
implementation detail more than it had to be.

What I did instead was think of this as a transaction rate target, which
makes the help a whole lot simpler:

   -R SPEC, --rate SPEC
                target rate per client in transactions per second

Made the documentation easier to write too.  I'm not quite done with
that yet, the docs wording in this updated patch could still be better.

I personally would like this better if --rate specified a *total* rate
across all clients.  However, there are examples of both types of
settings in the program already, so there's no one precedent for which
is right here.  -t is per-client and now -R is too; I'd prefer it to be
like -T instead.  It's not that important though, and the code is
cleaner as it's written right now.  Maybe this is better; I'm not sure.

I did some basic error handling checks on this and they seemed good, the
program rejects target rates of <=0.

On the topic of this weird latency spike issue, I did see that show up
in some of the results too.  Here's one where I tried to specify a rate
higher than the system can actually handle, 80000 TPS total on a
SELECT-only test:

$ pgbench -S -T 30 -c 8 -j 4 -R10000tps pgbench
starting vacuum...end.
transaction type: SELECT only
scaling factor: 100
query mode: simple
number of clients: 8
number of threads: 4
duration: 30 s
number of transactions actually processed: 761779
average transaction lag: 10298.380 ms
tps = 25392.312544 (including connections establishing)
tps = 25397.294583 (excluding connections establishing)

It was actually limited by the capabilities of the hardware, 25K TPS.
10298 ms of lag per transaction can't be right though.

Some general patch submission suggestions for you as a new contributor:

-When re-submitting something with improvements, it's a good idea to add
a version number to the patch so reviewers can tell them apart easily.
But there is no reason to change the subject line of the e-mail each
time.  I followed that standard here.  If you updated this again I would
name the file pgbench-throttle-v9.patch but keep the same e-mail subject.

-There were some extra carriage return characters in your last
submission.  Wasn't a problem this time, but if you can get rid of those
that makes for a better patch.

--
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com

Attachment

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Hello Greg,

Thanks for this very detailed review and the suggestions!

I'll submit a new patch

>> Question 1: should it report the maximum lag encountered?
>
> I haven't found the lag measurement to be very useful yet, outside of 
> debugging the feature itself.  Accordingly I don't see a reason to add even 
> more statistics about the number outside of testing the code.  I'm seeing 
> some weird lag problems that this will be useful for though right now, more 
> on that a few places below.

I'll explain below why it is really interesting to get this figure, and 
that it is not really available as precisely elsewhere.

>> Question 2: the next step would be to have the current lag shown under
>> option --progress, but that would mean having a combined --throttle
>> --progress patch submission, or maybe dependencies between patches.
>
> This is getting too far ahead.

Ok!

> Let's get the throttle part nailed down before introducing even more 
> moving parts into this.  I've attached an updated patch that changes a 
> few things around already.  I'm not done with this yet and it needs some 
> more review before commit, but it's not too far away from being ready.

Ok. I'll submit a new version by the end of the week.

> This feature works quite well.  On a system that will run at 25K TPS without 
> any limit, I did a run with 25 clients and a rate of 400/second, aiming at 
> 10,000 TPS, and that's what I got:
>
> number of clients: 25
> number of threads: 1
> duration: 60 s
> number of transactions actually processed: 599620
> average transaction lag: 0.307 ms
> tps = 9954.779317 (including connections establishing)
> tps = 9964.947522 (excluding connections establishing)
>
> I never thought of implementing the throttle like this before,

Stochastic processes are a little bit magic:-)

> but it seems to work out well so far.  Check out tps.png to see the 
> smoothness of the TPS curve (the graphs came out of pgbench-tools. 
> There's a little more play outside of the target than ideal for this 
> case.  Maybe it's worth tightening the Poisson curve a bit around its 
> center?

The point of a Poisson distribution is to model random events of the kind 
that are a little bit irregular, such as web requests or clients queuing 
at a taxi stand. I cannot really change the formula, but if you want to 
argue with Siméon Denis Poisson, his current address is the 19th section 
of the "Père Lachaise" cemetery in Paris:-)

More seriously, the only parameter that can be changed is the "1000000.0" 
which drives the granularity of the Poisson process. A smaller value would 
mean a smaller potential multiplier, that is, how far from the average time 
the schedule can go. This may come under "tightening", although it would 
depart from a "perfect" process and possibly be a little less 
"smooth"... for a given definition of "tight", "perfect" and "smooth":-)

> [...] What I did instead was think of this as a transaction rate target, 
> which makes the help a whole lot simpler:
>
>  -R SPEC, --rate SPEC
>               target rate per client in transactions per second

Ok, I'm fine with this name.

> Made the documentation easier to write too.  I'm not quite done with that 
> yet, the docs wording in this updated patch could still be better.

I'm not an English native speaker, any help is welcome here. I'll do my 
best.

> I personally would like this better if --rate specified a *total* rate across 
> all clients.

Ok, I can do that, with some reworking so that the stochastic process is 
shared by all threads instead of being within each client. This means adding 
a lock between threads to access some shared variables, which should not impact 
the test much. Another option is to have a per-thread stochastic process.

> However, there are examples of both types of settings in the 
> program already, so there's no one precedent for which is right here.  -t is 
> per-client and now -R is too; I'd prefer it to be like -T instead.  It's not 
> that important though, and the code is cleaner as it's written right now. 
> Maybe this is better; I'm not sure.

I like the idea of just one process instead of a per-client one. I did not 
try that at the beginning because the implementation is less straightforward.

> On the topic of this weird latency spike issue, I did see that show up in 
> some of the results too.

Your example illustrates *exactly* why the lag measure was added.

The Poisson process generates an ideal event line (that is, irregularly 
scheduled transaction start times targeting the expected tps) which 
induces a varying load that the database is trying to handle.

If a transaction cannot start right away, it is deferred with respect to 
its scheduled start time. The measured lag reports exactly that: the 
clients cannot keep up with the load. There may be some catch-up later, 
that is, the clients come back in line with the scheduled transactions.

I need to put this measure here because the "scheduled time" is only 
known to pgbench and not available elsewhere. The max would really be more 
interesting than the mean, so as to catch that something was temporarily 
amiss, even if things went back to nominal later.
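
In sketch form (hypothetical names), the measure is just the distance 
behind the ideal event line:

  #include <stdint.h>
  #include <sys/time.h>

  static int64_t
  now_us(void)
  {
      struct timeval tv;

      gettimeofday(&tv, NULL);
      return (int64_t) tv.tv_sec * 1000000 + tv.tv_usec;
  }

  /* Advance the ideal event line by one Poisson delay and return how far
   * behind schedule this transaction starts; 0 means the client could
   * sleep until its slot. */
  static int64_t
  throttle_lag_us(int64_t *scheduled_us, int64_t delay_us)
  {
      int64_t now = now_us();

      *scheduled_us += delay_us;
      return (now > *scheduled_us) ? now - *scheduled_us : 0;
  }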

> Here's one where I tried to specify a rate higher 
> than the system can actually handle, 80000 TPS total on a SELECT-only test
>
> $ pgbench -S -T 30 -c 8 -j 4 -R10000tps pgbench
> starting vacuum...end.
> transaction type: SELECT only
> scaling factor: 100
> query mode: simple
> number of clients: 8
> number of threads: 4
> duration: 30 s
> number of transactions actually processed: 761779
> average transaction lag: 10298.380 ms

The interpretation is the following: as the database cannot handle the 
load, transactions were processed on average 10 seconds behind their 
scheduled transaction time. You had on average a 10 second latency to 
answer "incoming" requests. Also some transactions where implicitely not 
even scheduled, so the situation is worse than that...

> tps = 25392.312544 (including connections establishing)
> tps = 25397.294583 (excluding connections establishing)
>
> It was actually limited by the capabilities of the hardware, 25K TPS. 10298 
> ms of lag per transaction can't be right though.
>
> Some general patch submission suggestions for you as a new contributor:

Hmmm, I did a few things such as "pgxs" back in 2004, so maybe "not very 
active" is a better description than "new":-)

> -When re-submitting something with improvements, it's a good idea to add a 
> version number to the patch so reviewers can tell them apart easily. But 
> there is no reason to change the subject line of the e-mail each time.  I 
> followed that standard here.  If you updated this again I would name the file 
> pgbench-throttle-v9.patch but keep the same e-mail subject.

Ok.

> -There were some extra carriage return characters in your last submission. 
> Wasn't a problem this time, but if you can get rid of those that makes for a 
> better patch.

Ok.

-- 
Fabien.

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Here is submission v9 based on your v8 version.
 - the tps is global, with a mutex to share the global stochastic process
 - there is an adaptation for the "fork" emulation
 - I do not know whether this works with Win32 pthread stuff.
 - reduced multiplier ln(1000000) -> ln(1000)
 - avg & max throttling lag are reported
 

> There's a little more play outside of the target than ideal for this 
> case.  Maybe it's worth tightening the Poisson curve a bit around its 
> center?

A stochastic process moves around the target value, but is not right on 
it.

-- 
Fabien.

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
On 6/10/13 6:02 PM, Fabien COELHO wrote:
>   - the tps is global, with a mutex to share the global stochastic process
>   - there is an adaptation for the "fork" emulation
>   - I do not know whether this works with Win32 pthread stuff.

Instead of this complexity, can we just split the TPS input per client? 
That's all I was thinking of here, not adding a new set of threading 
issues.  If 10000 TPS is requested and there's 10 clients, just set the 
delay so that each of them targets 1000 TPS.
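
In other words, no shared state is needed; a hypothetical helper (not 
patch code) would just be:

  /* Average inter-transaction delay per client when a total rate is
   * split evenly across clients. */
  static double
  per_client_delay_us(double total_tps, int nclients)
  {
      return 1000000.0 / (total_tps / nclients); /* 10000 tps, 10 clients -> 1000 us */
  }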

I'm guessing it's more accurate to have them all communicate as you've 
done here, but it seems like a whole class of new bugs and potential 
bottlenecks could come out of that.  Whenever someone touches the 
threading model for pgbench it usually gives a stack of build farm 
headaches.  Better to avoid those unless there's really a compelling 
reason to go through that.

-- 
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
>>   - the tps is global, with a mutex to share the global stochastic process
>>   - there is an adaptation for the "fork" emulation
>>   - I do not know whether this works with Win32 pthread stuff.
>
> Instead of this complexity,

Well, the mutex impact is very localized in the code. The complexity is 
more linked to the three "thread" implementations intermixed there.

> can we just split the TPS input per client?

Obviously it is possible. Note that it is more logical to do that per 
thread. I did one shared stochastic process because it makes more sense to 
have just one.

> That's all I was thinking of here, not adding a new set of threading issues. 
> If 10000 TPS is requested and there's 10 clients, just set the delay so that 
> each of them targets 1000 TPS.

Ok, so I understand that a mutex is too much!

> I'm guessing it's more accurate to have them all communicate as you've done 
> here, but it seems like a whole class of new bugs and potential bottlenecks 
> could come out of that.

I do not think that there is a performance or locking contention issue: it 
is about taking a mutex around a section which performs one integer add and 
two integer assignments, about 3 instructions, to be compared with the task 
of performing database operations over the network. There are several 
orders of magnitude between those tasks. It would take a truly terrible 
mutex implementation to have any significant impact.
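
For what it is worth, the whole critical section is along these lines 
(sketch, hypothetical names):

  #include <pthread.h>
  #include <stdint.h>

  static pthread_mutex_t trigger_lock = PTHREAD_MUTEX_INITIALIZER;
  static int64_t throttle_trigger;   /* next scheduled start time, in us */

  /* All threads advance the single shared event line under the mutex:
   * one add and two assignments. */
  static int64_t
  next_scheduled_start(int64_t delay_us)
  {
      int64_t t;

      pthread_mutex_lock(&trigger_lock);
      throttle_trigger += delay_us;
      t = throttle_trigger;
      pthread_mutex_unlock(&trigger_lock);
      return t;
  }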

> Whenever someone touches the threading model for pgbench it usually 
> gives a stack of build farm headaches.  Better to avoid those unless 
> there's really a compelling reason to go through that.

I agree that the threading model in pgbench is a mess, mostly because of 
the 3 concurrent implementations intermixed in the code. Getting rid of 
the fork emulation and the win32 special handling, keeping only the pthread 
implementation, which seems to be available even on Windows, would be a 
relief. I'm not sure whether there is still a rationale for these 3 
implementations, but they guarantee a maintenance mess:-(

I'll submit a version without the mutex, but ISTM that this one is 
conceptually cleaner, although I'm not sure what happens on Windows.

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Submission 10:

 - per-thread throttling instead of a global process with a mutex.
   This avoids the mutex, and the process is shared between the clients
   of a given thread.

 - ISTM that the "thread start time" should be initialized at the
   beginning of threadRun instead of in the loop *before* thread creation,
   otherwise the thread creation delays are incorporated in the
   performance measure, but ISTM that the point of pgbench is not to
   measure thread creation performance...

   I've added a comment suggesting where it should be put instead,
   first thing in threadRun.

-- 
Fabien.

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
On 6/11/13 4:11 PM, Fabien COELHO wrote:
>   - ISTM that the "thread start time" should be initialized at the
>     beginning of threadRun instead of in the loop *before* thread creation,
>     otherwise the thread creation delays are incorporated in the
>     performance measure, but ISTM that the point of pgbench is not to
>     measure thread creation performance...

I noticed that, but it seemed a pretty minor issue.  Did you look at the 
giant latency spikes at the end of the test run I submitted the graph 
for?  I wanted to nail down what was causing those before worrying about 
the startup timing.

-- 
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
>>   - ISTM that the "thread start time" should be initialized at the
>>     beginning of threadRun instead of in the loop *before* thread creation,
>>     otherwise the thread creation delays are incorporated in the
>>     performance measure, but ISTM that the point of pgbench is not to
>>     measure thread creation performance...
>
> I noticed that, but it seemed a pretty minor issue.

Not for me, because the "max lag" measured in my first version was really 
the thread creation time, which is not very interesting.

> Did you look at the giant latency spikes at the end of the test run I 
> submitted the graph for?

I've looked at the graph you sent. I must admit that I did not understand 
exactly what is measured and where. Because of its position at the end of 
the run, I thought of some disconnection-related effects when the pgbench 
run is interrupted by a timeout signal, so some things are done more 
slowly. Fine with me: we are stopping anyway, and out of the steady state.

> I wanted to nail down what was causing those before worrying about the 
> startup timing.

Well, the short answer is that I'm not worried by that, for the reason 
explained above. I would be worried if it was anywhere else but where the 
transactions are interrupted, the connections are closed and the threads 
are stopped. I may be wrong:-)

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
> Did you look at the giant latency spikes at the end of the test run I 
> submitted the graph for?  I wanted to nail down what was causing those 
> before worrying about the startup timing.

If you are still worried: if you run the very same command without 
throttling and measure the same latency, does the same thing happen at 
the end? My guess is that it should be "yes". If it is "no", I'll try out 
pgbench-tools.

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
On 6/12/13 3:19 AM, Fabien COELHO wrote:
> If you are still worried: if you run the very same command without
> throttling and measure the same latency, does the same thing happen at
> the end? My guess is that it should be "yes". If it is "no", I'll try out
> pgbench-tools.

It looks like it happens rarely for one client without the rate limit, 
but that increases to every time for multiple clients with limiting in 
place.  pgbench-tools just graphs the output from the latency log. 
Here's a setup that runs the test I'm doing:

$ createdb pgbench
$ pgbench -i -s 10 pgbench
$ pgbench -S -c 25 -T 30 -l pgbench && tail -n 40 pgbench_log*

Sometimes there are no slow entries, but I've seen this once so far:

0 21822 1801 0 1371217462 945264
1 21483 1796 0 1371217462 945300
8 20891 1931 0 1371217462 945335
14 20520 2084 0 1371217462 945374
15 20517 1991 0 1371217462 945410
16 20393 1928 0 1371217462 945444
17 20183 2000 0 1371217462 945479
18 20277 2209 0 1371217462 945514
23 20316 2114 0 1371217462 945549
22 20267 250128 0 1371217463 193656

The third column is the latency for that transaction.  Notice how it's a 
steady ~2000 us except for the very last transaction, which takes 
250,128 us.  That's the weird thing; these short SELECT statements 
should never take that long.  It suggests there's something weird 
happening with how the client exits, probably that its latency number is 
being computed after more work than it should.

Here's what a rate limited run looks like for me.  Note that I'm still 
using the version I re-submitted since that's where I ran into this 
issue, I haven't merged your changes to split the rate among each client 
here--which means this is 400 TPS per client == 10000 TPS total:

$ pgbench -S -c 25 -T 30 -R 400 -l pgbench && tail -n 40 pgbench_log

7 12049 2070 0 1371217859 195994
22 12064 2228 0 1371217859 196115
18 11957 1570 0 1371217859 196243
23 12130 989 0 1371217859 196374
8 11922 1598 0 1371217859 196646
11 12229 4833 0 1371217859 196702
21 11981 1943 0 1371217859 196754
20 11930 1026 0 1371217859 196799
14 11990 13119 0 1371217859 208014        ^^^ fast section
                                          vvv delayed section
1 11982 91926 0 1371217859 287862
2 12033 116601 0 1371217859 308644
6 12195 115957 0 1371217859 308735
17 12130 114375 0 1371217859 308776
0 12026 115507 0 1371217859 308822
3 11948 118228 0 1371217859 308859
4 12061 113484 0 1371217859 308897
5 12110 113586 0 1371217859 308933
9 12032 117744 0 1371217859 308969
10 12045 114626 0 1371217859 308989
12 11953 113372 0 1371217859 309030
13 11883 114405 0 1371217859 309066
15 12018 116069 0 1371217859 309101
16 11890 115727 0 1371217859 309137
19 12140 114006 0 1371217859 309177
24 11884 115782 0 1371217859 309212

There's almost 90,000 usec of latency showing up between epoch time 
1371217859.208014 and 1371217859.287862 here.  What's weird about it is 
that the longer the test runs, the larger the gap is.  If collecting the 
latency data itself caused the problem, that would make sense, so maybe 
this is related to flushing that out to disk.

If you want to look just at the latency numbers without the other 
columns in the way you can use:

cat pgbench_log.* | awk '{print $3}'

That is how I was evaluating the smoothness of the rate limit, by 
graphing those latency values.  pgbench-tools takes those and a derived 
TPS/s number and plots them, which made it easier for me to spot this 
weirdness.  But I've already moved on to analyzing the raw latency data 
instead; I can see the issue without the graph once I've duplicated the 
conditions.

-- 
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
I don't have this resolved yet, but I think I've identified the cause. 
Updating here mainly so Fabien doesn't duplicate my work trying to track 
this down.  I'm going to keep banging at this until it's resolved now 
that I got this far.

Here's a slow transaction:

1371226017.568515 client 1 executing \set naccounts 100000 * :scale
1371226017.568537 client 1 throttling 6191 us
1371226017.747858 client 1 executing \setrandom aid 1 :naccounts
1371226017.747872 client 1 sending SELECT abalance FROM pgbench_accounts 
WHERE aid = 268721;
1371226017.789816 client 1 receiving

That confirms it is getting stuck at the "throttling" step.  Looks like 
the code pauses there because it's trying to overload the "sleeping" 
state that was already in pgbench, but handles it in a special way inside 
of doCustom(), and that doesn't always work.

The problem is that pgbench doesn't always stay inside doCustom when a 
client sleeps.  It exits there to poll for incoming messages from the 
other clients, via select() on a shared socket.  It's not safe to assume 
doCustom will be running regularly; that's only true if clients keep 
returning messages.

So as long as other clients keep banging on the shared socket, doCustom 
is called regularly, and everything works as expected.  But at the end 
of the test run that happens less often, and that's when the problem 
shows up.

pgbench already has a "\sleep" command, and the way that delay is 
handled happens inside threadRun() instead.  The pausing of the rate 
limit throttle needs to operate in the same place.  I have to redo a few 
things to confirm this actually fixes the issue, as well as look at 
Fabien's later updates to this since I wandered off debugging.  I'm sure 
it's in the area of code I'm poking at now though.

-- 
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
> pgbench already has a "\sleep" command, and the way that delay is 
> handled happens inside threadRun() instead.  The pausing of the rate 
> limit throttle needs to operate in the same place.

It does operate at the same place. The throttling is performed by 
inserting a "sleep" first thing when starting a new transaction. So if 
there is a weirdness, it should show up as well without throttling but with a 
fixed \sleep instead?

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Hello Greg,

I think that the weirdness really comes from the way transactions times 
are measured, their interactions with throttling, and latent bugs in the 
code.

One issue is that the throttling time was included in the measure, except 
the first time, because "txn_begin" is not set at the beginning of 
doCustom.

Also, flag st->listen is set to 1 but *never* set back to 0...

  sh> grep listen pgbench.c
         int      listen;
         if (st->listen)
                 st->listen = 1;
                 st->listen = 1;
                 st->listen = 1;
                 st->listen = 1;
                 st->listen = 1;
                 st->listen = 1;

ISTM that I can fix the "weirdness" by inserting an ugly "goto top;", but 
I would feel better about it by removing all gotos and reordering some 
actions in doCustom in a more logical way. However that would be a bigger 
patch.

Please find attached 2 patches:

 - the first is the full throttle patch, which ensures that txn_begin is
   taken at a consistent point, after throttling, which requires
   resetting "listen". There is an ugly goto. I've also put times in a
   consistent format in the log, "789.012345" instead of "789 12345".

 - the second patch just shows the diff between v10 and the first one.

-- 
Fabien.

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
On 6/14/13 3:50 PM, Fabien COELHO wrote:
> I think that the weirdness really comes from the way transactions times
> are measured, their interactions with throttling, and latent bugs in the
> code.

Measurement times, no; interactions with throttling, no.  If it was 
either of those I'd have finished this off days ago.  Latent bugs, 
possibly.  We may discover there's nothing wrong with your code at the 
end here, and that it just makes hitting this bug more likely. 
Unfortunately today is the day *some* bug is popping up, and I want to 
get it squashed before I'll be happy.

The lag is actually happening during a kernel call that isn't working as 
expected.  I'm not sure whether this bug was there all along if \sleep 
was used, or if it's specific to the throttle sleep.
> Also, flag st->listen is set to 1 but *never* set back to 0...

I noticed that st->listen was weird too, and that's on my short list of 
suspicious things I haven't figured out yet.

I added a bunch more logging as pgbench steps through its work to track 
down where it gets stuck.  Until the end all transactions look like this:

1371238832.084783 client 10 throttle lag 2 us
1371238832.084783 client 10 executing \setrandom aid 1 :naccounts
1371238832.084803 client 10 sending SELECT abalance FROM 
pgbench_accounts WHERE aid = 753099;
1371238832.084840 calling select
1371238832.086539 client 10 receiving
1371238832.086539 client 10 finished

All clients who hit lag spikes at the end are going through this 
sequence instead:

1371238832.085912 client 13 throttle lag 790 us
1371238832.085912 client 13 executing \setrandom aid 1 :naccounts
1371238832.085931 client 13 sending SELECT abalance FROM 
pgbench_accounts WHERE aid = 564894;
1371238832.086592 client 13 receiving
1371238832.086662 calling select
1371238832.235543 client 13 receiving
1371238832.235543 client 13 finished

Note the "calling select" here that wasn't in the normal length 
transaction before it.  The client is receiving something here, but 
rather than it finishing the transaction it falls through and ends up at 
the select() system call outside of doCustom.  All of the clients that 
are sleeping when the system slips into one of these long select() calls 
are getting stuck behind it.  I'm not 100% sure, but I think this only 
happens when all remaining clients are sleeping.

Here's another one, it hits the receive that doesn't finish the 
transaction earlier (1371238832.086587) but then falls into the same 
select() call at 1371238832.086662:

1371238832.085884 client 12 throttle lag 799 us
1371238832.085884 client 12 executing \setrandom aid 1 :naccounts
1371238832.085903 client 12 sending SELECT abalance FROM 
pgbench_accounts WHERE aid = 299080;
1371238832.086587 client 12 receiving
1371238832.086662 calling select
1371238832.231032 client 12 receiving
1371238832.231032 client 12 finished

Investigation is still going here...

-- 
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
>> I think that the weirdness really comes from the way transactions times
>> are measured, their interactions with throttling, and latent bugs in the
>> code.
>
> measurement times, no; interactions with throttling, no.  If it was either of 
> those I'd have finished this off days ago.  Latent bugs, possibly.  We may 
> discover there's nothing wrong with your code at the end here,

To summarize my point: I think my v10 code does not take into account all 
of the strangeness in doCustom, and I'm pretty sure that there is no point 
in including throttle sleeps in the latency measures, which was more or 
less what happened. So it is somehow a "bug" which only shows up if you 
look at the latency measures, but the tps figures are fine.

> that it just makes hitting this bug more likely. Unfortunately today is 
> the day *some* bug is popping up, and I want to get it squashed before 
> I'll be happy.
>
> The lag is actually happening during a kernel call that isn't working as 
> expected.  I'm not sure whether this bug was there all along if \sleep was 
> used, or if it's specific to the throttle sleep.

The throttle sleep is inserted outside of the state machine. That is why in 
the "test" patch I added a goto to ensure that it is always taken at the 
right time, that is when state==0 and before txn_begin is set, and not 
possibly between other states when doCustom happens to be re-entered after 
a return.
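
In sketch form (hypothetical field names, reusing now_us and 
poisson_delay_us from the earlier sketches), the guard the goto enforces 
is:

  #include <stdbool.h>
  #include <stdint.h>

  typedef struct
  {
      int     state;        /* script command index, 0 = transaction start */
      int     sleeping;
      bool    is_throttled; /* one throttle sleep per transaction */
      int64_t until;        /* wakeup time when sleeping, in us */
  } ClientSketch;

  /* Returns true when a throttle sleep was injected; the caller then
   * goes back to the polling loop instead of running commands. */
  static bool
  maybe_throttle(ClientSketch *st, double center_us)
  {
      if (center_us <= 0 || st->state != 0 || st->is_throttled)
          return false;
      st->until = now_us() + poisson_delay_us(center_us);
      st->sleeping = 1;
      st->is_throttled = true;  /* cleared when the transaction completes */
      return true;
  }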

> I added a bunch more logging as pgbench steps through its work to track down 
> where it's stuck at.  Until the end all transactions look like this:
>
> 1371238832.084783 client 10 throttle lag 2 us
> 1371238832.084783 client 10 executing \setrandom aid 1 :naccounts
> 1371238832.084803 client 10 sending SELECT abalance FROM pgbench_accounts 
> WHERE aid = 753099;
> 1371238832.084840 calling select
> 1371238832.086539 client 10 receiving
> 1371238832.086539 client 10 finished
>
> All clients who hit lag spikes at the end are going through this sequence
> instead:
>
> 1371238832.085912 client 13 throttle lag 790 us
> 1371238832.085912 client 13 executing \setrandom aid 1 :naccounts
> 1371238832.085931 client 13 sending SELECT abalance FROM pgbench_accounts 
> WHERE aid = 564894;
> 1371238832.086592 client 13 receiving
> 1371238832.086662 calling select
> 1371238832.235543 client 13 receiving
> 1371238832.235543 client 13 finished

> Note the "calling select" here that wasn't in the normal length transaction 
> before it.  The client is receiving something here, but rather than it 
> finishing the transaction it falls through and ends up at the select() system 
> call outside of doCustom.  All of the clients that are sleeping when the 
> system slips into one of these long select() calls are getting stuck behind 
> it.  I'm not 100% sure, but I think this only happens when all remaining 
> clients are sleeping.

Note: in both the slow cases there is a "receiving" between "sending" and 
"select". This suggests that the "goto top" at the very end of doCustom is 
followed in one case but not the other.

ISTM that there is a timeout passed to select which is computed based on 
the current sleeping time of each client. I'm pretty sure that is not a 
well-tested path...

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Hello Greg,

I've done some more testing with the "test" patch. I have not seen any 
spike at the end of the throttled run.

The attached version 11 patch does ensure that throttle added sleeps are 
not included in latency measures (-r) and that throttling is performed 
right at the beginning of a transaction. There is an ugly goto to do that.

I think there is still a latent bug in the code with listen, which should 
be set back to 0 in other places, but this bug was already there.

-- 
Fabien.

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
I'm still getting the same sort of pauses waiting for input with your
v11.  This is a pretty frustrating problem; I've spent about two days so
far trying to narrow down how it happens.  I've attached the test
program I'm using.  It seems related to my picking a throttled rate
that's close to (but below) the maximum possible on the system.  I'm
using 10,000 on a system that can do about 16,000 TPS when running
pgbench in debug mode.

This problem is 100% reproducible here; happens every time.  This is a
laptop running Mac OS X.  It's possible the problem is specific to that
platform.  I'm doing all my tests with the database itself set up for
development, with debug and assertions on.  The lag spikes seem smaller
without assertions on, but they are still there.

Here's a sample:

transaction type: SELECT only
scaling factor: 10
query mode: simple
number of clients: 25
number of threads: 1
duration: 30 s
number of transactions actually processed: 301921
average transaction lag: 1.133 ms (max 137.683 ms)
tps = 10011.527543 (including connections establishing)
tps = 10027.834189 (excluding connections establishing)

And those slow ones are all at the end of the latency log file, as shown
in column 3 here:

22 11953 3369 0 1371578126 954881
23 11926 3370 0 1371578126 954918
3 12238 30310 0 1371578126 984634
7 12205 30350 0 1371578126 984742
8 12207 30359 0 1371578126 984792
11 12176 30325 0 1371578126 984837
13 12074 30292 0 1371578126 984882
0 12288 175452 0 1371578127 126340
9 12194 171948 0 1371578127 126421
12 12139 171915 0 1371578127 126466
24 11876 175657 0 1371578127 126507

Note that there are two long pauses here, something that happens maybe half of
the time.  There's a 27 ms pause and then a 145 ms one.

The exact sequence for when the pauses show up is:

-All remaining clients have sent their SELECT statement and are waiting
for a response.  They are not sleeping, they're waiting for the server
to return a query result.
-A client receives part of the data it wants, but there is still data
left.  This is the path through pgbench.c where the "if
(PQisBusy(st->con))" test is true after receiving some information.  I
hacked up some logging that distinguishes those as a "client %d partial
receive" to make this visible.
-A select() call is made with no timeout, so it can only be satisfied by
more data being returned.
-Around 100 ms (can be less, can be more) goes by before that select()
returns more data to the client, where normally it would happen in ~2ms.

You were concerned about a possible bug in the timeout code.  If you
hack up the select statement to show some state information, the setup
for the pauses at the end always looks like this:

calling select min_usec=9223372036854775807 sleeping=0

When no one is sleeping, the timeout becomes infinite, so only returning
data will break it.  This is intended behavior though.  This exact same
sequence and select() call parameters happen in pgbench code every time
at the end of a run.  But clients without the throttling patch applied
never seem to get into the state where they pause for so long, waiting
for one of the active clients to receive the next bit of result.
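
To spell out the logic I'm describing, the timeout computation behaves 
like this sketch (assumed client shape, not the actual pgbench code):

  #include <stdint.h>
  #include <sys/select.h>
  #include <sys/time.h>

  typedef struct { int sleeping; int64_t until; } Client; /* assumed shape */

  /* Returns NULL, i.e. "block forever", when no client is sleeping: the
   * "select with no timeout sleeping=0" state shown above. */
  static struct timeval *
  compute_timeout(Client *clients, int nclients, int64_t now,
                  struct timeval *tv)
  {
      int64_t min_usec = INT64_MAX;

      for (int i = 0; i < nclients; i++)
          if (clients[i].sleeping)
          {
              int64_t wake = clients[i].until - now;

              if (wake < 0)
                  wake = 0;
              if (wake < min_usec)
                  min_usec = wake;
          }
      if (min_usec == INT64_MAX)
          return NULL;          /* only returning socket data can wake us */
      tv->tv_sec = min_usec / 1000000;
      tv->tv_usec = min_usec % 1000000;
      return tv;
  }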

I don't think the st->listen related code has anything to do with this
either.  That flag is only used to track when clients have completed
sending their first query over to the server.  Once they reach that
point, afterward they should be "listening" for results each time they
exit the doCustom() code.  st->listen goes to 1 very soon after startup
and then it stays there, and that logic seems fine too.

--
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com

Attachment

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
> I'm still getting the same sort of pauses waiting for input with your v11.

Alas.

> This is a pretty frustrating problem; I've spent about two days so far trying 
> to narrow down how it happens.  I've attached the test program I'm using.  It 
> seems related to my picking a throttled rate that's close to (but below) the 
> maximum possible on your system.  I'm using 10,000 on a system that can do 
> about 16,000 TPS when running pgbench in debug mode.
>
> This problem is 100% reproducible here; happens every time.  This is a laptop 
> running Mac OS X.  It's possible the problem is specific to that platform. 
> I'm doing all my tests with the database itself setup for development, with 
> debug and assertions on.  The lag spikes seem smaller without assertions on, 
> but they are still there.
>
> Here's a sample:
>
> transaction type: SELECT only

What is this test script? I'm using pgbench for my tests.

> scaling factor: 10
> query mode: simple
> number of clients: 25
> number of threads: 1
> duration: 30 s
> number of transactions actually processed: 301921
> average transaction lag: 1.133 ms (max 137.683 ms)
> tps = 10011.527543 (including connections establishing)
> tps = 10027.834189 (excluding connections establishing)
>
> And those slow ones are all at the end of the latency log file, as shown in 
> column 3 here:
>
> 22 11953 3369 0 1371578126 954881
> 23 11926 3370 0 1371578126 954918
> 3 12238 30310 0 1371578126 984634
> 7 12205 30350 0 1371578126 984742
> 8 12207 30359 0 1371578126 984792
> 11 12176 30325 0 1371578126 984837
> 13 12074 30292 0 1371578126 984882
> 0 12288 175452 0 1371578127 126340
> 9 12194 171948 0 1371578127 126421
> 12 12139 171915 0 1371578127 126466
> 24 11876 175657 0 1371578127 126507

Indeed, there are two spikes, but not all clients are concerned.

As I have not seen that, debugging is hard. I'll give it a try 
tomorrow.

> When no one is sleeping, the timeout becomes infinite, so only returning data 
> will break it.  This is intended behavior though.

This is not coherent. Under --throttle there should basically always be 
someone asleep, unless the server cannot cope with the load and *all* 
transactions are late. A no-timeout state looks pretty unrealistic, 
because it means that there is no throttling.

> I don't think the st->listen related code has anything to do with this 
> either.  That flag is only used to track when clients have completed sending 
> their first query over to the server.  Once they reach that point, 
> afterward they should be "listening" for results each time they exit the 
> doCustom() code.

This assumption seems false if you can have a "sleep" at the beginning of 
the sequence, which is what throttling does, but which can be done by any 
custom script: the client is expected to wait before sending any 
command, so there can be no query underway in that case.

So listen should be set to 1 when a query has been sent, and set back to 0 
when the result data have all been received.
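
In other words, the invariant would be along these lines (sketch, 
assuming only the con and listen fields of pgbench's CState):

  #include <libpq-fe.h>
  #include <stdbool.h>

  typedef struct { PGconn *con; int listen; } CStateSketch; /* stand-in */

  /* Set listen when a query goes out... */
  static bool
  send_query(CStateSketch *st, const char *sql)
  {
      if (PQsendQuery(st->con, sql) == 0)
          return false;
      st->listen = 1;           /* a result is now expected on the socket */
      return true;
  }

  /* ...and clear it only once the result has been fully consumed. */
  static void
  consume_result(CStateSketch *st)
  {
      PGresult *res;

      while ((res = PQgetResult(st->con)) != NULL)
          PQclear(res);
      st->listen = 0;           /* nothing in flight any more */
  }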

"doCustom" makes implicit assumptions about what is going on, whereas it 
should focus on looking at the incoming state, performing operations, and 
leaving with a state which corresponds to the actual status, without 
assumptions about what is going to happen next.

> st->listen goes to 1 very soon after startup and then it stays there, 
> and that logic seems fine too.

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
> number of transactions actually processed: 301921

Just a thought before spending too much time on this subtle issue.

The patch worked reasonably for 301900 transactions in your run above, 
and the last few, fewer than the number of clients, show strange latency 
figures, which suggests that something is amiss in some corner case when 
pgbench is stopping. However, the point of pgbench is to test a steady 
state, not to achieve the cleanest stop at the end of a run.

So my question is: should this issue be a blocker for the feature?

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Robert Haas
Date:
On Wed, Jun 19, 2013 at 2:42 PM, Fabien COELHO <coelho@cri.ensmp.fr> wrote:
>> number of transactions actually processed: 301921
> Just a thought before spending too much time on this subtle issue.
>
> The patch worked reasonably for 301900 transactions in your run above, and
> the last few, fewer than the number of clients, show strange latency
> figures, which suggests that something is amiss in some corner case when
> pgbench is stopping. However, the point of pgbench is to test a steady
> state, not to achieve the cleanest stop at the end of a run.
>
> So my question is: should this issue be a blocker for the feature?

I think so.  If it doesn't get fixed now, it's not likely to get fixed
later.  And the fact that nobody understands why it's happening is
kinda worrisome...

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Dear Robert and Greg,

> I think so.  If it doesn't get fixed now, it's not likely to get fixed
> later.  And the fact that nobody understands why it's happening is
> kinda worrisome...

Possibly, but I think that it is not my fault:-)

So, I spent some time at performance debugging...

First, I finally succeeded in reproducing Greg Smith's spikes on my Ubuntu 
laptop. It takes short transactions and many clients per thread for it to 
show as a spike. With "pgbench" standard transactions and not too many 
clients per thread it is more of a little bump, or even not there at all.

After some poking around, and pursuing various red herrings, I resorted to 
measuring the delay of calling "PQfinish()", which is really the only 
special thing going on at the end of a pgbench run...

BINGO!

In his tests Greg is using one thread. Once the overall timer is exceeded, 
clients start closing their connections by calling PQfinish once their 
transactions are done. This call takes between a few us and a few ... ms. 
So if some client's closing time hits the bad figures, the transactions of 
other clients are artificially delayed by that time, and they seem to have 
a high latency, but it is really because the thread is inside another 
client's PQfinish and is not available to process the data. If you have 
one thread per client, no problem, especially as the final PQfinish() time 
is not measured at all by pgbench:-) Also, the more clients, the higher 
the spike, because more of them are being stopped and may hit the bad figures.

Here is a trace, with the simple SELECT transaction.

  sh> ./pgbench --rate=14000 -T 10 -r -l -c 30 -S bench
  ...
  sh> less pgbench_log.*
  # short transactions, about 0.250 ms ...
  20 4849 241 0 1371916583.455400
  21 4844 256 0 1371916583.455470
  22 4832 348 0 1371916583.455569
  23 4829 218 0 1371916583.455627

  ** TIMER EXCEEDED **

  25 4722 390 0 1371916583.455820
  25 done in 1560 [1,2]              # BING, 1560 us for PQfinish()
  26 4557 1969 0 1371916583.457407
  26 done in 21 [1,2]
  27 4372 1969 0 1371916583.457447
  27 done in 19 [1,2]
  28 4009 1910 0 1371916583.457486
  28 done in 1445 [1,2]              # BOUM
  2 interrupted in 1300 [0,0]        # BANG
  3 interrupted in 15 [0,0]
  4 interrupted in 40 [0,0]
  5 interrupted in 203 [0,0]         # boum?
  6 interrupted in 1352 [0,0]        # ZIP
  7 interrupted in 18 [0,0]
  ...
  23 interrupted in 12 [0,0]

  ## the cumulated PQfinish() time above is about 6 ms which
  ## appears as an apparent latency for the next clients:
  0 4880 6521 0 1371916583.462157
  0 done in 9 [1,2]
  1 4877 6397 0 1371916583.462194
  1 done in 9 [1,2]
  24 4807 6796 0 1371916583.462217
  24 done in 9 [1,2]
  ...
 

Note that the bad measures also appear when there is no throttling:

  sh> ./pgbench -T 10 -r -l -c 30 -S bench
  sh> grep 'done.*[0-9][0-9][0-9]' pgbench_log.*
   0 done in 1974 [1,2]
   1 done in 312 [1,2]
   3 done in 2159 [1,2]
   7 done in 409 [1,2]
  11 done in 393 [1,2]
  12 done in 2212 [1,2]
  13 done in 1458 [1,2]
  ## other clients execute PQfinish in less than 100 us

This "done" is issued by my instrumented version of clientDone().

The issue also appears if I instrument pgbench from master, without 
anything from the throttling patch at all:

  sh> git diff master
  diff --git a/contrib/pgbench/pgbench.c b/contrib/pgbench/pgbench.c
  index 1303217..7c5ea81 100644
  --- a/contrib/pgbench/pgbench.c
  +++ b/contrib/pgbench/pgbench.c
  @@ -869,7 +869,15 @@ clientDone(CState *st, bool ok)

           if (st->con != NULL)
           {
  +                instr_time now;
  +                int64 s0, s1;
  +                INSTR_TIME_SET_CURRENT(now);
  +                s0 = INSTR_TIME_GET_MICROSEC(now);
                   PQfinish(st->con);
  +                INSTR_TIME_SET_CURRENT(now);
  +                s1 = INSTR_TIME_GET_MICROSEC(now);
  +                fprintf(stderr, "%d done in %ld [%d,%d]\n",
  +                                st->id, s1-s0, st->listen, st->state);
                   st->con = NULL;
           }
           return false;                           /* always false */

  sh> ./pgbench -T 10 -r -l -c 30 -S bench 2> x.err
  sh> grep 'done.*[0-9][0-9][0-9]' x.err
  14 done in 1985 [1,2]
  16 done in 276 [1,2]
  17 done in 1418 [1,2]

So my considered conclusion is that the issue is somewhere within 
PQfinish(), possibly in interaction with what pgbench does, but is *NOT* 
related in any way to the throttling patch, as it predates it. Greg 
stumbled upon it because he looked at latencies.

I'll submit a slightly improved v12 shortly.

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Please find attached a v12, which under timer_exceeded interrupts clients 
which are being throttled instead of waiting for the end of the 
transaction, as the transaction is not started yet.

I've also changed the log format that I used for debugging the apparent 
latency issue:
  x y z 12345677 1234 -> x y z 12345677.001234

It seems much clearer that way.

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
> Please find attached a v12, which under timer_exceeded interrupts clients 
> which are being throttled instead of waiting for the end of the transaction, 
> as the transaction is not started yet.

Oops, I forgot the attachment. Here it is!

-- 
Fabien.

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
> So my considered conclusion is that the issue is somewhere within PQfinish(), 
> possibly in interaction with what pgbench does, but is *NOT* related in any way 
> to the throttling patch, as it predates it. Greg stumbled upon it 
> because he looked at latencies.

An additional thought:

The latency measures *elapsed* time. As a small laptop is running 30 
clients and their server processes at a significant load, there is a lot 
of context switching going on, so maybe it just happens that the pgbench 
process is switched out and back in while PQfinish() is running, so the 
point would really be that the host is loaded and that's it. I'm not sure 
of the likelihood of such an event. It is possible that it would be more 
frequent after timer_exceeded because the system is closing postgres 
processes, and would depend on what the kernel process scheduler does.

So the explanation would be: your system is loaded, and it shows in subtle 
ways here and there when you do detailed measures. That is life.

Basically this is a summary of my (long) experience with performance 
experiments on computers. What are you really measuring? What is really 
happening?

When a system is loaded, there are many things which interact with one 
another and induce particular effects on performance measures. So usually 
what is measured is not what one expects.

Greg thought that he was measuring transaction latencies, but it was 
really client contention in a thread. I thought that I was measuring 
PQfinish() time, but maybe it is the probability of a context switch.

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
> An additional thought:

Yet another thought, hopefully final on this subject.

I think that the probability of a context switch is higher when calling 
PQfinish than in other parts of pgbench because it contains system calls 
(e.g. closing the network connection) where the kernel is likely to stop 
this process and activate another one.

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
> Please find attached a v12, which under timer_exceeded interrupts 
> clients which are being throttled instead of waiting for the end of the 
> transaction, as the transaction is not started yet.

Please find attached a v13 which fixes conflicts introduced by the long 
options patch committed by Robert Haas.

-- 
Fabien.

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
On 6/22/13 12:54 PM, Fabien COELHO wrote:
> After some poking around, and pursuing various red herrings, I resorted
> to measuring the delay of calling "PQfinish()", which is really the only
> special thing going on at the end of a pgbench run...

This wasn't what I was seeing, but it's related.  I've proved to myself 
that the throttle change isn't responsible for the weird stuff I'm seeing 
now.  I'd like to rearrange when PQfinish happens based on what I'm 
seeing, but that's not related to this review.

I duplicated the PQfinish problem you found too.  On my Linux system, 
calls to PQfinish are normally about 36 us long.  They will sometimes 
get lost for >15ms before they return.  That's a different problem 
though, because the ones I'm seeing on my Mac are sometimes >150ms. 
PQfinish never takes quite that long.

PQfinish doesn't pause for a long time on this platform.  But it does 
*something* that causes socket select() polling to stutter.  I have 
instrumented everything interesting in this part of the pgbench code, 
and here is the problem event.

1372531862.062236 select with no timeout sleeping=0
1372531862.109111 select returned 6 sockets latency 46875 us

Here select() is called with 0 sleeping processes, 11 that are done, and 
14 that are running.  The running ones have all sent SELECT statements 
to the server, and they are waiting for a response.  Some of them 
received some data from the server, but they haven't gotten the entire 
response back.  (The PQfinish calls could be involved in how that happened.)

With that setup, select runs for 47 *ms* before it gets the next byte to 
a client.  During that time 6 clients get responses back, but it 
stays stuck in there for a long time anyway.  Why?  I don't know exactly 
why, but I am sure that pgbench isn't doing anything weird.  It's either 
libpq acting funny, or the OS.  When pgbench is waiting on a set of 
sockets, and none of them are returning anything, that's interesting. 
But there's nothing pgbench can do about it.

The cause/effect here is that the randomness in the throttling code 
spreads out the times at which the connections end a bit.  There are more 
times during which you might have 20 connections finished while 5 still run.

I need to catch up with revisions done to this feature since I started 
instrumenting my copy more heavily.  I hope I can get this ready for 
commit by Monday.  I've certainly beaten on the feature for long enough now.

-- 
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
> [...] Why?  I don't know exactly why, but I am sure that pgbench isn't 
> doing anything weird.  It's either libpq acting funny, or the OS.

My guess is the OS. "PQfinish" and "select" make system calls that 
present opportunities to switch context. I think that the OS is spending 
time on other processes on the same host, especially postgres backends, 
when it is not with the client. To test that, pgbench should run on a 
dedicated box with fewer threads than the number of available cores, 
or user time could be measured in addition to elapsed time. Also, testing 
with many clients per thread means that if any client is stuck all other 
clients incur an artificial latency: the measures are intrinsically fragile.
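
Measuring user time next to elapsed time is cheap, e.g. with a 
hypothetical helper like this:

  #include <stdint.h>
  #include <sys/resource.h>
  #include <sys/time.h>

  /* User CPU time consumed by this process, in us.  Comparing its delta
   * with the elapsed-time delta around PQfinish() would show whether the
   * wall-clock gap is real work or the process being scheduled out. */
  static int64_t
  user_time_us(void)
  {
      struct rusage ru;

      getrusage(RUSAGE_SELF, &ru);
      return (int64_t) ru.ru_utime.tv_sec * 1000000 + ru.ru_utime.tv_usec;
  }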

> I need to catch up with revisions done to this feature since I started 
> instrumenting my copy more heavily.  I hope I can get this ready for 
> commit by Monday.

Ok, thanks!

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Josh Berkus
Date:
On 06/29/2013 04:11 PM, Greg Smith wrote:
> I need to catch up with revisions done to this feature since I started
> instrumenting my copy more heavily.  I hope I can get this ready for
> commit by Monday.  I've certainly beaten on the feature for long enough
> now.

Greg, any progress?  Haven't seen an update on this in 10 days.

-- 
Josh Berkus
PostgreSQL Experts Inc.
http://pgexperts.com



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
On 6/30/13 2:04 AM, Fabien COELHO wrote:
> My guess is the OS. "PQfinish" and "select" are (or make) system calls
> that present opportunities to switch context.  I think that the OS is
> passing time with other processes on the same host, especially postgres
> backends, when it is not with the client.

I went looking for other instances of this issue in pgbench results; 
that's what I got lost in for the last two weeks.  It's subtle because the 
clients normally all end in one very short burst of time, but I have 
found evidence of PQfinish issues elsewhere.  Evidence still seems to 
match the theory that throttling highlights this only because it spreads 
out the ending a bit more.  Also, it happens to be a lot worse on the 
Mac I did initial testing with, and I don't have nearly as many Mac 
pgbench results.

There's a refactoring possible here that seems to make this whole class 
of problem go away.  If I change pgbench so that PQfinish isn't called 
for any client until *all* of the clients are actually finished with 
transactions, the whole issue goes away.  I'm going to package that hack 
the right way into its own little change, revisit the throttling code, 
and then this all should wrap up nicely.  I'd like to get this one out 
of the commitfest so I can move onto looking at something else.
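
The shape of that hack is simply to remember that a client is done 
instead of closing its connection immediately, and to close everything in 
one pass once no transactions remain anywhere.  A sketch with invented 
names ("Client", "finish_all_clients")--not the actual patch:

    #include <libpq-fe.h>

    typedef struct
    {
        PGconn     *con;
        int         done;       /* client has no more transactions to run */
    } Client;

    /* Called only after every client's last transaction has completed,
     * so no PQfinish() can stall a client that is still running. */
    static void
    finish_all_clients(Client *clients, int nclients)
    {
        int         i;

        for (i = 0; i < nclients; i++)
        {
            if (clients[i].con != NULL)
            {
                PQfinish(clients[i].con);
                clients[i].con = NULL;
            }
        }
    }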

-- 
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Hello Greg,

> There's a refactoring possible here that seems to make this whole class of 
> problem go away.  If I change pgbench so that PQfinish isn't called for any 
> client until *all* of the clients are actually finished with transactions, 
> the whole issue goes away.

Sure. If the explanation is that, because of the load, the OS is just 
switching to a "postgres" process while PQfinish is being called, then 
waiting for the end of other transactions means that "postgres" processes 
don't have anything to do anymore, so process switching is much less 
likely, and nothing would show up... However this is really hiding the 
underlying fact from the measures rather than solving anything, IMHO.

> I'm going to package that hack the right way into its own little change,
> revisit the throttling code, and then this all should wrap up nicely.

My 0.02€: if it means adding complexity to the pgbench code, I think that 
it is not worth it. The point of pgbench is to look at a steady state, not 
to end in the most graceful possible way as far as measures are concerned. 
On the other hand, if it does not add too much complexity, why not!

> I'd like to get this one out of the commitfest so I can move onto 
> looking at something else.

Thanks.

-- 
Fabien.

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
On 7/13/13 12:13 PM, Fabien COELHO wrote:
> My 0.02€: if it means adding complexity to the pgbench code, I think
> that it is not worth it. The point of pgbench is to look at a steady
> state, not to end in the most graceful possible way as far as measures
> are concerned.

That's how some people use pgbench.  I'm just as likely to use it to
characterize system latency.  If there's a source of latency that's
specific to the pgbench code, I want that out of there even if it's hard.

But we don't have to argue about that because it isn't.  The attached
new patch seems to fix the latency spikes at the end, with -2 lines of
new code!  With that resolved I did a final pass across the rate limit
code too, attached as a v14 and ready for a committer.  I don't really
care what order these two changes are committed, there's no hard
dependency, but I would like to see them both go in eventually.

No functional code was changed from your v13 except for tweaking the
output.  The main thing I did was expand/edit comments and rename a few
variables to try and make this easier to read.  If you have any
objections to my cosmetic changes feel free to post an update.  I've put
a good bit of time into trying to simplify this further, thinking it
can't really be this hard.  But this seems to be the minimum complexity
that still works given the mess of the pgbench state machine.  Every
change I try now breaks something.

To wrap up the test results motivating my little pgbench-delay-finish
patch, the throttled cases that were always giving >100ms of latency
clustered at the end here now look like this:

average rate limit lag: 0.181 ms (max 53.108 ms)
tps = 10088.727398 (including connections establishing)
tps = 10105.775864 (excluding connections establishing)

There are still some of these cases where latency spikes, but they're
not as big and they're randomly distributed throughout the run.  The
problem I had with the ones at the end is how they tended to happen a
few times in a row.  I kept seeing multiple of these ~50ms lulls adding
up to a huge one, because the source of the lag kept triggering at every
connection close.

pgbench was already cleaning up all of its connections at the end, after
all the transactions were finished.  It looks safe to me to just rely on
that for calling PQfinish in the normal case.  And calls to client_done
already label themselves ok or abort, the code just didn't do anything
with that state before.  I tried adding some more complicated state
tracking, but that adds complexity while doing the exact same thing as
the simple implementation I did.

The only part of your code change I reverted was altering the latency
log transaction timestamps to read like "1373821907.65702" instead of
"1373821907 65702".  Both formats were considered when I added that
feature, and I completely understand why you'd like to change it.  One
problem is that doing so introduces a new class of float parsing and
rounding issues to consumers of that data.  I'd rather not revisit that
without a better reason to break the output format.  Parsing tools like
my pgbench-tools already struggle trying to support multiple versions of
pgbench, and I don't think there's enough benefit to the float format to
bother breaking them today.
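
To illustrate the rounding concern: an epoch-with-microseconds value has 
about 16 significant digits, which is right at the edge of double 
precision, so a consumer that parses the combined float form is relying 
on sub-ulp luck when it splits the value back apart.  A toy example, not 
pgbench code:

    #include <stdio.h>

    int
    main(void)
    {
        double      t = 1373821907.657020;  /* combined float form */
        long        sec = (long) t;
        long        usec = (long) ((t - sec) * 1e6 + 0.5);

        /* The ulp of a double near 1.4e9 is about 0.24 us, so the
         * integer "sec usec" pair is exact while the float form is not
         * guaranteed to round-trip the microsecond field. */
        printf("parsed back: %ld %06ld\n", sec, usec);
        return 0;
    }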

--
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com

Attachment

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Hello Greg,

> But we don't have to argue about that because it isn't.  The attached new 
> patch seems to fix the latency spikes at the end, with -2 lines of new code!

Hmmm... that looks like not too much complexity:-)

> With that resolved I did a final pass across the rate limit code too, 
> attached as a v14 and ready for a committer.

You attached my v13. Could you send your v14?

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
On 7/14/13 2:48 PM, Fabien COELHO wrote:
> You attached my v13. Could you send your v14?

Correct patch (and the little one from me again) attached this time.

--
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com

Attachment

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Hello Greg,

> Correct patch (and the little one from me again) attached this time.

Please find an updated v15 with only comment changes:

* The new comment about "is_throttled" was inverted wrt the meaning of the 
variable, at least to my understanding.

* ISTM that the impact of the chosen 1000 should appear somewhere.

* The trans_need_throttle comment was slightly wrong: the first 
transaction is also throttled, when initially entering doCustom.


About 123456 12345 vs 123456.012345: My data parser is usually "gnuplot" 
or "my eyes", and in both cases the latter option is better:-)


About the little patch: Parameter "ok" should be renamed to something 
meaningful (say "do_finish"?). Also, it seems that when the timer is 
exceeded in doCustom it is called with true, but maybe you intended that 
it should be called with false?? Moreover, under normal circumstances 
(throttling significantly below the maximum rate) PQfinish will be called 
directly by threadRun while interrupting sleeping threads. Also, it is 
important to remove the connection because it serves as a marker to know 
whether a client must run or not. So basically I do not understand how it 
works. Note that this does not mean that it does not work, it just means 
that I do not really understand:-)

-- 
Fabien.

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
To clarify what state this is all in:  Fabien's latest 
pgbench-throttle-v15.patch is the ready for a committer version.  The 
last two revisions are just tweaking the comments at this point, and his 
version is more correct than my last one.

My little pgbench-delay-finish-v1.patch is a brand new bug fix of sorts 
that, while trivial, isn't ready for commit yet.  I'll start a separate 
e-mail thread and CF entry for that later.  Fabien has jumped into 
initial review comments of that already below, but the throttle feature 
isn't dependent on that.  The finish delay just proves that the latency 
spikes I was getting here aren't directly tied to the throttle feature.

On 7/14/13 5:42 PM, Fabien COELHO wrote:
> * ISTM that the impact of the chosen 1000 should appear somewhere.

I don't have a problem with that, but I didn't see that the little table 
you included was enough to do that.  I think if someone knows how this 
type of random generation works, they don't need the comment to analyze 
the impact.  And if they don't know, that comment alone wasn't enough to 
help them figure it out.  That's why I added some terms that might help 
point the right way for someone who wanted to search for more 
information instead.

The text of pgbench is not really the right place to put a lecture about 
how to generate numbers with a target probability distribution function. 
Normally the code comments try to recommend references for this sort 
of thing instead.  I didn't find a really good one in a quick search though.

> About 123456 12345 vs 123456.012345: My data parser is usually "gnuplot"
> or "my eyes", and in both cases the later option is better:-)

pgbench-tools uses gnuplot too.  If I were doing this again today from 
scratch, I would recommend using the gnuplot-compatible epoch time format 
you suggested.  I need to look into this whole topic a little more 
before we get into that though.  This patch just wasn't the right place 
to get into that change.

> About the little patch: Parameter "ok" should be renamed to something
> meaningful (say "do_finish"?).

It's saying if the connection finished "ok" or not.  I think exactly 
what is done with that information is an implementation detail that 
doesn't need to be exposed to the function interface.  We might change 
how this is tied to PQfinish later.

> Also, it seems that when timer is
> exceeded in doCustom it is called with true, but maybe you intended that
> it should be called with false??

The way timeouts are handled right now is a known messy thing.  Exactly 
what you should do with a client that has hit one isn't obvious.  Try 
again?  Close the connection and abort?  The code has a way it handles 
that now, and I didn't want to change it any.

> it is important to remove the connection because it serves as a marker
> to know whether a client must run or not.

This little hack moved around how clients finished enough to get rid of 
the weird issue with your patch I was bothered by.  You may be right 
that the change isn't really correct because of how the connection is 
compared to null as a way to see if it's active.  I initially added a 
more complicated "finished" state to the whole mess that tracked this 
more carefully.  I may need to return to that idea now.

-- 
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
>> * ISTM that the impact of the chosen 1000 should appear somewhere.
>
> I don't have a problem with that, but I didn't see that the little table you 
> included was enough to do that.  I think if someone knows how this type of 
> random generation works, they don't need the comment to analyze the impact. 
> And if they don't know, that comment alone wasn't enough to help them figure 
> it out.  That's why I added some terms that might help point the right way 
> for someone who wanted to search for more information instead.

Sure. I agree that comments are not the right place for a lecture about 
Poisson stochastic processes.  Only, the "1000" parameter has an impact on 
the maximum delay that can be incurred with respect to the target average 
delay, and I think that this information is relevant for a comment.
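
To be concrete: since getrand(thread, 1, 1000) draws uniformly from 
1..1000, the multiplier -log(u/1000) ranges from 0 up to -log(1/1000), 
about 6.9, so the maximum delay is bounded by about 6.9 times the target 
average delay.  A standalone sketch of the sampling, with plain rand() 
standing in for pgbench's per-thread getrand():

    #include <math.h>
    #include <stdio.h>
    #include <stdlib.h>

    int
    main(void)
    {
        const double throttle_delay = 1000.0;   /* target average, usec */
        double      sum = 0.0, max = 0.0;
        int         i;

        srand(42);
        for (i = 0; i < 1000000; i++)
        {
            int         u = rand() % 1000 + 1;  /* uniform in 1..1000 */
            /* exponential wait; the 1000 caps the multiplier at
             * -log(1/1000) ~= 6.9, bounding the maximum delay */
            double      wait = throttle_delay * -log(u / 1000.0);

            sum += wait;
            if (wait > max)
                max = wait;
        }
        printf("average wait %.1f usec, maximum wait %.1f usec\n",
               sum / 1000000, max);
        return 0;
    }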

> to generate numbers with a target probability distribution function. 
> Normally the code comments tries to recommend references for this sort 
> of thing instead.  I didn't find a really good one in a quick search 
> though.

Yep. Maybe "http://en.wikipedia.org/wiki/Exponential_distribution".

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Tatsuo Ishii
Date:
> To clarify what state this is all in: Fabien's latest
> pgbench-throttle-v15.patch is the ready for a committer version.  The
> last two revisions are just tweaking the comments at this point, and
> his version is more correct than my last one.

Got it. I will take care of this.
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
>> To clarify what state this is all in: Fabien's latest
>> pgbench-throttle-v15.patch is the ready for a committer version.  The
>> last two revisions are just tweaking the comments at this point, and
>> his version is more correct than my last one.
>
> Got it. I will take care of this.

Please find attached an updated version which solves conflicts introduced 
by the "progress" patch.

-- 
Fabien.

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Tatsuo Ishii
Date:
>>> To clarify what state this is all in: Fabien's latest
>>> pgbench-throttle-v15.patch is the ready for a committer version.  The
>>> last two revisions are just tweaking the comments at this point, and
>>> his version is more correct than my last one.
>>
>> Got it. I will take care of this.
> 
> Please find attached an updated version which solves conflicts
> introduced by the "progress" patch.

Thanks, but I already solved the conflict and fixed some minor
indentation issues. Now I have a question regarding the function.

./pgbench -p 5433 -S -T 10 -R 10000 test
starting vacuum...end.
transaction type: SELECT only
scaling factor: 1
query mode: simple
number of clients: 1
number of threads: 1
duration: 10 s
number of transactions actually processed: 71339
average rate limit lag: 862.534 ms (max 2960.913 ms)
tps = 7133.745911 (including connections establishing)
tps = 7135.130810 (excluding connections establishing)

What does "average rate limit lag" mean? From the manual:

-R rate
--rate rate
   Execute transactions targeting the specified rate instead of
   running as fast as possible (the default).  The rate is given in
   transactions per second.  If the targeted rate is above the maximum
   possible rate these transactions can execute at, the rate limit
   won't have any impact on results.  The rate is targeted by starting
   transactions along a Poisson-distributed event time line.  When a
   rate limit is active, the average and maximum transaction lag time
   (the delay between the scheduled and actual transaction start
   times) are reported in ms.  High values indicate that the database
   could not handle the scheduled load at some time.

So in my understanding the number shows the delay time before *each*
transaction starts. If my understanding is correct, why

71339 (total transactions) * 862.534 ms = 61532 sec

could exceed 10 seconds, which is the total run time?

Also I noticed a small bug.

./pgbench -R 0 test
invalid rate limit: 0

Shouldn't this be treated as if -R is not specified? Actually in the program:

/*
 * When threads are throttled to a given rate limit, this is the target delay
 * to reach that rate in usec.  0 is the default and means no throttling.
 */
int64        throttle_delay = 0;

So it seems to me that treating "-R 0" as "no throttling" makes more sense.
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Hello Tatsuo,

> Now I have a question regarding the function.
>
> ./pgbench -p 5433 -S -T 10 -R 10000 test
> tps = 7133.745911 (including connections establishing)
>
> What does "average rate limit lag" mean? From the manual:
> [...]
> So in my understanding the number shows the delay time before *each*
> transaction starts.

... with respect to the schedule time assigned by the rate-limiting 
stochastic process. This is there to detect when rate limiting does not 
work properly.

> If my understanding is correct, why 71339 (total transactions) * 862.534 
> ms = 61532 sec could exceed 10 seconds, which is the total run time?

It is possible, because each transaction is far behind schedule, and you 
accumulate the lateness.

Say you have transactions scheduled every 0.1 second, but they take 2 
seconds to complete:

 1. scheduled at 0.0, start at 0.0
 2. scheduled at 0.1, start at 2.0, 1.9 second lag
 3. scheduled at 0.2, start at 4.0, 3.8 second lag, cumulative lag 5.7 s
 4. scheduled at 0.3, start at 6.0, 5.7 second lag, cumulative lag 11.4 s
 5. scheduled at 0.4, start at 8.0, 7.6 second lag, cumulative lag 19.0 s
 6. scheduled at 0.5, never starts

If we stop at 10.0 seconds, 5 transactions have been processed, the average 
lag is about 3.8 seconds, and the cumulative lag is 19.0 seconds.  The lag 
of a given transaction can cover lag from previous ones.
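
For checking, the same toy schedule can be recomputed mechanically; a 
small standalone C program, an illustration only, not pgbench code:

    #include <stdio.h>

    int
    main(void)
    {
        double      interval = 0.1, duration = 2.0, end = 10.0;
        double      start = 0.0, cumulative = 0.0;
        int         n = 0, i;

        for (i = 0; ; i++)
        {
            double      scheduled = i * interval;

            if (start < scheduled)
                start = scheduled;      /* ahead of schedule: wait */
            if (start >= end)
                break;                  /* bench is over, never starts */
            cumulative += start - scheduled;
            n++;
            start += duration;          /* client busy until then */
        }
        /* prints: 5 transactions, cumulative lag 19.0 s, average 3.8 s */
        printf("%d transactions, cumulative lag %.1f s, average %.1f s\n",
               n, cumulative, cumulative / n);
        return 0;
    }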

Basically, if the lag is anything but small, it means that the database 
cannot handle the load and that something is amiss. In your example you 
required 10000 tps, but the database can only handle 7000 tps.

Note that the database could catch up at some point: say it usually can 
handle more than 10000 tps, but while a database dump is running it 
falls far behind schedule, and then once the dump is done it goes back to 
nominal and "late" transactions are finally processed. The max lag would 
show that something was amiss during the bench, even if the average lag 
is quite low.

> Also I noticed a small bug.
>
> ./pgbench -R 0 test
> invalid rate limit: 0
>
> Shouldn't this be treated as if -R is not specified? Actually in the program:
>
> /*
> * When threads are throttled to a given rate limit, this is the target delay
> * to reach that rate in usec.  0 is the default and means no throttling.
> */
> int64        throttle_delay = 0;
>
> So it seems treating "-R 0" means "no throttling" makes more sense to me.

Note that the rate is expressed in tps which makes sense to users, but the 
internal variable is in usec which is more useful for scheduling, and is 
the inverse of the other.

So -R 0 would mean zero tps, that is an infinite delay, but a 0 delay 
would require an infinite tps. As requiring 0 tps does not make sense, I 
decided to disable that. If you really feel that "-R 0" should mean 
disabling the feature, I'm fine with that, but this is not exactly logical 
wrt tps.
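
Illustrating the inversion (this is the relationship, not the exact 
option-parsing code of the patch):

    #include <stdio.h>
    #include <stdlib.h>

    int
    main(int argc, char **argv)
    {
        double      rate = (argc > 1) ? atof(argv[1]) : 5000.0;  /* tps */

        if (rate <= 0.0)
        {
            /* 0 tps would mean an infinite delay, hence the rejection */
            fprintf(stderr, "invalid rate limit: %g\n", rate);
            return 1;
        }
        printf("target rate %g tps -> average delay %.0f usec\n",
               rate, 1000000.0 / rate);
        return 0;
    }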

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Tatsuo Ishii
Date:
> Hello Tatsuo,
> 
>> Now I have a question regarding the function.
>>
>> ./pgbench -p 5433 -S -T 10 -R 10000 test
>> tps = 7133.745911 (including connections establishing)
>>
>> What does "average rate limit lag" mean? From the manual:
>> [...]
>> So in my understanding the number shows the delay time before *each*
>> transaction starts.
> 
> ... with respect to the schedule time assigned by the rate-limiting
> stochastic process. This is to detect that rate limiting does not work
> properly.
> 
>> If my understanding is correct, why 71339 (total transactions) *
>> 862.534 ms = 61532 sec could exceed 10 seconds, which is the total run
>> time?
> 
> It is possible, because each transaction is far behind schedule, and
> you accumulate the lateness.
> 
> Say you have transactions scheduled every 0.1 second, but they take 2
> seconds to complete:
> 
>  1. scheduled at 0.0, start at 0.0
>  2. scheduled at 0.1, start at 2.0, 1.9 second lag
>  3. scheduled at 0.2, start at 4.0, 3.8 second lag, cumulative lag 5.7
>  s
>  4. scheduled at 0.3, start at 6.0, 5.7 second lag, cumulative lag 11.4
>  s
>  5. scheduled at 0.4, start at 8.0, 7.6 second lag, cumulative lag 19.0
>  s
>  6. scheduled at 0.5, never starts
> 
> If we stop at 10.0 seconds, 5 transactions have been processed, the
> average lag is about 3.8 seconds, the cumulative lag is 19.0
> seconds. The lag of a given transaction can cover lag from previous
> ones.
> 
> Basically, if the lag is anything but small, it means that the
> database cannot handle the load and that something is amiss. In your
> example you required 10000 tps, but the database can only handle 7000
> tps.
> 
> Note that the database could catch up at some point: say it usually can
> handle more than 10000 tps, but while a database dump is running it
> falls far behind schedule, and then once the dump is done it goes back
> to nominal and "late" transactions are finally processed. The max lag
> would show that something was amiss during the bench, even if the
> average lag is quite low.

Thanks for the detailed explanations. I now understand the function.

>> Also I noticed a small bug.
>>
>> ./pgbench -R 0 test
>> invalid rate limit: 0
>>
>> Shouldn't this be treated as if -R is not specified? Actually in the
>> program:
>>
>> /*
>> * When threads are throttled to a given rate limit, this is the target
>> * delay
>> * to reach that rate in usec.  0 is the default and means no throttling.
>> */
>> int64        throttle_delay = 0;
>>
>> So it seems treating "-R 0" means "no throttling" makes more sense to
>> me.
> 
> Note that the rate is expressed in tps which makes sense to users, but
> the internal variable is in usec which is more useful for scheduling,
> and is the inverse of the other.
> 
> So -R 0 would mean zero tps, that is an infinite delay, but a 0 delay
> would require an infinite tps. As requiring 0 tps does not make sense,
> I decided to disable that. If you really feel that "-R 0" should mean
> disabling the feature, I'm fine with that, but this is not exactly
> logical wrt tps.

Ok, your statement seems fair. Unless someone complains about this
point, I will leave it as it is.

I'm going to test your patches on Mac OS X and Windows.
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
On 7/17/13 2:31 AM, Tatsuo Ishii wrote:
> ./pgbench -p 5433 -S -T 10 -R 10000 test
> average rate limit lag: 862.534 ms (max 2960.913 ms)
> tps = 7133.745911 (including connections establishing)
> tps = 7135.130810 (excluding connections establishing)
>
> What does "average rate limit lag" mean?

The whole concept of "lag" with the rate limit is complicated.  At one 
point I thought this should be a debugging detail, rather than exposing 
the user to it.

The problem is that if you do that, you might not notice that your limit 
failed to work as expected.  Maybe it's good enough in a case like this 
that the user will see they tried to limit at 10000, but they only got 
7135, so something must not have worked as expected.

Tatsuo:  most of my tests were on Mac OS and Linux, I actually tested 
the Mac version a lot more than any other here.  I didn't do any testing 
on Windows.

-- 
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
> Thanks for detailed explainations. I now understand the function.

Good. I've looked into the documentation. I'm not sure how I could improve 
it significantly without adding a lot of text which would also add a lot 
of confusion to the casual reader.

> I'm going to test your patches on Mac OS X and Windows.

Great! I cannot do that.

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
> The whole concept of "lag" with the rate limit is complicated.

I must agree on that point, their interpretation is subtle.

> At one point I thought this should be a debugging detail, rather than 
> exposing the user to it. The problem is that if you do that, you might 
> not notice that your limit failed to work as expected.  Maybe it's good 
> enough in a case like this that the user will see they tried to limit at 
> 10000, but they only got 7135, so something must not have worked as 
> expected.

Yep. As I suggested in my answer to Tatsuo, the process can catch up 
later, so you could have 10000 tps in the end even with something amiss.

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Tatsuo Ishii
Date:
>> The whole concept of "lag" with the rate limit is complicated.
> 
> I must agree on that point, their interpretation is subtle.
> 
>> At one point I thought this should be a debugging detail, rather than
>> exposing the user to it. The problem is that if you do that, you might
>> not notice that your limit failed to work as expected.  Maybe it's
>> good enough in a case like this that the user will see they tried to
>> limit at 10000, but they only got 7135, so something must not have
>> worked as expected.
> 
> Yep. As I suggested in answering to Tatsuo, the process can catch up
> later, so you could have 10000 in the end even with something amiss.

Fabien,

I tried another case. First, I ran pgbench without -R.

$ ./pgbench -p 5433 -S -n -c 10 -T 300 test
./pgbench -p 5433 -S -n -c 10 -T 300 test
transaction type: SELECT only
scaling factor: 1
query mode: simple
number of clients: 10
number of threads: 1
duration: 300 s
number of transactions actually processed: 2945652
tps = 9818.741060 (including connections establishing)
tps = 9819.389689 (excluding connections establishing)

So I thought I could squeeze 10000 TPS from my box.
Then I tried with -R 5000 tps.

$ ./pgbench -p 5433 -S -n -c 10 -T 300 -R 5000 test
./pgbench -p 5433 -S -n -c 10 -T 300 -R 5000 test
transaction type: SELECT only
scaling factor: 1
query mode: simple
number of clients: 10
number of threads: 1
duration: 300 s
number of transactions actually processed: 1510640
average rate limit lag: 0.304 ms (max 19.101 ms)
tps = 5035.409397 (including connections establishing)
tps = 5035.731093 (excluding connections establishing)

As you can see, I got about 5000 tps as expected. But I'm confused by
the lag:

0.304 ms * 1510640 = 459.2 seconds, which is longer than 300 seconds
(specified by -T). Am I missing something?
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Tatsuo Ishii
Date:
>>> The whole concept of "lag" with the rate limit is complicated.
>> 
>> I must agree on that point, their interpretation is subtle.
>> 
>>> At one point I thought this should be a debugging detail, rather than
>>> exposing the user to it. The problem is that if you do that, you might
>>> not notice that your limit failed to work as expected.  Maybe it's
>>> good enough in a case like this that the user will see they tried to
>>> limit at 10000, but they only got 7135, so something must not have
>>> worked as expected.
>> 
>> Yep. As I suggested in answering to Tatsuo, the process can catch up
>> later, so you could have 10000 in the end even with something amiss.
> 
> Fabien,
> 
> I tried another case. First, I ran pgbench without -R.
> 
> $ ./pgbench -p 5433 -S -n -c 10 -T 300 test
> ./pgbench -p 5433 -S -n -c 10 -T 300 test
> transaction type: SELECT only
> scaling factor: 1
> query mode: simple
> number of clients: 10
> number of threads: 1
> duration: 300 s
> number of transactions actually processed: 2945652
> tps = 9818.741060 (including connections establishing)
> tps = 9819.389689 (excluding connections establishing)
> 
> So I thought I could squeeze 10000 TPS from my box.
> Then I tried with -R 5000 tps.
> 
> $ ./pgbench -p 5433 -S -n -c 10 -T 300 -R 5000 test
> ./pgbench -p 5433 -S -n -c 10 -T 300 -R 5000 test
> transaction type: SELECT only
> scaling factor: 1
> query mode: simple
> number of clients: 10
> number of threads: 1
> duration: 300 s
> number of transactions actually processed: 1510640
> average rate limit lag: 0.304 ms (max 19.101 ms)
> tps = 5035.409397 (including connections establishing)
> tps = 5035.731093 (excluding connections establishing)
> 
> As you can see, I got about 5000 tps as expected. But I'm confused by
> the lag:
> 
> 0.304 ms * 1510640 = 459.2 seconds, which is longer than 300 seconds
> (specified by -T). Am I missing something?

BTW, the system was Linux (kernel 3.0.77).

Now I tried on Mac OS X.

$ pgbench -S -n -c 10 -T 10  test
transaction type: SELECT only
scaling factor: 1
query mode: simple
number of clients: 10
number of threads: 1
duration: 10 s
number of transactions actually processed: 67333
tps = 6730.940132 (including connections establishing)
tps = 6751.078966 (excluding connections establishing)

$ pgbench -S -n -c 10 -T 10  -R 3000 test
transaction type: SELECT only
scaling factor: 1
query mode: simple
number of clients: 10
number of threads: 1
duration: 10 s
number of transactions actually processed: 29840
average rate limit lag: 0.089 ms (max 27.301 ms)
tps = 2983.707895 (including connections establishing)
tps = 2991.919611 (excluding connections establishing)

0.089 ms * 29840 = 2.66 seconds. Not too bad compared with 10
seconds. On Linux maybe the overhead to calculate the lag is bigger
than Mac OS X? Just my wild guess though...
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
>> tps = 9818.741060 (including connections establishing)
>>
>> So I thought I could squeeze 10000 TPS from my box.
>> Then I tried with -R 5000 tps.
>>
>> number of transactions actually processed: 1510640
>> average rate limit lag: 0.304 ms (max 19.101 ms)
>> tps = 5035.409397 (including connections establishing)
>>
>> As you can see, I got about 5000 tps as expected.

Yep, it works:-)

>> But I'm confused by the lag:
>>
>> 0.304 ms * 1510640 = 459.2 seconds, which is longer than 300 seconds
>> (specified by -T). Am I missing something?

The lag is reasonable, although not too good. One transaction is about 
1.2 ms, the lag is much smaller than that, and you are at about 50% of the 
maximum load. I've got similar figures on my box for such settings.

If you reduce the number of clients, or add more threads, the lag is 
reduced.

> BTW, the system was Linux (kernel 3.0.77).

> tps = 6730.940132 (including connections establishing)
> $ pgbench -S -n -c 10 -T 10  -R 3000 test

> average rate limit lag: 0.089 ms (max 27.301 ms)
> tps = 2983.707895 (including connections establishing)
>
> 0.089 ms * 29840 = 2.66 seconds. Not too bad compared with 10
> seconds.

Indeed, that is better. Transactions are about 1.5 ms and you run at about 
45% of the maximum load here.

> On Linux maybe the overhead to calculate the lag is bigger
> than Mac OS X? Just my wild guess though...

I would be surprised if the issue were computing the measure, compared to 
network connections and the like. With -S the bench is cpu bound. 
Possibly a better scheduler/thread management on OSX? Or more available 
cores? Well, I do not know! At high load with clients running on the same 
box as the server, and with more clients & server processes than available 
cores, there is a lot of competition between processes, and between 
clients that share a unique thread, and a lot of context switching, which 
will result in a measured lag.

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Tatsuo Ishii
Date:
>>> tps = 9818.741060 (including connections establishing)
>>>
>>> So I thought I could squeeze 10000 TPS from my box.
>>> Then I tried with -R 5000 tps.
>>>
>>> number of transactions actually processed: 1510640
>>> average rate limit lag: 0.304 ms (max 19.101 ms)
>>> tps = 5035.409397 (including connections establishing)
>>>
>>> As you can see, I got about 5000 tps as expected.
> 
> Yep, it works:-)
> 
>>> But I'm confused by the lag:
>>>
>>> 0.304 ms * 1510640 = 459.2 seconds, which is longer than 300 seconds
>>> (specified by -T). Am I missing something?
> 
> The lag is reasonable, although not too good. One transaction is
> about 1.2 ms, the lag is much smaller than that, and you are at about
> 50% of the maximum load. I've got similar figures on my box for such
> settings. It improves if you reduce the number of clients.

No, 5000 TPS = 1/5000 = 0.2 ms per transaction, no? However pgbench
says the average lag is 0.304 ms, so the lag is longer than the
transaction itself.

> If you reduce the number of clients, or add more threads, the lag is
> reduced.
> 
>> BTW, the system was Linux (kernel 3.0.77).
> 
>> tps = 6730.940132 (including connections establishing)
>> $ pgbench -S -n -c 10 -T 10  -R 3000 test
> 
>> average rate limit lag: 0.089 ms (max 27.301 ms)
>> tps = 2983.707895 (including connections establishing)
>>
>> 0.089 ms * 29840 = 2.66 seconds. Not too bad compared with 10
>> seconds.
> 
> Indeed, that is better. Transactions are about 1.5 ms and you run at
> about 45% of the maximum load here.
> 
>> On Linux maybe the overhead to calculate the lag is bigger
>> than Mac OS X? Just my wild guess though...
> 
> I would be surprised if the issue were computing the
> measure, compared to network connections and the like. With -S the
> bench is cpu bound. Possibly a better scheduler/thread management on
> OSX? Or more available cores?

The number of cores is the same.  I don't understand why the number of
cores is related, though. Anyway, as you can see in Mac OS X's case, TPS
itself is no better than on the Linux box.

>  Well, I do not know! At high load with
> clients running on the same box as the server, and with more clients &
> server processes than available cores, there is a lot of competition
> between processes, and between clients that share a unique thread, and a
> lot of context switching, which will result in a measured lag.

Hmm... I would like to have a cleaner explanation/evidence before
committing the patch.
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Hello Tatsuo,

>> The lag is reasonable, although not too good. One transaction is
>> about 1.2 ms, the lag is much smaller than that, and you are at about
>> 50% of the maximum load. I've got similar figures on my box for such
>> settings. It improves if you reduce the number of clients.
>
> No, 5000 TPS = 1/5000 = 0.2 ms per transaction, no?

Hmmm... Yes, and no:-)

Transactions are handled in parallel because there are 10 clients. I look 
at actual transaction times (latency) from a client perspective, not the 
"apparent" time due to parallelism, and compare it to the measured 
lag, which is also measured per client.

The transaction time I reported is derived from your maximum tps per 
client: 10 clients / 8300 tps = 1.2 ms / trans. However, there are 10 
transactions in progress in parallel.

When you're running at 50% load, the clients basically spend 1.2 ms doing 
a transaction (sending queries, getting results), and 1.2 ms sleeping 
because of rate limiting. The reported 0.3 ms lag means that when sleeping 
1.2 ms a client tends to start a little bit later, after 1.5 ms, but this 
latency does not show up in the throughput figures because the next sleep 
will just be smaller, to catch up.

As you have 10 clients in one pgbench thread, the scheduling says to start 
a new transaction for a client at a certain time, but the pgbench process 
is late to actually handle this client query because it is doing other 
things, like attending to one of the other clients, or being switched off 
to run a server process.

> However pgbench says average lag is 0.304 ms. So the lag is longer than 
> transaction itself.

See above.

>> I would be surprised if the issue were computing the
>> measure, compared to network connections and the like. With -S the
>> bench is cpu bound. Possibly a better scheduler/thread management on
>> OSX? Or more available cores?
>
> The number of cores is the same.  I don't understand why the number of
> cores is related, though.

In my mind, because "pgbench -S" is cpu bound, and with "-c 10" you have 
to run pgbench and 10 "postgres" backends, that is 11 processes competing 
for cpu time. If you have 11 cores that is mostly fine; if you have fewer 
then there will be some delay depending on how the process scheduler in 
the OS allocates cpu time. With a load of 50%, about 6 cores should be 
enough to run the load transparently (client & server).

>>  Well, I do not know! At high load with clients running on the same box 
>> as the server, and with more clients & server processes than available 
>> cores, there is a lot of competition between processes, and between 
>> clients that share a unique thread, and a lot of context switching, 
>> which will result in a measured lag.
>
> Hmm... I would like to have cleaner explanation/evidence before
> committing the patch.

The lag measures you report seem pretty consistent to me given the load 
you're requiring, for a cpu bound bench, with more processes to run than 
available cores. At least, I'm buying my own explanation, and I hope to be 
convincing.

If you want to isolate yourself from such effects, pgbench must run on a 
different host than the server, with as many threads as there are cores 
available, and not too many clients per thread. This is also true without 
throttling, but it shows more under throttling because of the lag 
(latency) measures.
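
For instance, something along these lines, with the client box separate 
from the database host (the hostname and sizes are invented for the 
example):

    $ pgbench -h dbserver -j 8 -c 8 -T 300 -R 5000 test

i.e. one client per thread and no more threads than the client box has 
cores, so that no client is ever waiting for its own thread.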

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Tatsuo Ishii
Date:
Fabien,

> Hello Tatsuo,
> 
>>> The lag is reasonable, although not too good. One transaction is
>>> about 1.2 ms, the lag is much smaller than that, and you are at about
>>> 50% of the maximum load. I've got similar figures on my box for such
>>> settings. It improves if you reduce the number of clients.
>>
>> No, 5000 TPS = 1/5000 = 0.2 ms per transaction, no?
> 
> Hmmm... Yes, and no:-)
> 
> Transactions are handled in parallel because there are 10 clients. I
> look at actual transaction times (latency) from a client perspective,
> not the "apparent" time because of parallelism, and compare it to the
> measured lag, which is also measured per client.
> 
> The transaction time I reported is derived from your maximum tps per
> client : 10 clients / 8300 tps = 1.2 ms / trans. However, there are 10
> transactions in progress in parallel.
> 
> When you're running at 50% load, the clients basically spend 1.2 ms
> doing a transaction (sending queries, getting results), and 1.2 ms
> sleeping because of rate limiting. The reported 0.3 ms lag means that
> when sleeping 1.2 ms a client tends to start a little bit later, after
> 1.5 ms, but this latency does not show up in the throughput figures
> because the next sleep will just be smaller, to catch up.
> 
> As you have 10 clients in one pgbench thread, the scheduling says to
> start a new transaction for a client at a certain time, but the
> pgbench process is late to actually handle this client query because
> it is doing other things, like attending one of the other clients, or
> being switched off to run a server process.
> 
>> However pgbench says average lag is 0.304 ms. So the lag is longer
>> than transaction itself.
> 
> See above.
> 
>>> I would be surprised if the issue were computing the
>>> measure, compared to network connections and the like. With -S the
>>> bench is cpu bound. Possibly a better scheduler/thread management on
>>> OSX? Or more available cores?
>>
>> The number of cores is the same.  I don't understand why the number of
>> cores is related, though.
> 
> In my mind, because "pgbench -S" is cpu bound, and with "-c 10" you
> have to run pgbench and 10 "postgres" backends, that is 11 processes
> competing for cpu time. If you have 11 cores that is mostly fine; if
> you have fewer then there will be some delay depending on how the
> process scheduler in the OS allocates cpu time. With a
> load of 50%, about 6 cores should be enough to run the load
> transparently (client & server).
> 
>>>  Well, I do not know! At high load with clients running on the same box
>>>  as the server, and with more clients & server processes than available
>>>  cores, there is a lot of competition between processes, and between
>>>  clients that share a unique thread, and a lot of context switching,
>>>  which will result in a measured lag.
>>
>> Hmm... I would like to have cleaner explanation/evidence before
>> committing the patch.
> 
> The lag measures you report seem pretty consistent to me given the
> load you're requiring, for a cpu bound bench, with more processes to run
> than available cores. At least, I'm buying my own explanation, and I
> hope to be convincing.
> 
> If you want to isolate yourself from such effects, pgbench must run on
> a different host than the server, with as many threads as there are
> cores available, and not too many clients per thread. This is also
> true without throttling, but it shows more under throttling because of
> the lag (latency) measures.

I think I'm starting to understand what's going on.  Suppose there are
n transactions issued by pgbench and it decides each schedule d(0),
d(1), ..., d(n).  Actually the schedule d(i) (which is stored in
st->until) is decided by the following code:
    int64 wait = (int64)
        throttle_delay * -log(getrand(thread, 1, 1000)/1000.0);
    thread->throttle_trigger += wait;
    st->until = thread->throttle_trigger;

st->until represents the time by which the nap has to be finished, i.e.
the scheduled start time.  Now transaction i finishes at t(i), so the lag
l(i) = t(i) - d(i) if the transaction is behind.  Then the next
transaction i+1 begins, with lag l(i+1) = t(i+1) - d(i+1), and so on.  At
the end of the run, pgbench shows the average lag as sum(l(0)...l(n))/n.

Now suppose we have 3 transactions and each has following values:

d(0) = 10
d(1) = 20
d(2) = 30

t(0) = 100
t(1) = 110
t(2) = 120

That says pgbench expects a duration of 10 for each
transaction.  Actually, the first transaction runs slowly for some
reason and the lag = 100 - 10 = 90.  However, tx(1) and tx(2) are
finished on schedule because they spend only 10 (110-100 = 10, 120-110
= 10).  So the expected average lag would be 90/3 = 30.

However, pgbench actually calculates it like this:

average lag = (t(0)-d(0) + t(1)-d(1) + t(2)-d(2))/3
            = (100-10 + 110-20 + 120-30)/3
            = (90 + 90 + 90)/3
            = 90
 

That looks like too much lag.  The difference between the lag which
pgbench calculates and the expected one grows the earlier a lag happens.
I guess the reason my Linux box shows a bigger lag than Mac OS X is that
the first transaction, or early transactions, run more slowly than the
ones run later.

Of course this conclusion depends on the definition of pgbench's "average
rate limit lag", so you might have another opinion.  However, the way
pgbench calculates the average lag is not what I expected, at least.
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
On 7/17/13 9:16 PM, Tatsuo Ishii wrote:
> Now suppose we have 3 transactions and each has following values:
>
> d(0) = 10
> d(1) = 20
> d(2) = 30
>
> t(0) = 100
> t(1) = 110
> t(2) = 120
>
> That says pgbench expects the duration 10 for each
> transaction. Actually, the first transaction runs slowly for some
> reason and the lag = 100 - 10 = 90. However, tx(1) and tx(2) are
> finished on schedule because they spend only 10 (110-100 = 10, 120-110
> = 10). So the expected average lag would be 90/3 = 30.

The clients are not serialized here in any significant way, even when 
they shared a single process/thread.  I did many rounds of tracing 
through this code with timestamps on each line, and the sequence of 
events here will look like this:

client 0:  send "SELECT..." to server.  yield to next client.
client 1:  send "SELECT..." to server.  yield to next client.
client 2:  send "SELECT..." to server.  yield to next client.
select():  wait for the first response from any client.
client 0:  receive response.  complete transaction, compute lag.
client 1:  receive response.  complete transaction, compute lag.
client 2:  receive response.  complete transaction, compute lag.

There is nothing here that is queuing the clients one after the other. 
If (0) takes 100ms before its reply comes back, (1) and (2) can receive 
their reply back and continue forward at any time.  They are not waiting 
for (0); it has yielded control while waiting for a response.  All three 
times are independent once you reach the select() point where all are 
active.

In this situation, if the server gets stuck doing something such that it 
takes 100ms before any client receives a response, it is correct to 
penalize every client for that latency.  All three clients could have 
received the information earlier if the server had any to send them.  If 
they did not, they all were suffering from some sort of lag.
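
Schematically, the loop looks like this.  It's a stripped-down sketch of 
the async libpq pattern with error handling omitted and invented names 
("run_once")--not pgbench's actual state machine:

    #include <stdlib.h>
    #include <sys/select.h>
    #include <libpq-fe.h>

    static void
    run_once(PGconn *conns[], int nclients)
    {
        int        *done = calloc(nclients, sizeof(int));
        int         pending = nclients, i;

        for (i = 0; i < nclients; i++)
            PQsendQuery(conns[i], "SELECT 1;");     /* send, then yield */

        while (pending > 0)
        {
            fd_set      rfds;
            int         maxfd = -1;

            FD_ZERO(&rfds);
            for (i = 0; i < nclients; i++)
                if (!done[i])
                {
                    FD_SET(PQsocket(conns[i]), &rfds);
                    if (PQsocket(conns[i]) > maxfd)
                        maxfd = PQsocket(conns[i]);
                }

            /* wait for the first response from any client */
            select(maxfd + 1, &rfds, NULL, NULL, NULL);

            for (i = 0; i < nclients; i++)
            {
                if (done[i] || !FD_ISSET(PQsocket(conns[i]), &rfds))
                    continue;
                PQconsumeInput(conns[i]);
                while (!done[i] && !PQisBusy(conns[i]))
                {
                    PGresult   *res = PQgetResult(conns[i]);

                    if (res == NULL)        /* this client is finished */
                    {
                        done[i] = 1;
                        pending--;
                    }
                    else
                        PQclear(res);
                }
            }
        }
        free(done);
    }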

I'm not even sure why you spaced the start times out at intervals of 10. 
If I were constructing an example like this, I'd have them start at 
times of 0, 1, 2--as fast as the CPU can fire off statements 
basically--and then start waiting from that point.  If client 1 takes 10 
units of time to send its query out before client 2 runs, and the rate 
goal requires 10 units of time, the rate you're asking for is impossible.

For sorting out what's going on with your two systems, I would recommend 
turning on debugging output with "-d" and looking at the new 
per-transaction latency numbers that the feature reports.  If your 
theory that the lag is going up as the test proceeds is true, that 
should show up in the individual latency numbers too.

Based on what I saw during weeks of testing here, I would be more 
suspicious that there's a system level difference between your two 
servers than to blame the latency calculation.  I saw a *lot* of weird 
system issues myself when I started looking that carefully at sustained 
throughput.  The latency reports from the perspective of Fabien's code 
were always reasonable though.  When something delays every client, it 
counts that against every active client's lag, and that's the right 
thing to do.

-- 
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Tatsuo Ishii
Date:
> On 7/17/13 9:16 PM, Tatsuo Ishii wrote:
>> Now suppose we have 3 transactions and each has following values:
>>
>> d(0) = 10
>> d(1) = 20
>> d(2) = 30
>>
>> t(0) = 100
>> t(1) = 110
>> t(2) = 120
>>
>> That says pgbench expects the duration 10 for each
>> transaction. Actually, the first transaction runs slowly for some
>> reason and the lag = 100 - 10 = 90. However, tx(1) and tx(2) are
>> finished on schedule because they spend only 10 (110-10 = 10, 120-110
>> = 10). So the expected average lag would be 90/3 = 30.
> 
> The clients are not serialized here in any significant way, even when
> they shared a single process/thread.  I did many rounds of tracing
> through this code with timestamps on each line, and the sequence of
> events here will look like this:

My example is for the 1 client case, so concurrent clients are not the
issue here.
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
On 7/17/13 11:34 PM, Tatsuo Ishii wrote:
> My example is for 1 client case. So concurrent clients are not the
> issue here.

Sorry about that, with your clarification I see what you were trying to 
explain now.  The code initializes the target time like this:

thread->throttle_trigger = INSTR_TIME_GET_MICROSEC(start);

And then each time a transaction fires, it advances the reference time 
forward based on the expected rate:

thread->throttle_trigger += wait;

It does *not* reset thread->throttle_trigger based on when the previous 
transaction ended / when the next transaction started.  If the goal is 
10us transaction times, it beats a steady drum saying the transactions 
should come at 10us, 20us, 30us (on average--there's some randomness in 
the goals).  It does not pay any attention to when the previous 
transactions finished.

That means that if an early transaction takes an extra 1000us, every 
transaction after that will also show as 1000us late--even if all of 
them take 10us.  You expect that those later transactions will show 0 
lag, since they took the right duration.  For that to happen, 
thread->throttle_trigger would need to be re-initialized with the 
current time at the end of each completed transaction.
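
A toy comparison of the two definitions, with one early 1000us stall and 
otherwise perfect transactions (invented numbers, only to show how far 
the two readings diverge):

    #include <stdio.h>

    int
    main(void)
    {
        const long  interval = 10, n = 5;   /* schedule: every 10 us */
        long        trigger = 0, finish = 0, prev_finish = 0;
        long        anchored = 0, reanchored = 0;
        long        i;

        for (i = 0; i < n; i++)
        {
            long        work = (i == 0) ? 1010 : 10;    /* first one stalls */
            long        start;

            /* current behavior: the reference only ever moves forward */
            trigger += interval;
            start = (finish > trigger) ? finish : trigger;
            anchored += start - trigger;

            finish = start + work;

            /* alternative: measure against the previous finish instead */
            if (finish > prev_finish + interval)
                reanchored += finish - (prev_finish + interval);
            prev_finish = finish;
        }
        /* prints 4000 us (avg 800) vs 1010 us (avg 202) */
        printf("schedule-anchored lag: %ld us (avg %ld)\n",
               anchored, anchored / n);
        printf("re-anchored lag:       %ld us (avg %ld)\n",
               reanchored, reanchored / n);
        return 0;
    }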

The lag computation was not the interesting part of this feature to me. 
As I said before, I considered it more of a debugging level thing than 
a number people would analyze as much as you did.  I understand why you 
don't like it though.  If the reference time was moved forward to match 
the transaction end each time, I think that would give the lag 
definition you're looking for.  That's fine to me too, if Fabien doesn't 
have a good reason to reject the idea.  We would need to make sure that 
doesn't break some part of the design too.

-- 
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Tatsuo Ishii
Date:
> Sorry about that, with your clarification I see what you were trying
> to explain now.  The code initializes the target time like this:
> 
> thread->throttle_trigger = INSTR_TIME_GET_MICROSEC(start);
> 
> And then each time a transaction fires, it advances the reference time
> forward based on the expected rate:
> 
> thread->throttle_trigger += wait;
> 
> It does *not* reset thread->throttle_trigger based on when the
> previous transaction ended / when the next transaction started.  If
> the goal is 10us transaction times, it beats a steady drum saying the
> transactions should come at 10us, 20us, 30us (on average--there's some
> randomness in the goals).  It does not pay any attention to when the
> previous transactions finished.
> 
> That means that if an early transaction takes an extra 1000us, every
> transaction after that will also show as 1000us late--even if all of
> them take 10us.  You expect that those later transactions will show 0
> lag, since they took the right duration.  For that to happen,
> thread->throttle_trigger would need to be re-initialized with the
> current time at the end of each completed transaction.

Yes, that's exactly what I understand from the code.

> The lag computation was not the interesting part of this feature to
> me.  As I said before, I considered it more of a debugging level thing
> than a number people would analyze as much as you did.  I understand
> why you don't like it though.  If the reference time was moved forward
> to match the transaction end each time, I think that would give the
> lag definition you're looking for.  That's fine to me too, if Fabien
> doesn't have a good reason to reject the idea.  We would need to make
> sure that doesn't break some part of the design too.

I would like to hear from Fabien about the issue too.
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Tatsuo Ishii
Date:
>> Sorry about that, with your clarification I see what you were trying
>> to explain now.  The code initializes the target time like this:
>> 
>> thread->throttle_trigger = INSTR_TIME_GET_MICROSEC(start);
>> 
>> And then each time a transaction fires, it advances the reference time
>> forward based on the expected rate:
>> 
>> thread->throttle_trigger += wait;
>> 
>> It does *not* reset thread->throttle_trigger based on when the
>> previous transaction ended / when the next transaction started.  If
>> the goal is 10us transaction times, it beats a steady drum saying the
>> transactions should come at 10us, 20us, 30us (on average--there's some
>> randomness in the goals).  It does not pay any attention to when the
>> previous transactions finished.
>> 
>> That means that if an early transaction takes an extra 1000us, every
>> transaction after that will also show as 1000us late--even if all of
>> them take 10us.  You expect that those later transactions will show 0
>> lag, since they took the right duration.  For that to happen,
>> thread->throttle_trigger would need to be re-initialized with the
>> current time at the end of each completed transaction.
> 
> Yes, that's exactly what I understand from the code.
> 
>> The lag computation was not the interesting part of this feature to
>> me.  As I said before, I considered it more of a debugging level thing
>> than a number people would analyze as much as you did.  I understand
>> why you don't like it though.  If the reference time was moved forward
>> to match the transaction end each time, I think that would give the
>> lag definition you're looking for.  That's fine to me too, if Fabien
>> doesn't have a good reason to reject the idea.  We would need to make
>> sure that doesn't break some part of the design too.
> 
> I would like to hear from Fabien about the issue too.

For your information, included is a patch against git master head that
implements the lag in the way I proposed. With the patch, I get more
consistent numbers on Linux (and Mac OS X).
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp
diff --git a/contrib/pgbench/pgbench.c b/contrib/pgbench/pgbench.c
index 2ad8f0b..57e62dc 100644
--- a/contrib/pgbench/pgbench.c
+++ b/contrib/pgbench/pgbench.c
@@ -137,6 +137,12 @@ int            unlogged_tables = 0;
 double        sample_rate = 0.0;
 
 /*
+ * When threads are throttled to a given rate limit, this is the target delay
+ * to reach that rate in usec.  0 is the default and means no throttling.
+ */
+int64        throttle_delay = 0;
+
+/*
  * tablespace selection
  */
 char       *tablespace = NULL;
@@ -202,11 +208,15 @@ typedef struct
     int            listen;            /* 0 indicates that an async query has been
                                      * sent */
     int            sleeping;        /* 1 indicates that the client is napping */
+    bool        throttling;        /* whether nap is for throttling */
     int64        until;            /* napping until (usec) */
+    int64        wait;            /* randomly generated delay (usec) */
     Variable   *variables;        /* array of variable definitions */
     int            nvariables;
     instr_time    txn_begin;        /* used for measuring transaction latencies */
+    instr_time    txn_begin_throttle;    /* tx start time used when transaction throttling enabled */
     instr_time    stmt_begin;        /* used for measuring statement latencies */
+    bool        is_throttled;    /* whether transaction throttling is done */
     int            use_file;        /* index in sql_files for this client */
     bool        prepared[MAX_FILES];
 } CState;
@@ -224,6 +234,9 @@ typedef struct
     instr_time *exec_elapsed;    /* time spent executing cmds (per Command) */
     int           *exec_count;        /* number of cmd executions (per Command) */
     unsigned short random_state[3];        /* separate randomness for each thread */
+    int64        throttle_trigger;    /* previous/next throttling (us) */
+    int64        throttle_lag;        /* total transaction lag behind throttling */
+    int64        throttle_lag_max;    /* max transaction lag */
 } TState;
 
 #define INVALID_THREAD        ((pthread_t) 0)
@@ -232,6 +245,8 @@ typedef struct
 {
     instr_time    conn_time;
     int            xacts;
+    int64        throttle_lag;
+    int64        throttle_lag_max;
 } TResult;
 
 /*
@@ -356,6 +371,7 @@ usage(void)
            "  -N, --skip-some-updates  skip updates of pgbench_tellers and pgbench_branches\n"
            "  -P, --progress=NUM       show thread progress report every NUM seconds\n"
            "  -r, --report-latencies   report average latency per command\n"
+           "  -R, --rate=SPEC          target rate in transactions per second\n"
            "  -s, --scale=NUM          report this scale factor in output\n"
            "  -S, --select-only        perform SELECT-only transactions\n"
            "  -t, --transactions       number of transactions each client runs "
@@ -898,19 +914,80 @@ doCustom(TState *thread, CState *st, instr_time *conn_time, FILE *logfile, AggVa
 {
     PGresult   *res;
     Command   **commands;
+    bool        trans_needs_throttle = false;
 
 top:
     commands = sql_files[st->use_file];
 
+    /*
+     * Handle throttling once per transaction by sleeping.  It is simpler
+     * to do this here rather than at the end, because so much complicated
+     * logic happens below when statements finish.
+     */
+    if (throttle_delay && ! st->is_throttled)
+    {
+        /*
+         * Use inverse transform sampling to randomly generate a delay, such
+         * that the series of delays will approximate a Poisson distribution
+         * centered on the throttle_delay time.
+         *
+         * 1000 implies a 6.9 (-log(1/1000)) to 0.0 (log 1.0) delay multiplier.
+         *
+         * If transactions are too slow or a given wait is shorter than
+         * a transaction, the next transaction will start right away.
+         */
+        int64        wait = (int64)
+            throttle_delay * -log(getrand(thread, 1, 1000)/1000.0);
+
+        thread->throttle_trigger += wait;
+
+        st->until = thread->throttle_trigger;
+        st->wait = wait;
+        st->sleeping = 1;
+        st->throttling = true;
+        st->is_throttled = true;
+        if (debug)
+            fprintf(stderr, "client %d throttling "INT64_FORMAT" us\n",
+                    st->id, wait);
+    }
+
     if (st->sleeping)
     {                            /* are we sleeping? */
         instr_time    now;
+        int64        now_us;
+        int64        start_us;
 
         INSTR_TIME_SET_CURRENT(now);
-        if (st->until <= INSTR_TIME_GET_MICROSEC(now))
+        now_us = INSTR_TIME_GET_MICROSEC(now);
+        if (st->until <= now_us)
+        {
             st->sleeping = 0;    /* Done sleeping, go ahead with next command */
+            start_us = INSTR_TIME_GET_MICROSEC(st->txn_begin_throttle);
+            if (start_us <= 0)
+                start_us = INSTR_TIME_GET_MICROSEC(thread->start_time);
+
+            if (st->throttling)
+            {
+                /* Measure lag of throttled transaction relative to target */
+                int64        lag = now_us - start_us - st->wait;
+
+                if (debug)
+                    fprintf(stderr, "stmt_begin: "INT64_FORMAT" now_us: "INT64_FORMAT" wait:"INT64_FORMAT" until:"INT64_FORMAT" lag:"INT64_FORMAT"\n",
+                            start_us, now_us, st->wait, st->until, lag);
+
+                thread->throttle_lag += lag;
+                if (lag > thread->throttle_lag_max)
+                    thread->throttle_lag_max = lag;
+                st->throttling = false;
+            }
+        }
         else
+        {
+            if (debug)
+                fprintf(stderr, "still sleeping\n");
             return true;        /* Still sleeping, nothing to do here */
+        }
     }
 
     if (st->listen)
@@ -1095,6 +1172,15 @@ top:
             st->state = 0;
             st->use_file = (int) getrand(thread, 0, num_files - 1);
             commands = sql_files[st->use_file];
+            st->is_throttled = false;
+            /*
+             * No transaction is underway anymore, which means there is nothing
+             * to listen to right now.  When throttling rate limits are active,
+             * a sleep will happen next, as the next transaction starts.  And
+             * then in any case the next SQL command will set listen back to 1.
+             */
+            st->listen = 0;
+            trans_needs_throttle = (throttle_delay>0);
         }
     }
 
@@ -1113,6 +1199,16 @@ top:
         INSTR_TIME_ACCUM_DIFF(*conn_time, end, start);
     }
 
+    /*
+     * This ensures that a throttling delay is inserted before proceeding
+     * with sql commands, after the first transaction. The first transaction
+     * throttling is performed when first entering doCustom.
+     */
+    if (trans_needs_throttle) {
+        trans_needs_throttle = false;
+        goto top;
+    }
+
     /* Record transaction start time if logging is enabled */
     if (logfile && st->state == 0)
         INSTR_TIME_SET_CURRENT(st->txn_begin);
@@ -1121,6 +1217,9 @@ top:
     if (is_latencies)
         INSTR_TIME_SET_CURRENT(st->stmt_begin);
 
+    if (throttle_delay)
+        INSTR_TIME_SET_CURRENT(st->txn_begin_throttle);
+
     if (commands[st->state]->type == SQL_COMMAND)
     {
         const Command *command = commands[st->state];
@@ -2017,7 +2116,8 @@ process_builtin(char *tb)
 
 static void
 printResults(int ttype, int normal_xacts, int nclients,
              TState *threads, int nthreads,
-             instr_time total_time, instr_time conn_total_time)
+             instr_time total_time, instr_time conn_total_time,
+             int64 throttle_lag, int64 throttle_lag_max)
 {
     double        time_include,
                 tps_include,
@@ -2055,6 +2155,19 @@ printResults(int ttype, int normal_xacts, int nclients,
         printf("number of transactions actually processed: %d\n",
                normal_xacts);
     }
+
+    if (throttle_delay)
+    {
+        /*
+         * Report average transaction lag under rate limit throttling.  This
+         * is the delay between scheduled and actual start times for the
+         * transaction.  The measured lag may be caused by thread/client load,
+         * the database load, or the Poisson throttling process.
+         */
+        printf("average rate limit lag: %.3f ms (max %.3f ms)\n",
+               0.001 * throttle_lag / normal_xacts, 0.001 * throttle_lag_max);
+    }
+
     printf("tps = %f (including connections establishing)\n", tps_include);
     printf("tps = %f (excluding connections establishing)\n", tps_exclude);
@@ -2140,6 +2253,7 @@ main(int argc, char **argv)
         {"unlogged-tables", no_argument, &unlogged_tables, 1},
         {"sampling-rate", required_argument, NULL, 4},
         {"aggregate-interval", required_argument, NULL, 5},
+        {"rate", required_argument, NULL, 'R'},
         {NULL, 0, NULL, 0}
     };
@@ -2162,6 +2276,8 @@ main(int argc, char **argv)
     instr_time    total_time;
     instr_time    conn_total_time;
     int            total_xacts;
+    int64        throttle_lag = 0;
+    int64        throttle_lag_max = 0;
     int            i;
@@ -2206,7 +2322,7 @@ main(int argc, char **argv)
     state = (CState *) pg_malloc(sizeof(CState));
     memset(state, 0, sizeof(CState));
 
-    while ((c = getopt_long(argc, argv, "ih:nvp:dqSNc:j:Crs:t:T:U:lf:D:F:M:P:", long_options, &optindex)) != -1)
+    while ((c = getopt_long(argc, argv, "ih:nvp:dqSNc:j:Crs:t:T:U:lf:D:F:M:P:R:", long_options, &optindex)) != -1)
     {
         switch (c)
         {
@@ -2371,6 +2487,19 @@ main(int argc, char **argv)
                     exit(1);
                 }
                 break;
+            case 'R':
+            {
+                /* get a double from the beginning of option value */
+                double        throttle_value = atof(optarg);
+
+                if (throttle_value <= 0.0)
+                {
+                    fprintf(stderr, "invalid rate limit: %s\n", optarg);
+                    exit(1);
+                }
+                /* Invert rate limit into a time offset */
+                throttle_delay = (int64) (1000000.0 / throttle_value);
+            }
+                break;
             case 0:
                 /* This covers long options which take no argument. */
                 break;
@@ -2408,6 +2537,9 @@ main(int argc, char **argv)
         }
     }
 
+    /* compute a per thread delay */
+    throttle_delay *= nthreads;
+
     if (argc > optind)
         dbName = argv[optind];
     else
@@ -2721,6 +2853,9 @@ main(int argc, char **argv)
             TResult    *r = (TResult *) ret;
 
             total_xacts += r->xacts;
+            throttle_lag += r->throttle_lag;
+            if (r->throttle_lag_max > throttle_lag_max)
+                throttle_lag_max = r->throttle_lag_max;
             INSTR_TIME_ADD(conn_total_time, r->conn_time);
             free(ret);
         }
@@ -2731,7 +2866,7 @@ main(int argc, char **argv)
     INSTR_TIME_SET_CURRENT(total_time);
     INSTR_TIME_SUBTRACT(total_time, start_time);
     printResults(ttype, total_xacts, nclients, threads, nthreads,
-                 total_time, conn_total_time);
+                 total_time, conn_total_time, throttle_lag, throttle_lag_max);
 
     return 0;
 }
@@ -2756,6 +2891,17 @@ threadRun(void *arg)
     AggVals        aggs;
 
+    /*
+     * Initialize throttling rate target for all of the thread's clients.  It
+     * might be a little more accurate to reset thread->start_time here too.
+     * The possible drift seems too small relative to typical throttle delay
+     * times to worry about it.
+     */
+    INSTR_TIME_SET_CURRENT(start);
+    thread->throttle_trigger = INSTR_TIME_GET_MICROSEC(start);
+    thread->throttle_lag = 0;
+    thread->throttle_lag_max = 0;
+
     result = pg_malloc(sizeof(TResult));
     INSTR_TIME_SET_ZERO(result->conn_time);
@@ -2831,25 +2977,38 @@ threadRun(void *arg)
             Command   **commands = sql_files[st->use_file];
             int            sock;
 
-            if (st->sleeping)
+            if (st->con == NULL)
             {
-                int            this_usec;
-
-                if (min_usec == INT64_MAX)
+                continue;
+            }
+            else if (st->sleeping)
+            {
+                if (st->throttling && timer_exceeded)
                 {
-                    instr_time    now;
-
-                    INSTR_TIME_SET_CURRENT(now);
-                    now_usec = INSTR_TIME_GET_MICROSEC(now);
+                    /* interrupt client which has not started a transaction */
+                    remains--;
+                    st->sleeping = 0;
+                    st->throttling = false;
+                    PQfinish(st->con);
+                    st->con = NULL;
+                    continue;
                 }
+                else /* just a nap from the script */
+                {
+                    int            this_usec;
 
-                this_usec = st->until - now_usec;
-                if (min_usec > this_usec)
-                    min_usec = this_usec;
-            }
-            else if (st->con == NULL)
-            {
-                continue;
+                    if (min_usec == INT64_MAX)
+                    {
+                        instr_time    now;
+
+                        INSTR_TIME_SET_CURRENT(now);
+                        now_usec = INSTR_TIME_GET_MICROSEC(now);
+                    }
+
+                    this_usec = st->until - now_usec;
+                    if (min_usec > this_usec)
+                        min_usec = this_usec;
+                }
             }
             else if (commands[st->state]->type == META_COMMAND)
             {
@@ -2986,6 +3145,8 @@ done:
     result->xacts = 0;
     for (i = 0; i < nstate; i++)
         result->xacts += state[i].cnt;
+    result->throttle_lag = thread->throttle_lag;
+    result->throttle_lag_max = thread->throttle_lag_max;
     INSTR_TIME_SET_CURRENT(end);
     INSTR_TIME_ACCUM_DIFF(result->conn_time, end, start);
     if (logfile)
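
As an aside, the inverse transform sampling described in the comment
above can be exercised on its own. A minimal sketch, assuming a uniform
integer source over 1..1000 and a hypothetical 400 tps target split
across 4 threads (plain C, not pgbench code; compile with -lm):

#include <math.h>
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>

int main(void)
{
    double  rate = 400.0;       /* hypothetical target transactions/second */
    int     nthreads = 4;       /* hypothetical thread count */
    /* invert the rate into a delay, then scale per thread as the patch does */
    int64_t throttle_delay = (int64_t) (1000000.0 / rate) * nthreads;
    double  sum = 0.0;
    int     n = 100000;

    srand(42);
    for (int i = 0; i < n; i++)
    {
        /* u in (0, 1]; -log(u) spans 6.9 down to 0.0, with mean about 1 */
        double  u = (rand() % 1000 + 1) / 1000.0;
        sum += throttle_delay * -log(u);
    }
    /* the average wait approximates throttle_delay, here 10000 us */
    printf("target %ld us, observed mean %.0f us\n",
           (long) throttle_delay, sum / n);
    return 0;
}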

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Hello Tatsuo,

> I think I'm starting to understand what's going on.  Suppose there are
> n transactions to be issued by pgbench, and it decides each schedule d(0),
> d(1)... d(n). Actually the schedule d(i) (which is stored in
> st->until) is decided by the following code:
>
>         int64 wait = (int64)
>             throttle_delay * -log(getrand(thread, 1, 1000)/1000.0);
>         thread->throttle_trigger += wait;
>         st->until = thread->throttle_trigger;

Yep. Let us say d(i) is the target starting time for transaction i, that 
is "throttle_trigger" above.

> st->until represents the time by which a transaction is to be finished.
> Now transaction i finishes at t(i).

No, it is the time for the **start** of the transaction. The client is 
sleeping "until" this time. We can only try to control the beginning of 
the transaction. It ends when it ends!

> So the lag l(i) = t(i) -d(i) if the transaction is behind.

Transaction i "lags behind" if it *starts* later that d(i). If it start 
effectively at t(i), t(i)>=d(i), lag l(i) = t(i)-d(i). When it completes 
is not the problem of the scheduler.

> Then the next transaction i+1 begins. The lag l(i+1) = t(i+1) - d(i+1),
> and so on. At the end of the run, pgbench shows the average lag as
> sum(l(0)...l(n))/n.

Yes.

> Now suppose we have 3 transactions and each has following values:
>
> d(0) = 10
> d(1) = 20
> d(2) = 30
>
> t(0) = 100
> t(1) = 110
> t(2) = 120
>
> That says pgbench expects the duration 10 for each transaction.

pgbench does not expect any duration, but your proposed scheduling d(i) 
cannot be followed if the duration is more than 10.

With your figures above, with d(i) the expected start time and t(i) the 
actual start time, for some reason pgbench was not around to start the 
transaction before time 100 (maybe the OS switched the process off to 
attend to other stuff) although it should have started at 10, so l(0) = 
90. Then the second transaction starts readily at 110, but was 
nevertheless expected at 20: 90 lag again. Same for the last one. All 
transactions started 90 units after their scheduled time, the cumulative 
lag is 270, and the average lag is 90.

If I take another example:
 - Scheduled start times d(0 .. 3) = 0 20 40 60
 - Durations D(0 .. 3) = 15 25 50 10
 - Actual start times for the transactions:
     t(0) = 3 (it is late by 3 for some reason), completes by 18
     t(1) = t(0) + D(0) + some more lag for some reason = 21, completes by 46
     t(2) = t(1) + D(1) + no additional lag here = 46, completes by 96
     t(3) = t(2) + D(2) + some more lag for some reason = 97, completes by 107

The lags are l(0 .. 3) = 3-0, 21-20, 46-40, 97-60, i.e. 3, 1, 6, 37.

Total lag is 3 + 1 + 6 + 37 = 47.

Average lag = 47/4 = 11.75.

In this example, some of the lag is due to the process (3 at the 
beginning, 1 on the second transaction), and the rest is due to 
transaction durations that impact the following transactions.
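
The arithmetic can be checked mechanically with a throwaway snippet (not 
part of the patch):

#include <stdio.h>

int main(void)
{
    int d[] = {0, 20, 40, 60};  /* scheduled start times */
    int t[] = {3, 21, 46, 97};  /* actual start times */
    int total = 0;

    for (int i = 0; i < 4; i++)
        total += t[i] - d[i];   /* l(i) = t(i) - d(i) */

    /* prints: total lag 47, average 11.75 */
    printf("total lag %d, average %.2f\n", total, total / 4.0);
    return 0;
}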

> However actually pgbench calculates like this:
>
> average lag = (t(0)-d(0) + t(1)-d(1) + t(2)-d(2))/3
>            = (100-10 + 110-20 + 120-30)/3
>            = (90 + 90 + 90)/3
>            = 90

Yes, this is correct.

> Looks like too much lag is calculated. The difference between the lag
> that pgbench calculates and the expected one grows if a lag happens
> earlier. I guess the reason my Linux box shows a bigger lag than Mac OS
> X is that the first transaction or early transactions run more slowly
> than the ones run later.

Possibly.

> Of course this conclusion depends on the definition of the "average
> rate limit lag" of pgbench. So you might have another opinion. However,
> the way pgbench calculates the average lag is not what I expected, at
> least.

Indeed, I think that it really depends on your definition of lag. The lag 
I defined is the time between the scheduled transaction start time and the 
actual transaction start time. This is a measure of how well pgbench is 
able to follow the stochastic process; if pgbench is constantly late then 
the lag accumulates a lot, but that basically means there are not enough 
(CPU) resources to run pgbench cleanly.

What you seem to expect is the average transaction latency. This is also a 
useful measure, and I'm planning to add a clean measure of it under 
throttling, and also with --progress, as the current computation based on 
tps is not meaningful under throttling.

But that is a plan for the next commitfest!

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Hello Greg,

> The lag computation was not the interesting part of this feature to me.  As I 
> said before, I considered it more of a debugging level thing than a number 
> people would analyze as much as you did.  I understand why you don't like it 
> though.  If the reference time was moved forward to match the transaction end 
> each time, I think that would give the lag definition you're looking for. 
> That's fine to me too, if Fabien doesn't have a good reason to reject the 
> idea.  We would need to make sure that doesn't break some part of the design 
> too.

I really think that the information currently computed is useful. First, 
as you note, for debugging: not really debugging the throttling feature, 
which works fine, but being able to debug performance if something goes 
wrong while running a benchmark. Another reason it is useful is that, from 
a client perspective, it measures whether the database system is coping 
with the load without incurring additional delays, i.e. without processing 
client requests (say from a web server) far behind their actual (that is, 
scheduled) occurrences.

So my recommendation is: please keep this measure as it is, and if you 
want the other lag measure, why not add it as well, next to the existing 
one?

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Tatsuo Ishii
Date:
Fabien,

> Hello again Tatsuo,
> 
>> For your information, included is a patch against git master head that
>> implements the lag measurement in the way I proposed. With the patch, I
>> get more consistent numbers on Linux (and Mac OS X).
> 
> I must disagree with your proposal: at the least, it does not provide
> the information I want, but different information.
> 
> ISTM that this patch measures the lag that is due to the pgbench
> thread coming around to deal with a transaction after sleeping. I
> would expect that to be quite small most of the time, so I agree that
> it must be reassuringly consistent.
> 
> However, it does not provide the information I want, which is a
> measure of the health of pgbench with respect to the stochastic
> process that schedules transactions.
> 
> Basically, you propose a partial lag measure, which will be smaller,
> but which does not tell whether pgbench is able to follow the
> schedule, which is the information I find useful in this context for
> judging whether the throttling is going well.

I don't care what kind of measurement pgbench provides. However, as a
committer, I *do* care that what the measurement means is clearly
described in the docs. I think the current measurement method will cause
plenty of confusion if it's not accompanied by a detailed explanation.
Could you please provide a doc update?
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Hello Tatsuo

> I think the current measurement method will cause plenty of confusion
> if it's not accompanied by a detailed explanation. Could you please
> provide a doc update?

Please find a v17 proposal with updated and extended documentation, 
focused on clarifying the lag measure and its implications, and taking 
into account the recent discussion on the list with you & Greg.

However, I'm not a native English speaker; if you find that some parts are 
not clear enough, please tell me what can be improved further.

-- 
Fabien.

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Hello again Tatsuo,

> For your information, included is a patch against git master head that
> implements the lag measurement in the way I proposed. With the patch, I
> get more consistent numbers on Linux (and Mac OS X).

I must disagree with your proposal: at the least, it does not provide the 
information I want, but different information.

ISTM that this patch measures the lag that is due to the pgbench thread 
coming around to deal with a transaction after sleeping. I would expect 
that to be quite small most of the time, so I agree that it must be 
reassuringly consistent.

However, it does not provide the information I want, which is a measure 
of the health of pgbench with respect to the stochastic process that 
schedules transactions.

Basically, you propose a partial lag measure, which will be smaller, but 
which does not tell whether pgbench is able to follow the schedule, which 
is the information I find useful in this context for judging whether the 
throttling is going well.
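
To spell out the two quantities side by side, here is a minimal sketch; 
the helper names are illustrative, not pgbench identifiers:

#include <stdint.h>
#include <stdio.h>

/* distance from the precomputed Poisson timeline (the measure I want) */
static int64_t
schedule_lag(int64_t scheduled_start, int64_t actual_start)
{
    return actual_start - scheduled_start;
}

/* how late the thread services an individual nap (your patch's measure) */
static int64_t
dispatch_lag(int64_t nap_start, int64_t wait, int64_t actual_wakeup)
{
    return actual_wakeup - (nap_start + wait);
}

int main(void)
{
    /* a client stalled 1000 us earlier; this nap itself is serviced on time */
    printf("schedule lag: %ld us\n", (long) schedule_lag(2000, 3000));     /* 1000 */
    printf("dispatch lag: %ld us\n", (long) dispatch_lag(3000, 10, 3010)); /* 0 */
    return 0;
}

After a one-off stall, the dispatch lag drops back to zero while the 
schedule lag persists until the client catches up with the schedule.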

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Tatsuo Ishii
Date:
> Hello Tatsuo
> 
>> I think the current measurement method will cause plenty of confusion
>> if it's not accompanied by a detailed explanation. Could you please
>> provide a doc update?
> 
> Please find a v17 proposal with updated and extended documentation,
> focused on clarifying the lag measure and its implications, and taking
> into account the recent discussion on the list with you & Greg.

Thanks!

> However, I'm not a native English speaker; if you find that some parts
> are not clear enough, please tell me what can be improved further.

I'm not a native English speaker either... Greg, could you please
review the document?
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
On 7/18/13 6:45 PM, Tatsuo Ishii wrote:
> I'm not a native English speaker either... Greg, could you please
> review the document?

Yes, I already took a look at it briefly.  The updates move in the 
right direction, but I can edit them usefully before commit.  I'll have 
that done by tomorrow and send out a new version.  I'm hopeful that v18 
will finally be the one that everyone likes.

-- 
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
>> I'm not a native English speaker either... Greg, could you please 
>> review the document?
>
> Yes, I already took a look at it briefly.  The updates move in the right 
> direction, but I can edit them usefully before commit.

Great, thanks for your help!

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Tatsuo Ishii
Date:
Greg,

> Yes, I already took a look at it briefly.  The updates move in the
> right direction, but I can edit them usefully before commit.  I'll
> have that done by tomorrow and send out a new version.  I'm hopeful
> that v18 will finally be the one that everyone likes.

Have you done it?
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
Attached is an update that I think sorts out all of the documentation
concerns.  I also broke this section into paragraphs, now that it's
getting so long.

The only code change is that this now labels the controversial lag here
"average rate limit schedule lag".  That way, if someone wants to
introduce other measures of rate limit lag, like a more transaction-
oriented one, it could be called "average rate limit transaction lag"
to tell the two apart.

The rewritten documentation here tries to communicate that there is a
schedule that acts as if it were pre-computed at the start of each client.
It is never adjusted based on what individual transactions do.  I also
noted the way this can cause schedule lag for some time after a slow
transaction finishes, since that's the main issue observed so far.

--
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com

Attachment

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Hello Greg,

Thanks for the improvement!


I have a small reservation about "finish/end time schedule" in the second 
paragraph, or maybe there is something that I do not understand. There is 
no schedule for finishing anything, only start times are scheduled, so I 
wish the text could avoid suggesting that finish times are scheduled.

> The rate is targeted by starting transactions along a 
> Poisson-distributed schedule time line.  The expected

> finish time schedule

-> start time schedule

> moves forward based on when the client first started, not when 
> the previous transaction ended.


> That approach means that when transactions go past their original 
> scheduled end time, it is possible for later ones to catch up again.

-> That approach means that long transactions can cause later 
transactions to be late with respect to the schedule, while short 
transactions make it possible for late ones to catch up again.

Would you be ok with that?

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Alvaro Herrera
Date:
Greg Smith wrote:

Thanks.  I didn't look at the code, but while trying to read the docs:

> +        <para>
> +         High rate limit schedule lag values, that is values not small with
> +         respect to the actual transaction latency, indicate that something is
> +         amiss in the throttling process.

I couldn't really parse the above.  Of the first six words, which one is
a verb?  Is there a noun that needs to be plural?  Is there a word that
shouldn't be there?

... Oh, I think it makes sense if I assume that "rate limit schedule lag"
is a single concept .. but if so, that phrase seems too many words for it.
(So when the RLSL values are high, this indicates a problem.  Is that
what the above means?)

Also, it took me a while to understand what "values not small" means.  I
think there must be a way to phrase this that's easier to understand.

>                                            High lag can highlight a subtle
> +         problem there even if the target rate limit is met in the end.  One
> +         possible cause of schedule lage is insufficient pgbench threads to
> +         handle all of the clients.

typo "lage" above.

>                                       To improve that, consider reducing the
> +         number of clients, increasing the number of threads in pgbench, or
> +         running pgbench on a separate host.  Another possibility is that the
> +         database is not keeping up with the load at some point.  When that
> +         happens, you will have to reduce the expected transaction rate to
> +         lower schedule lag.
> +        </para>

Thanks

-- 
Álvaro Herrera                http://www.2ndQuadrant.com/
PostgreSQL Development, 24x7 Support, Training & Services



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Fabien COELHO
Date:
Hello Alvaro,

> Thanks.  I didn't look at the code, but while trying to read the docs:
>
>> +        <para>
>> +         High rate limit schedule lag values, that is values not small with
>> +         respect to the actual transaction latency, indicate that something is
>> +         amiss in the throttling process.
>
> I couldn't really parse the above.  Of the first six words, which one is
> a verb?

None. "High values for the time lag measured with respect to the <rate 
limit schedule>".

> Is there a noun that needs to be plural?  Is there a word that shouldn't 
> be there?

I do not think so.

> ... Oh, I think it makes sense if I assume that "rate limit schedule lag"
> is a single concept .. but if so, that phrase seems too many words for it.
> (So when the RLSL values are high, this indicates a problem.  Is that
> what the above means?)

Yep!

> Also, it took me a while to understand what "values not small" means.  I
> think there must be a way to phrase this that's easier to understand.

That's what we are trying to do, but we still need to be precise. With 
fewer words it seems more understandable, but previous versions showed 
that the meaning was ambiguous: people applied their own intuitive 
definition of lag, which resulted in surprises at the measured values and 
their cumulative behavior. The alternative was either to change what is 
measured (but I insisted that this measure is the useful one) or to try to 
reduce the ambiguity of the documentation, which is the result of efforts 
by Greg & myself that you are helping to debug :-)

>>                                            High lag can highlight a subtle
>> +         problem there even if the target rate limit is met in the end.

I'm fine with that, if it is clear from the context that the lag we're 
talking about is the one defined in the preceding paragraph. Greg, what 
do you think?

>> + One possible cause of schedule lage is insufficient pgbench threads 
>> to handle all of the clients.
>
> typo "lage" above.

Indeed.

>>                                       To improve that, consider reducing the
>> +         number of clients, increasing the number of threads in pgbench, or
>> +         running pgbench on a separate host.  Another possibility is that the
>> +         database is not keeping up with the load at some point.  When that
>> +         happens, you will have to reduce the expected transaction rate to
>> +         lower schedule lag.
>> +        </para>

Thanks for your help!

-- 
Fabien.



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Greg Smith
Date:
Very minor update with V19 here, to reflect Alvaro's comments.  The
tricky part now reads like this:

High rate limit schedule lag values, that is lag values that are large
compared to the actual transaction latency, indicate that something is
amiss in the throttling process.  High schedule lag can highlight a
subtle problem there even if the target rate limit is met in the end.

--
Greg Smith   2ndQuadrant US    greg@2ndQuadrant.com   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support www.2ndQuadrant.com

Attachment

Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
David Fetter
Date:
On Mon, Jul 22, 2013 at 01:49:39PM -0400, Greg Smith wrote:
> Very minor update with V19 here, to reflect Alvaro's comments.  The
> tricky part now reads like this:
> 
> High rate limit schedule lag values,

"High values of the rate limit schedule lag measurement?"

> that is lag values that are large compared to the actual transaction
> latency, indicate that something is amiss in the throttling process.
> High schedule lag can highlight a subtle problem there even if the
> target rate limit is met in the end.

Cheers,
David.
-- 
David Fetter <david@fetter.org> http://fetter.org/
Phone: +1 415 235 3778  AIM: dfetter666  Yahoo!: dfetter
Skype: davidfetter      XMPP: david.fetter@gmail.com
iCal: webcal://www.tripit.com/feed/ical/people/david74/tripit.ics

Remember to vote!
Consider donating to Postgres: http://www.postgresql.org/about/donate



Re: [PATCH] pgbench --throttle (submission 7 - with lag measurement)

From
Tatsuo Ishii
Date:
> Very minor update with V19 here, to reflect Alvaro's comments.  The
> tricky part now reads like this:
> 
> High rate limit schedule lag values, that is lag values that are large
> compared to the actual transaction latency, indicate that something is
> amiss in the throttling process.  High schedule lag can highlight a
> subtle problem there even if the target rate limit is met in the end.

I have committed this along with a slight modification. I changed
"--rate rate" to "--rate=rate" to follow pgbench's option style.

Also, I have removed a space in "--progress= sec" in the doc, which was
probably mistakenly added by a previous commit.
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp