Re: auto_explain WAS: RFC: Timing Events - Mailing list pgsql-hackers

From Jim Nasby
Subject Re: auto_explain WAS: RFC: Timing Events
Date
Msg-id 512AAFE7.3040904@nasby.net
In response to Re: auto_explain WAS: RFC: Timing Events  (Gavin Flower <GavinFlower@archidevsys.co.nz>)
Responses Re: auto_explain WAS: RFC: Timing Events  (Robert Haas <robertmhaas@gmail.com>)
List pgsql-hackers
Sorry for the late reply, but I think I can add some ideas here...

On 11/21/12 5:33 PM, Gavin Flower wrote:
> On 22/11/12 12:15, Greg Smith wrote:
>> On 11/8/12 2:16 PM, Josh Berkus wrote:
>>
>>> Also, logging only the long-running queries is less useful than people
>>> on this list seem to think.  When I'm doing real performance analysis, I
>>> need to see *everything* which was run, not just the slow stuff.  Often
>>> the real problem is a query which used to take 1.1ms, now takes 1.8ms,
>>> and gets run 400 times/second. Looking just at the slow queries won't
>>> tell you that.
>>
>> No argument here.  I've tried to be clear that some of these high-speed but lossy things I'm hacking on are not going to be useful for everyone.  A solution that found most of these 1.8ms queries, but dropped some percentage under the highest peak load conditions, would still be very useful to me.
>>
>> An anecdote on this topic seems relevant.  I have a troublesome production server that has moved log_min_duration_statement from 100ms to 500ms to 2000ms as the system grew.  Even the original setting wasn't short enough to catch everything we would like to watch *now*, but seeing sub-second data is a dream at this point.  The increases have been forced by logging contention becoming unmanageable when every client has to fight for the log to write to disk.  I can see the buggers stack up as waiting for locks if I try to log shorter statements, stalling enough that it drags the whole server down under peak load.
>>
>> If I could just turn off logging during those periods--basically, throwing them away only when some output-rate-throttled component hit its limit--I could still find them in the data collected the rest of the time.  There are some types of problems that also only occur under peak load that this idea would miss, but you'd still be likely to get *some* of them, statistically.
>>
>> There's a really hard trade-off here:
>>
>> -Sometimes you must save data on every query to capture fine details
>> -Making every query wait for disk I/O is impractical
>>
>> The sort of ideas you threw out for making things like auto-explain logging per-session can work.  The main limitation there though is that it presumes you even know what area the problem is in the first place.  I am not optimistic that covers a whole lot of ground either.
>>
>> Upthread I talked about a background process that persists shared memory queues as a future consumer of timing events--one that might consume slow query data too.  That is effectively acting as the ideal component I described above, one that only loses things when it exceeds the system's write capacity for saving them.  I wouldn't want to try and retrofit the existing PostgreSQL logging facility for such a thing though.  Log parsing as the way to collect data is filled with headaches anyway.
>>
>> I don't see any other good way to resolve this trade-off.  To help with the real-world situation you describe, ideally you dump all the query data somewhere, fast, and have the client move on.  You can't make queries wait for storage; something else (with a buffer!) needs to take over that job.
>>
>> I can't find the URL right now, but at PG.EU someone was showing me a module that grabbed the new 9.2 logging hook and shipped the result to another server.  Clients burn a little CPU and network time and they move on, and magically disk I/O goes out of their concerns.  How much overhead persisting the data takes isn't the database server's job at all then.  That's the sort of flexibility people need to have with logging eventually.  Something so efficient that every client can afford to do it; it is capable of saving all events under ideal conditions; but under adverse ones, you have to keep going and accept the loss.

I definitely think we need a means to "log" stuff really, really fast and then deal with it later. And the log ideally would not be text either, because it's a PITA to deal with and often far more data to handle (which makes all our problems worse).

I like that Greg is proposing a method of logging information in a more useful format than text. I suggest that it would be best if the non-text structure be implemented outside of any particular logging methodology if possible. Perhaps as something *equivalent to* a set of data types in PG. Of course I doubt real PG types could be used for the actual logging due to performance reasons, but the end interface should be able to use PG capabilities (inheritance is another interesting possibility).
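Purely to illustrate what I mean by the end interface, it might expose something along these lines. The type name and fields here are made up for the example, and the raw capture path would obviously have to use something much cheaper than a real composite type:

CREATE TYPE timing_event AS (
    logged_at    timestamptz,   -- when the event was captured
    backend_pid  integer,       -- which backend produced it
    context      text,          -- e.g. a function name or subsystem
    query        text,          -- the statement involved, if any
    duration     interval       -- how long it took
);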
 

Josh also brings up an interesting use case, one where I really like Tom Kyte's (the Oracle "Ask Tom" guy) methodology. When asked about how to do logging in PL/SQL he proposed a system where you always associate some kind of "context" with log data so that you can enable logging *only for that context*. In this case (and in plpgsql syntax), you'd do something like:

PERFORM debug( 'my function name', 'blah blah just happened' );

That way you're not stuck trying to mess around with verbosity levels to stem a flood of data; instead you can enable data collection only for a specific context.
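For illustration only, a minimal debug() along those lines could look roughly like the following. It assumes a custom GUC (here called debug.contexts) holding a comma-separated list of enabled contexts, and the two-argument form of current_setting() that tolerates an unset GUC; all of the names are made up:

CREATE OR REPLACE FUNCTION debug(p_context text, p_message text)
RETURNS void LANGUAGE plpgsql AS $$
BEGIN
    -- Emit nothing unless this particular context has been switched on.
    IF p_context = ANY (string_to_array(
           coalesce(current_setting('debug.contexts', true), ''), ',')) THEN
        RAISE LOG 'debug [%]: %', p_context, p_message;
    END IF;
END;
$$;

-- Then, in the session you care about:
SET debug.contexts = 'my function name';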
 

IIRC, his system also had a method for turning on debugging on a *different backend*. That allowed you to debug a specific application thread. If we had that ability, it would eliminate the issue of requiring superuser to change the auto_explain GUC.
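We have nothing like that built in today, but just to sketch the shape of it: the same debug() helper could also consult a small table keyed by backend PID, so that a second session can switch debugging on for a specific backend. Again, everything here is hypothetical:

CREATE TABLE debug_targets (backend_pid integer, context text);

-- From another session, target backend 12345 (a made-up PID):
INSERT INTO debug_targets VALUES (12345, 'my function name');

-- Inside debug(), in addition to the GUC check above, something like:
--   IF EXISTS (SELECT 1 FROM debug_targets
--              WHERE backend_pid = pg_backend_pid()
--                AND context = p_context) THEN ...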
 

Please note that I'm not suggesting these ideas to the exclusion of others; they can all be valuable. IE: retain the concept of "log level", add a user-specified context (or maybe it'd be enough to just use function names), and provide some ability to control debug output on a separate backend. All three features are useful and can stand on their own or in combination.

> Would it be useful to be able to specify a fixed sampling rate, say 1 in 100?  That way you could get a well-defined statistical sample, yet not cause excessive I/O?

We actually do that in our application and have discovered that random sampling can end up significantly skewing your
data.
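For what it's worth, the simplest form of that idea, bolted onto the hypothetical debug() helper above, would be something like the function below--but, as I said, this kind of random sampling can skew what you end up seeing:

CREATE OR REPLACE FUNCTION debug_sampled(p_context text, p_message text,
                                         p_rate float8 DEFAULT 0.01)
RETURNS void LANGUAGE plpgsql AS $$
BEGIN
    -- Roughly 1-in-100 by default; purely illustrative.
    IF random() < p_rate THEN
        PERFORM debug(p_context, p_message);
    END IF;
END;
$$;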


