Re: weird performances problem - Mailing list pgsql-performance

From Ron
Subject Re: weird performances problem
Date
Msg-id 6.2.5.6.0.20051122120225.040f8658@earthlink.net
Whole thread Raw
In response to Re: weird performances problem  (Guillaume Smet <guillaume.smet@openwide.fr>)
List pgsql-performance
At 10:26 AM 11/22/2005, Guillaume Smet wrote:
>Ron,
>
>First of all, thanks for your time.
Happy to help.


>>As has been noted many times around here, put the WAL on its own
>>dedicated HD's.  You don't want any head movement on those HD's.
>
>Yep, I know that. That's just we supposed it was not so important if
>it was nearly a readonly database which is wrong according to you.
It's just good practice with pg that pg-xlog should always get it's
own dedicated HD set.  OTOH, I'm not at all convinced given the scant
evidence so far that it is the primary problem here; particularly
since if I understand you correctly, px-xlog is not on sdb or sdb1
where you are having the write storm.


>>_Something_ is doing long bursts of write IO on sdb and sdb1 every
>>30 minutes or so according to your previous posts.
>
>It's not every 30 minutes. It's a 20-30 minutes slow down 3-4 times
>a day when we have a high load.

Thanks for the correction and I apologize for the misunderstanding.
Clearly the first step is to instrument sdb and sdb1 so that you
understand exactly what is being accessed and written on them.

Possibilities that come to mind:
a) Are some of your sorts requiring more than 32MB during high
load?  If sorts do not usually require HD temp files and suddenly do,
you could get behavior like yours.

b) Are you doing too many 32MB sorts during high load?  Same comment as above.

c) Are you doing some sort of data updates or appends during high
load that do not normally occur?

d) Are you constantly doing "a little" write IO that turns into a
write storm under high load because of queuing issues?

Put the scaffolding in needed to trace _exactly_ what's happening on
sdb and sdb1 throughout the day and then examine the traces over a
day, a few days, and a week.  I'll bet you will notice some patterns
that will be helpful in identifying what's going on.

Ron



pgsql-performance by date:

Previous
From: Vivek Khera
Date:
Subject: Re: High context switches occurring
Next
From: Jeff Trout
Date:
Subject: Re: System queue