Re: Tuning PostgreSQL - Mailing list pgsql-performance

From scott.marlowe
Subject Re: Tuning PostgreSQL
Date
Msg-id Pine.LNX.4.33.0307291405400.22680-100000@css120.ihs.com
Whole thread Raw
In response to Re: Tuning PostgreSQL  (Ron Johnson <ron.l.johnson@cox.net>)
Responses Re: Tuning PostgreSQL
List pgsql-performance
On 29 Jul 2003, Ron Johnson wrote:

> On Tue, 2003-07-29 at 14:00, scott.marlowe wrote:
> > On 29 Jul 2003, Ron Johnson wrote:
> >
> > > On Tue, 2003-07-29 at 11:18, scott.marlowe wrote:
> > > > On 29 Jul 2003, Ron Johnson wrote:
> > > >
> > > > > On Tue, 2003-07-29 at 10:14, Vivek Khera wrote:
> > > > > > >>>>> "GS" == Greg Stark <gsstark@mit.edu> writes:
> > > > > >
> > > > > > GS> "scott.marlowe" <scott.marlowe@ihs.com> writes:
> > > > > >
> > > > > > GS> But you have to actually test your setup in practice to see if it
> > > > > > GS> hurts. A big data warehousing system will be faster under RAID5
> > > > > > GS> than under RAID1+0 because of the extra disks in the
> > > > > > GS> stripeset. The more disks in the stripeset the more bandwidth you
> > > > > > GS> get.
> > > > > >
> > > > > > Anyone have ideas on 14 spindles?  I just ordered a disk subsystem
> > > > > > with 14 high speed (U320 15kRPM) SCSI disks to hook up with a dell
> > > > > > PERC3/DC controller (only 128MB cache, though).
> > > > >
> > > > > 14 drives on one SCSI card, eh?  I'd be worried about saturating
> > > > > the bus.
> > > >
> > > > I'm pretty sure those PERCs are based on the megaraid cards, which can
> > > > handle 3 or 4 channels each...
> > >
> > > Each with 14 devices?  If so, isn't that a concentrated point of
> > > failure, even if the channels are 1/2 full?
> >
> > Yep.  I've built one once before when BIG hard drives were 9 gigs.  :-)
> >
> > And it is a point of concentrated failure, which brings me to my favorite
> > part about the LSI megaraid cards (which most / all perc3s are
> > apparently.)
> >
> > If you build a RAID1+0 or 0+1, you can seperate it out so each sub part is
> > on it's own card, and the other cards keep acting like one big card.
> > Assuming the bad card isn't killing your PCI bus or draining the 12V rail
> > or something.
>
> Sounds like my kinda card!
>
> Is the cache battery-backed up?

Yep

> How much cache can you stuff in them?

the old old old school MegaRAID428 could hold up to 128 Meg.  I'm sure the
new ones can handle 512Meg or more.

> > > > > Maybe it's an old rule of thumb, but I would fill a SCSI chain
> > > > > more than half full.
> > > >
> > > > It's an old rule of thumb, but it still applies, it just takes more drives
> > > > to saturate the channel.  Figure ~ 30 to 50 MBytes a second per drive, on
> > > > a U320 port it would take 10 drives to saturate it, and considering random
> > > > accesses will be much slower than the max ~30 megs a second off the
> > > > platter rate, it might take more than the max 14 drives to saturate U320.
> > >
> > > Ok.  You'd still saturate the 133MB/s PCI bus at 133/30 = 4.4 drives.
> >
> > But that's seq scan.  For many database applications, random access
> > performance is much more important.  Imagine 200 people entering
> > reservations of 8k or less each into a transaction processing engine.
> > Each transactions chance to hit an unoccupied spindle is what really
> > counts.  If there's 30 spindles, each doing a stripe's worth of access all
> > the time, it's likely to never flood the channel.
> >
> > If random access is 1/4th the speed of seq scan, then you need to multiply
> > it by 4 to get the number of drives that'd saturate the PCI bus.
>
> Maybe it's just me, but I've never seen a purely TP system.

I think most of them are running under TPF on a mainframe in a basement
somewhere, like for airline reservations.  I've never worked on one, but
met one of the guys who runs one, and they use 12 mainframes for 6 live
machines and each live machine has a failover machine behind it in sysplex
mode.  I kept thinking of the giant dinosaurs in Jurassic park...

> Even if roll off the daily updates to a "reporting database" each
> night, some yahoo manager with enough juice to have his way still
> wants up-to-the-minute reports...

Just because it's TP doesn't mean it doesn't have real time reporting.
But expensive reports probably do get run at night.

> Better yet, the Access Jockey, who thinks s/he's an SQL whiz but
> couldn't JOIN himself out of a paper bag...

I've seen a few who got joins and unions and what not, but explaining fks
or transactions got me a glazed look... :-)


pgsql-performance by date:

Previous
From: Ron Johnson
Date:
Subject: Re: Tuning PostgreSQL
Next
From: Ron Johnson
Date:
Subject: Re: Tuning PostgreSQL