Re: Physical sites handling large data - Mailing list pgsql-general

From Justin Clift
Subject Re: Physical sites handling large data
Date
Msg-id 3D827747.F07484FF@postgresql.org
Whole thread Raw
In response to Physical sites handling large data  ("scott.marlowe" <scott.marlowe@ihs.com>)
Responses Re: Physical sites handling large data
List pgsql-general
Hi Scott,

Good move.  :)

Shridhar, any idea of the kind of demands they'll be placing on the
database?

For example, does your friend have an idea of:

a) how many clients will be simultaneously connecting to the database(s)
normally, and at peak times

b) how sensitive to performance lag and downtime is the application?

c) what are the data integrity requirements?  Large array's of data that
are mission critical need to be treated differently than small arrays,
especially when taking b) into consideration.  Higher end non-intel
servers generally have better features in their OS and hardware for
dealing with large amounts of important data.

d) what kind of stuff is your friend familar with?  For example, is he
ok with unix in general, etc?

The more info you can get to us, the better we can help yourselves out.

:-)

Regards and best wishes,

Justin Clift


"scott.marlowe" wrote:
>
> I moved this over to general, where it's more on topic...
>
> On Fri, 13 Sep 2002, Shridhar Daithankar wrote:
>
> > Hi all,
> >
> > One of my friends is evaluating postgres for large databases. This is a select
> > intensive application which is something similar to data-warehousing as far as
> > I can see.
> >
> > The data is 150GB in flat files so would swell to 200GB+ with indexes.
> >
> > Is anybody running that kind of site? Any url? Any performance numbers/tuning
> > tips for random selects?
> >
> > I would hate to put mysql there but we are evaluating that too. I would hate if
> > postgres loses this to mysql because I didn't know few things about postgres.
> >
> > Secondly would it make a difference if I host that database on say, an HP-UX
> > box? From some tests I have done for my job, single CPU HP-UX box trounces 4
> > way xeon box. Any suggestions in this directions?
>
> Often times the real limiter for database performance is IO bandwidth and
> subsystem, not the CPUs.  After that memory access speed and bandwidth are
> very important too, so I can see a big HP UX box beating the pants off of
> a Xeon.
>
> Honestly, I'd put a dual 1G PIII 1G ram up against a quad xeon with 2
> Gig ram if I got to spend the difference in cost on a very fast RAID
> array for the PIII.  Since a quad Xeon with 2 Gigs ram and a pair of 18
> gig SCSI drives goes for ~ $27,500 on Dell, and a Dual PIII 1Ghz with 5
> 15KRPM 18 gig drives goes for ~ $6,700, that leaves me with about $20,000
> to spend on an external RAID array on top of the 5 15kRPM drives I've
> already got configured.  An external RAID array with 144GB of 15krpm 18gig
> drives runs ~$7700, so you could get three if you got the dual PIII
> without all those drives built into it.  That makes for 24 15kRPM drives
> and about 430 Gigs of storage, all in a four unit Rack mounted setup.
>
> My point being, spend more money on the drive subsystem than anything else
> and you'll probably be fine, but postgresql may or may not be your best
> answer.  It may be better to use something like berkeley db to handle this
> job than a SQL database.
>
> ---------------------------(end of broadcast)---------------------------
> TIP 6: Have you searched our list archives?
>
> http://archives.postgresql.org

--
"My grandfather once told me that there are two kinds of people: those
who work and those who take the credit. He told me to try to be in the
first group; there was less competition there."
   - Indira Gandhi

pgsql-general by date:

Previous
From: "Williams, Travis L, NPONS"
Date:
Subject: Re: Can't run configure
Next
From: Tom Lane
Date:
Subject: Re: explain analyze