Re: Peformance Tuning Opterons/ Hard Disk Layout - Mailing list pgsql-performance

From Vig, Sandor (G/FI-2)
Subject Re: Peformance Tuning Opterons/ Hard Disk Layout
Date
Msg-id 977921B17B2F2048AA5AAE9B4CBB713E01423D2C@huaudigs0035.audi.de
Whole thread Raw
In response to Peformance Tuning Opterons/ Hard Disk Layout  (John Allgood <john@turbocorp.com>)
List pgsql-performance
Hi,

RAID1 (mirroring) and RAID1+0 (striping and mirroring) seems to
be a good choice. (RAID 5 is for saving money, but it doesn't have a
good performance)

I suggest you to make a different array for:
- Operating system
- db logs
- each database

It is a little bit  of "wasting" disk storage, but it has the best
performance.
Forget RAID 5. If your fibre channel card and the external storage exceeds
their throughput limits you should consider to implement +1 fibre channel
and/or +1 external storage unit. (If you had such a load)

But it is only the hardware. The database structure, and the application
logic is the other 50% of the performance...

Bye
Vig Sándor



-----Original Message-----
From: pgsql-performance-owner@postgresql.org
[mailto:pgsql-performance-owner@postgresql.org]On Behalf Of John Allgood
Sent: Wednesday, February 23, 2005 9:42 PM
To: John Arbash Meinel
Cc: pgsql-performance@postgresql.org
Subject: Re: [PERFORM] Peformance Tuning Opterons/ Hard Disk Layout


Here is a summary about the cluster suite from redhat. All 9 databases
will be on the primary server the secondary server I have is the
failover. They don't actually share the partitions at the same time.
When you have some type of failure the backup server takes over. Once
you setup the hardware and install the clustering software. You then
setup a service "ie postgres" and then you tell it what harddrive you
will be using. /dev/sde1 and the clustering software takes care of
starting and stopping the postgres database.


  Cluster Manager

The Cluster Manager feature of Red Hat Cluster Suite provides an
application failover infrastructure that can be used by a wide range of
applications, including:

    * Most custom and mainstream commercial applications
    * File and print serving
    * Databases and database applications
    * Messaging applications
    * Internet and open source application

With Cluster Manager, these applications can be deployed in high
availability configurations so that they are always operational—bringing
"scale-out" capabilities to enterprise Linux deployments.

For high-volume open source applications, such as NFS, Samba, and
Apache, Cluster Manager provides a complete ready-to-use failover
solution. For most other applications, customers can create custom
failover scripts using provided templates. Red Hat Professional Services
can provide custom Cluster Manager deployment services where required.


    Features

    * Support for up to eight nodes: Allows high availability to be
      provided for multiple applications simultaneously.
    * NFS/CIFS Failover: Supports highly available file serving in Unix
      and Windows environments.
    * Fully shared storage subsystem: All cluster members have access to
      the same storage.
    * Comprehensive Data Integrity guarantees: Uses the latest I/O
      barrier technology, such as programmable power switches and
      watchdog timers.
    * SCSI and Fibre Channel support: Cluster Manager configurations can
      be deployed using latest SCSI and Fibre Channel technology.
      Multi-terabyte configurations can readily be made highly available.
    * Service failover: Cluster Manager not only ensures hardware
      shutdowns or failures are detected and recovered from
      automatically, but also will monitor your applications to ensure
      they are running correctly, and will restart them automatically if
      they fail.



John Arbash Meinel wrote:
> John Allgood wrote:
>
>> This some good info. The type of attached storage is a Kingston 14 bay
>> Fibre Channel Infostation. I have 14 36GB 15,000 RPM drives. I think
>> the way it is being explained that I should build a mirror with two
>> disk for the pg_xlog and the striping and mirroring the rest and put
>> all my databases into one cluster. Also I might mention that I am
>> running clustering using Redhat Clustering Suite.
>
>
> So are these 14-disks supposed to be shared across all of your 9
> databases?
> It seems to me that you have a few architectural issues here.
>
> First, you can't really have 2 masters writing to the same disk array.
> I'm not sure if Redhat Clustering gets around this. But second is that
> you can't run 2 postgres engines on the same database. Postgres doesn't
> support a clustered setup. There are too many issues with concurancy and
> keeping everyone in sync.
>
> Since you seem to be okay with having a bunch of smaller localized
> databases, which update a master database 1/day, I would think you would
> want hardware to go something like this.
>
> 1 master server, at least dual opteron with access to lots of disks
> (likely the whole 14 if you can get away with it). Put 2 as a RAID1 for
> the OS, 4 as a RAID10 for pg_xlog, and then the other 8 as RAID10 for
> the rest of the database.
>
> 8-9 other servers, these don't need to be as powerful, since they are
> local domains. Probably a 4-disk RAID10 for the OS and pg_xlog is plenty
> good, and whatever extra disks you can get for the local database.
>
> The master database holds all information for all domains, but the other
> databases only hold whatever is the local information. Every night your
> script sequences through the domain databases one-by-one, updating the
> master database, and synchronizing whatever data is necesary back to the
> local domain. I would guess that this script could actually just
> continually run, going to each local db in turn, but you may want
> nighttime only updating depending on what kind of load they have.
>
> John
> =:->
>

---------------------------(end of broadcast)---------------------------
TIP 9: the planner will ignore your desire to choose an index scan if your
      joining column's datatypes do not match

The information transmitted is intended only for the person or entity to
which it is addressed and may contain confidential and/or privileged
material. Any review, retransmission, dissemination or other use of, or
taking of any action in reliance upon, this information by persons or
entities other than the intended recipient is prohibited. If you received
this in error, please contact the sender and delete the material from any
computer.

pgsql-performance by date:

Previous
From: Christopher Browne
Date:
Subject: Re: Peformance Tuning Opterons/ Hard Disk Layout
Next
From: Gaetano Mendola
Date:
Subject: Re: is pg_autovacuum so effective ?