Re: 10K vs 15k rpm for analytics - Mailing list pgsql-performance

From Scott Carey
Subject Re: 10K vs 15k rpm for analytics
Date
Msg-id 06A22EB5-2D84-401C-B75F-7CA883695E0A@richrelevance.com
Whole thread Raw
In response to Re: 10K vs 15k rpm for analytics  (david@lang.hm)
Responses Re: 10K vs 15k rpm for analytics
List pgsql-performance
On Mar 2, 2010, at 2:10 PM, <david@lang.hm> wrote:

> On Tue, 2 Mar 2010, Scott Marlowe wrote:
>
>> On Tue, Mar 2, 2010 at 2:30 PM, Francisco Reyes <lists@stringsutils.com> wrote:
>>> Scott Marlowe writes:
>>>
>>>> Then the real thing to compare is the speed of the drives for
>>>> throughput not rpm.
>>>
>>> In a machine, simmilar to what I plan to buy, already in house 24 x 10K rpm
>>> gives me about 400MB/sec while 16 x 15K rpm (2 to 3 year old drives) gives
>>> me about 500MB/sec
>>
>> Have you tried short stroking the drives to see how they compare then?
>> Or is the reduced primary storage not a valid path here?
>>
>> While 16x15k older drives doing 500Meg seems only a little slow, the
>> 24x10k drives getting only 400MB/s seems way slow.  I'd expect a
>> RAID-10 of those to read at somewhere in or just past the gig per
>> second range with a fast pcie (x8 or x16 or so) controller.  You may
>> find that a faster controller with only 8 or so fast and large SATA
>> drives equals the 24 10k drives you're looking at now.  I can write at
>> about 300 to 350 Megs a second on a slower Areca 12xx series
>> controller and 8 2TB Western Digital Green drives, which aren't even
>> made for speed.
>
> what filesystem is being used. There is a thread on the linux-kernel
> mailing list right now showing that ext4 seems to top out at ~360MB/sec
> while XFS is able to go to 500MB/sec+

I have Centos 5.4 with 10 7200RPM 1TB SAS drives in RAID 10 (Seagate ES.2, same perf as the SATA ones), XFS, Adaptec
5805,and get ~750MB/sec read and write sequential throughput. 

A RAID 0 of two of these stops around 1000MB/sec because it is CPU bound in postgres -- for select count(*).  If it is
select* piped to /dev/null, it is CPU bound below 300MB/sec converting data to text. 

For xfs, set readahead to 16MB or so (2MB or so per stripe) (--setra 32768 is 16MB) and absolutely make sure that the
xfsmount parameter 'allocsize' is set to about the same size or more.   For large sequential operations, you want to
makesure interleaved writes don't interleave files on disk.  I use 80MB allocsize, and 40MB readahead for the reporting
data.

Later Linux kernels have significantly improved readahead systems that don't need to be tuned quite as much.  For high
sequentialthroughput, nothing is as optimized as XFS on Linux yet.  It has weaknesses elsewhere however. 

And 3Ware on Linux + high throughput sequential = slow.  PERC 6 was 20% faster, and Adaptec was 70% faster with the
samedrives, and with experiments to filesystem and readahead for all.  From what I hear, Areca is a significant notch
aboveAdaptec on that too. 

>
> on single disks the disk performance limits you, but on arrays where the
> disk performance is higher there may be other limits you are running into.
>
> David Lang
>
> --
> Sent via pgsql-performance mailing list (pgsql-performance@postgresql.org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-performance


pgsql-performance by date:

Previous
From: Scott Carey
Date:
Subject: Re: 10K vs 15k rpm for analytics
Next
From: Greg Smith
Date:
Subject: Re: 10K vs 15k rpm for analytics