Thread: Hardware needed for 15,000,000 record DB?

Hardware needed for 15,000,000 record DB?

From
Date:
Hello,  :-)

Can someone help me spec out the hardware needed for a simple web-based
database app?

Basically, that application needs to lookup a single row by its primary key.
This would be fairly straightforward, except that the table needs to contain
up to 15 million records.  Each row will contain approximately 25  variable
length CHAR fields with perhaps a total of 3 to 4 kilobytes of data per row.

Updates will be done nightly via some sort of a batch process.

What kind of hardware would be needed for this sort of application? The
queries are not complex, it's just a lot of data.

Would a midrange Celeron processor with 256 MB RAM be sufficient? How would
backups work for a database this large?

Is PostgreSQL even the best database engine for this app? Perhaps MySQL? Or
maybe a Microsoft solution?

Thanks so much,
David


Re: Hardware needed for 15,000,000 record DB?

From
Mathieu Arnold
Date:
--On dimanche 21 avril 2002 19:25 -0400 pdg@stratos.net wrote:

> Hello,  :-)
>
> Can someone help me spec out the hardware needed for a simple web-based
> database app?
>
> Basically, that application needs to lookup a single row by its primary
> key. This would be fairly straightforward, except that the table needs to
> contain up to 15 million records.  Each row will contain approximately 25
> variable length CHAR fields with perhaps a total of 3 to 4 kilobytes of
> data per row.
>
> Updates will be done nightly via some sort of a batch process.
>
> What kind of hardware would be needed for this sort of application? The
> queries are not complex, it's just a lot of data.
>
> Would a midrange Celeron processor with 256 MB RAM be sufficient? How
> would backups work for a database this large?
>
> Is PostgreSQL even the best database engine for this app? Perhaps MySQL?
> Or maybe a Microsoft solution?

well, I have a server with a database over 7GB, and :

backup=> select count(*) from file;
  count
----------
 19430605
(1 row)

backup=> explain analyze select * from file where id_file = 29000000;
NOTICE:  QUERY PLAN:

Index Scan using file_pkey on file  (cost=0.00..3.18 rows=1 width=117)
(actual time=0.60..0.61 rows=1 loops=1)
Total runtime: 0.77 msec

and it's an old poor :
hda: IBM-DTLA-307030, ATA DISK drive

ps: the select count(*) took a long time, but I believe that was because
there was a batch running and feeding the db :)
--
Mathieu Arnold

Re: [ADMIN] Hardware needed for 15,000,000 record DB?

From
Date:
Hello,


> well, I have a server with a database over 7GB, and :
>
> backup=> select count(*) from file;
>   count
> ----------
>  19430605
> (1 row)
>
> backup=> explain analyze select * from file where id_file = 29000000;
> NOTICE:  QUERY PLAN:
>
> Index Scan using file_pkey on file  (cost=0.00..3.18 rows=1 width=117)
> (actual time=0.60..0.61 rows=1 loops=1)
> Total runtime: 0.77 msec
>
> and it's an old poor :
> hda: IBM-DTLA-307030, ATA DISK drive
>
> ps: the select count(*) took a long time, but I believe that was because
> there was a batch running and feeding the db :)

Thanks for the information.

Though, as a newbie, I am not quite sure what all the above performance
numbers actually mean.  :)

What sort of CPU/Memory configuration are you running this server on?

Thanks again,
David


Re: [ADMIN] Hardware needed for 15,000,000 record DB?

From
Mathieu Arnold
Date:

--On lundi 22 avril 2002 03:41 -0400 pdg@stratos.net wrote:

> Hello,
>
>
>> well, I have a server with a database over 7GB, and :
>>
>> backup=> select count(*) from file;
>>   count
>> ----------
>>  19430605
>> (1 row)
>>
>> backup=> explain analyze select * from file where id_file = 29000000;
>> NOTICE:  QUERY PLAN:
>>
>> Index Scan using file_pkey on file  (cost=0.00..3.18 rows=1 width=117)
>> (actual time=0.60..0.61 rows=1 loops=1)
>> Total runtime: 0.77 msec
>>
>> and it's an old poor :
>> hda: IBM-DTLA-307030, ATA DISK drive
>>
>> ps: the select count(*) took a long time, but I believe that was because
>> there was a batch running and feeding the db :)
>
> Thanks for the information.
>
> Though, as a newbie, I am not quite sure what all the above performance
> numbers actually mean.  :)
>
> What sort of CPU/Memory configuration are you running this server on?

well, it means that I got an entry from my database (which have 19,430,605
entries) in 0.77 msec using the primary key.
the server is a pIII 600 with 256M of ram, but there is a lot of other
things running on.
The main things about database is the hard drive speed, with the turtle
I've got in this server, it is *slow*, just get a scsi3 hard drive and
it'll run like smoothly, the second thing, is the memory/cpu speed, but it
really only gets useful if you have very complex queries.

--
Mathieu Arnold