Re: Hi Community - Mailing list pgsql-admin

From Steve Crawford
Subject Re: Hi Community
Date
Msg-id 54CFCFA7.1050301@pinpointresearch.com
Whole thread Raw
In response to Hi Community  (Naresh Soni <jmnaresh@gmail.com>)
Responses Re: Hi Community
List pgsql-admin
On 02/02/2015 03:18 AM, Naresh Soni wrote:
Good Evening Community

This is my first question on the list, I wanted to ask if postgres can handle multi millions records?  for example there will be  1 million records per table per day, so 365 millions per year.

Is yes, then please elaborate. 


Yes. Maybe. But you will need to elaborate. (A plain text file can store millions of records, too. The use pattern is what matters.)

For reference, I have a modest database of about 350-million records running on modest hardware of a several-year-old 4-core/24GB RAM/4-drive RAID10 with battery-backup server and get mostly acceptable performance with query peaks well over 1,000 TPS at about 2/3 writes.

The people here can give you lots of good advice with purchasing the PostgreSQL Performance book being a good one. PostgreSQL is feature-rich but you will need to design and choose appropriately. To get started a few questions jump out:

  • You say 1-million/day/table. How many tables (i.e. what is the total record count)?

  • How long do you want to keep the data (total anticipated database size)?

  • Can the data be easily partitioned? For example, can you set up child tables by month for query and archive efficiency?

  • What sort of query mix do you anticipate? What portion of inserts, updates, selects,...? Lots of data-warehouse type queries?

  • Is the schema fairly simple or are there lots of complex foreign-key constraints?

  • What sort of performance do you need - fraction-of-a-second transactional response or overnight analytical runs?

  • What hardware can you throw at it? At one end I've run PostgreSQL on a Raspberry Pi. At the other end, I'm about to migrate a database to a 20-core/128GB machine with Intel SSD storage that can purportedly match the TPS of 2,000 15k drives. (This is certainly not the upper limit of hardware for running PostgreSQL). Each is appropriate for its use-case.

In any case, don't let the number of options and features overwhelm you - you don't need to optimize prematurely. Describe your use-case here for some starting suggestions, get familiar with it, and adjust your design and settings as necessary.

Welcome to the list.

Cheers,
Steve

pgsql-admin by date:

Previous
From: Craig James
Date:
Subject: Re: dropping all databases and starting afresh
Next
From: Yaser Raja
Date:
Subject: Re: pg_xlog filling up under load