Distributed/Parallel Computing - Mailing list pgsql-performance

From Viji V Nair
Subject Distributed/Parallel Computing
Date
Msg-id 84c89ac10910051211j4b6ae882m8f7eb0ea7dc3506c@mail.gmail.com
Responses Re: Distributed/Parallel Computing
List pgsql-performance
Hi Team,

This question may have been asked many times before, but I could not find a solution in any post. Any help with the following will be greatly appreciated.

We have a PG DB with PostGIS functions. There are around 100 tables in the DB; almost all of them contain about 1 million records, and around 5 tables contain more than 20 million records. The total DB size is 40GB, running on a 16GB, 2 x Xeon 5420, RAID6, RHEL5 64-bit machine. The questions are:
   
1. The geometry calculations we do are very complex and take a very long time to complete. We have tuned the PG configuration as far as we can; now we need a mechanism to distribute these queries across multiple boxes. What is the recommended approach for this kind of distributed/parallel deployment? We have tried pgpool-II, but the performance was not satisfactory. We are going to try GridSQL next.

2. How can we distribute/split these large tables across multiple disks on different nodes?
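
To make question 2 concrete, here is a minimal sketch of one possible way to split rows across nodes, assuming hash partitioning on an integer primary key. The node names and the functions `node_for_key` and `split_rows` are hypothetical and only illustrate the idea; they are not part of PostgreSQL, pgpool-II, or GridSQL.

```python
import zlib

# Hypothetical node names; the actual machines are not named in this post.
NODES = ["node_a", "node_b", "node_c"]


def node_for_key(key, nodes=NODES):
    """Map a row's primary key to one node using a stable hash."""
    # crc32 gives the same value in every process, so every client
    # routes a given key to the same node.
    digest = zlib.crc32(str(key).encode("utf-8"))
    return nodes[digest % len(nodes)]


def split_rows(keys, nodes=NODES):
    """Group row keys by the node that would store them."""
    placement = {node: [] for node in nodes}
    for key in keys:
        placement[node_for_key(key, nodes)].append(key)
    return placement


if __name__ == "__main__":
    placement = split_rows(range(1, 1001))
    for node, keys in placement.items():
        print(node, len(keys))
```

For geometry-heavy queries, a spatial key (for example a grid cell id) might be a better partition key than a row id, since rows that are queried together would then tend to land on the same node; this sketch does not settle that choice.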

Thanks in advance

Viji

