Home > mailing lists

Distributed/Parallel Computing - Mailing list pgsql-performance

From	Viji V Nair
Subject	Distributed/Parallel Computing
Date	October 5, 2009 16:11:20
Msg-id	84c89ac10910051211j4b6ae882m8f7eb0ea7dc3506c@mail.gmail.com Whole thread Raw
Responses	Re: Distributed/Parallel Computing
List	pgsql-performance

Tree view

Hi Team,

This question may have asked many times previously also, but I could not find a solution for this in any post. any help on the following will be greatly appreciated.

We have a PG DB with PostGIS functions. There are around 100 tables in the DB and almost all the tables contains 1 million records, around 5 table contains more than 20 million records. The total DB size is 40GB running on a 16GB, 2 x XEON 5420, RAID6, RHEL5 64bit machines, the questions is

1. The geometry calculations which we does are very complex and it is taking a very long time to complete. We have optimised PG config to the best, now we need a mechanism to distribute these queries to multiple boxes. What is best recommended way for this distributed/parallel deployment. We have tried PGPOOL II, but the performance is not satisfactory. Going for a try with GridSQL

2. How we can distribute/split these large tables to multiple disks of different nodes?

Thanks in advance

Viji

pgsql-performance by date:

From: Greg Smith
Date: 05 October 2009, 15:59:08
Subject: Re: Speed while runnning large transactions.

From: Guy Rouillier
Date: 05 October 2009, 16:35:31
Subject: Re: Query plan for NOT IN

Distributed/Parallel Computing - Mailing list pgsql-performance

Previous

Next