Home > mailing lists

Re: TPC-H Scaling Factors X PostgreSQL Cluster Command - Mailing list pgsql-performance

From	Heikki Linnakangas
Subject	Re: TPC-H Scaling Factors X PostgreSQL Cluster Command
Date	April 23, 2007 06:47:07
Msg-id	462C8085.3020602@enterprisedb.com Whole thread Raw
In response to	TPC-H Scaling Factors X PostgreSQL Cluster Command ("Nelson Kotowski" <nkotowski@gmail.com>)
Responses	Re: TPC-H Scaling Factors X PostgreSQL Cluster Command
List	pgsql-performance

Tree view

Nelson Kotowski wrote:
> So far, i need to do it in three different scale factors (1, 2 and 5GB
> databases).
>
> My build process comprehends creating the tables without any foreign keys,
> indexes, etc. - Running OK!
> Then, i load the data from the flat files generated through DBGEN software
> into these tables. - Running OK!
>
> Finally, i run a "optimize" script that does the following:
>
> - Alter the tables to add the mandatory foreign keys;
> - Create all mandatory indexes;
> - Cluster the orders table by the orders table index;
> - Cluster the lineitem table by the lineitem table index;
> - Vacuum the database;
> - Analyze statistics.

Cluster will completely rewrite the table and indexes. On step 2, you
should only create the indexes you're clustering on, and create the rest
of them after clustering.

Or even better, generate and load the data in the right order to start
with, so you don't need to cluster at all.

--
   Heikki Linnakangas
   EnterpriseDB   http://www.enterprisedb.com

pgsql-performance by date:

From: Mario Weilguni
Date: 23 April 2007, 05:53:27
Subject: Re: postgres: 100% CPU utilization

From: "henk de wit"
Date: 23 April 2007, 08:35:38
Subject: Re: Redundant sub query triggers slow nested loop left join

Re: TPC-H Scaling Factors X PostgreSQL Cluster Command - Mailing list pgsql-performance

Previous

Next