Re: TPC-H Scaling Factors X PostgreSQL Cluster Command - Mailing list pgsql-performance

From Heikki Linnakangas
Subject Re: TPC-H Scaling Factors X PostgreSQL Cluster Command
Date
Msg-id 462C8085.3020602@enterprisedb.com
Whole thread Raw
In response to TPC-H Scaling Factors X PostgreSQL Cluster Command  ("Nelson Kotowski" <nkotowski@gmail.com>)
Responses Re: TPC-H Scaling Factors X PostgreSQL Cluster Command
List pgsql-performance
Nelson Kotowski wrote:
> So far, i need to do it in three different scale factors (1, 2 and 5GB
> databases).
>
> My build process comprehends creating the tables without any foreign keys,
> indexes, etc. - Running OK!
> Then, i load the data from the flat files generated through DBGEN software
> into these tables. - Running OK!
>
> Finally, i run a "optimize" script that does the following:
>
> - Alter the tables to add the mandatory foreign keys;
> - Create all mandatory indexes;
> - Cluster the orders table by the orders table index;
> - Cluster the lineitem table by the lineitem table index;
> - Vacuum the database;
> - Analyze statistics.

Cluster will completely rewrite the table and indexes. On step 2, you
should only create the indexes you're clustering on, and create the rest
of them after clustering.

Or even better, generate and load the data in the right order to start
with, so you don't need to cluster at all.

--
   Heikki Linnakangas
   EnterpriseDB   http://www.enterprisedb.com

pgsql-performance by date:

Previous
From: Mario Weilguni
Date:
Subject: Re: postgres: 100% CPU utilization
Next
From: "henk de wit"
Date:
Subject: Re: Redundant sub query triggers slow nested loop left join