Home > mailing lists

How to determine whether to VACUUM or CLUSTER - Mailing list pgsql-performance

From	ken shaw
Subject	How to determine whether to VACUUM or CLUSTER
Date	June 16, 2005 15:04:53
Msg-id	20050616180442.8983.qmail@web33703.mail.mud.yahoo.com Whole thread Raw
Responses	Re: How to determine whether to VACUUM or CLUSTER
List	pgsql-performance

Tree view

Hi All,

I have an app that updates a PostgreSQL db in a batch fashion. After each batch (or several batches), it issues VACUUM and ANALYZE calls on the updated tables. Now I want to cluster some tables for better performance. I understand that doing a VACUUM and a CLUSTER on a table is wasteful as the CLUSTER makes the VACUUM superfluous. The app does not have a built-in list of the tables and whether each is clustered or not. It looks to me as if the only way to determine whether to issue a VACUUM (on a non-clustered table) or a CLUSTER (on a clustered table) is to query the table "pg_index", much like view "pg_indexes" does, for the column "indisclustered". Is this right?

Also, how expensive is CLUSTER compared to VACUUM? Does CLUSTER read in the whole table, sort it, and write it back out? Or write out a completely new file? Is the time for a CLUSTER the same whether one row is out of place or the table is completely disordered?

Thanks,

Ken

Discover Yahoo!
Find restaurants, movies, travel & more fun for the weekend. Check it out!

pgsql-performance by date:

From: Bruno Wolff III
Date: 16 June 2005, 14:03:58
Subject: Re: Needed: Simplified guide to optimal memory configuration

From: Veikko Mäkinen
Date: 16 June 2005, 16:28:43
Subject: How does the transaction buffer work?

How to determine whether to VACUUM or CLUSTER - Mailing list pgsql-performance

Previous

Next