Re: Massive table (500M rows) update nightmare - Mailing list pgsql-performance

From marcin mank
Subject Re: Massive table (500M rows) update nightmare
Date
Msg-id b1b9fac61001071305vf182f3ajff6827f92c943c68@mail.gmail.com
In response to Massive table (500M rows) update nightmare  ("Carlo Stonebanks" <stonec.register@sympatico.ca>)
Responses Re: Massive table (500M rows) update nightmare  ("Carlo Stonebanks" <stonec.register@sympatico.ca>)
List pgsql-performance
> every update is a UPDATE ... WHERE id >= x AND id < x+10 and a commit
> is performed after every 1000 update statements, i.e. every 10000 rows.

What is the rationale behind this? How about doing 10k rows in one
UPDATE, and committing after each statement?
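
For illustration, a rough sketch of that alternative (table and column
names here are placeholders, not from the original post): one UPDATE
covering a 10,000-row id range, committed on its own.

    BEGIN;
    -- one batch: a 10,000-id range in a single statement
    -- (the value assigned is just an example)
    UPDATE big_table
       SET new_column = 0
     WHERE id >= 10000 AND id < 20000;
    COMMIT;
    -- then repeat with the next range: id >= 20000 AND id < 30000, etc.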

You could try making the condition on the ctid column, so you don't
have to use the index on id, and process the rows in physical order.
First make sure that newly inserted production data has the correct
value in the new column, and add 'WHERE new_column IS NULL' to the
conditions. But I have never tried this, so use it at your own risk.
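
An untried sketch of the ctid idea (again with placeholder names, and
very much at your own risk): since range comparisons on ctid may not be
available, this variant fetches one batch of ctids through a subquery
instead of a ctid range, so the id index is never touched.

    UPDATE big_table
       SET new_column = 0      -- placeholder value
     WHERE ctid = ANY (ARRAY(
             SELECT ctid
               FROM big_table
              WHERE new_column IS NULL
              LIMIT 10000));
    -- COMMIT, then run the same statement again until it updates 0 rows;
    -- with no ORDER BY, the subquery's seq scan tends to hand back rows
    -- roughly in physical order.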

Greetings
Marcin Mank
