Home > mailing lists

Re: update faster way - Mailing list pgsql-general

From	Juan Rodrigo Alejandro Burgos Mella
Subject	Re: update faster way
Date	September 15, 2024 05:51:49
Msg-id	CAHbZ42wTJSHZL+0NVHyDNmi=v7xEGLKfoVei=Y=_URfW9ySHBg@mail.gmail.com Whole thread
In response to	update faster way (yudhi s <learnerdatabase99@gmail.com>)
List	pgsql-general

Tree view

The only way that I see as plausible to use a subquery, both in the query and in the setting of the variable, is that the relationship is one to one, and that there is an index that responds to the predicate

UPDATE table1 t1

SET column_value = (SELECT <value> FROM table2 t2 WHERE t2.column_relation = t1.column_relation)
WHERE (colum_relation) IN (SELECT column_relation FROM table2)

PD: the index of being in table2

Atte

JRBM

El sáb, 14 sept 2024 a las 0:22, yudhi s (<learnerdatabase99@gmail.com>) escribió:

Hello,
We have to update a column value(from numbers like '123' to codes like 'abc' by looking into a reference table data) in a partitioned table with billions of rows in it, with each partition having 100's millions rows. As we tested for ~30million rows it's taking ~20minutes to update. So if we go by this calculation, it's going to take days for updating all the values. So my question is

1) If there is any inbuilt way of running the update query in parallel (e.g. using parallel hints etc) to make it run faster?
2) should we run each individual partition in a separate session (e.g. five partitions will have the updates done at same time from 5 different sessions)? And will it have any locking effect or we can just start the sessions and let them run without impacting our live transactions?

UPDATE tab_part1
SET column1 = reftab.code
FROM reference_tab reftab
WHERE tab_part1.column1 = subquery.column1;

Regards
Yudhi

pgsql-general by date:

From: Adrian Klaver
Date: 14 September 2024, 23:16:09
Subject: Re: Reg: Size difference

From: Dan Kortschak
Date: 15 September 2024, 12:07:53
Subject: Re: re-novice coming back to pgsql: porting an SQLite update statement to postgres

Re: update faster way - Mailing list pgsql-general

Previous

Next