Avoiding deadlocks when performing bulk update and delete operations - Mailing list pgsql-general

From Sanjaya Vithanagama
Subject Avoiding deadlocks when performing bulk update and delete operations
Date
Msg-id CAMbKYynOjDazPNpLc5u_jK+Vw+1u=UQZUQKTwXyGNOPmstWCWw@mail.gmail.com
Whole thread Raw
Responses Re: Avoiding deadlocks when performing bulk update and delete operations  (Gavin Flower <GavinFlower@archidevsys.co.nz>)
Re: Avoiding deadlocks when performing bulk update and delete operations  (Igor Neyman <ineyman@perceptron.com>)
Re: Avoiding deadlocks when performing bulk update and delete operations  (Bill Moran <wmoran@potentialtech.com>)
List pgsql-general
Hi All,

We have a single table which does not have any foreign key references.

id_A (bigint)
id_B (bigint)
val_1 (varchar)
val_2 (varchar)

The primary key of the table is a composite of id_A and id_B.

Reads and writes of this table are highly concurrent and the table has millions of rows. We have several stored procedures which do mass updates and deletes. Those stored procedures are being called concurrently mainly by triggers and application code.

The operations usually look like the following where it could match thousands of records to update or delete:

DELETE FROM table_name t
USING (
   SELECT id_A, id_B
   FROM   table_name 
   WHERE  id_A = ANY(array_of_id_A)
   AND    id_B = ANY(array_of_id_B)
   ORDER  BY id_A, id_B
   FOR    UPDATE
   ) del
WHERE  t.id_A = del.id_A
AND    t.id_B = del.id_B;


UPDATE table_name t
SET    val_1 = 'some value'
     , val_2 = 'some value'
FROM (
   SELECT id_A, id_B
   FROM   table_name 
   WHERE  id_A = ANY(array_of_id_A)
   AND    id_B = ANY(array_of_id_B)
   ORDER  BY id_A, id_B
   FOR    UPDATE
   ) upd
WHERE  t.id_A = upd.id_A
AND    t.id_B = upd.id_B;

We are experiencing deadlocks and all our attempts to perform operations with locks (row level using SELECT FOR UPDATE as used in the above queries and table level locks) do not seem to solve these deadlock issues. (Note that we cannot in any way use access exclusive locking on this table because of the performance impact)

Is there another way that we could try to solve these deadlock situations? The reference manual says — "The best defense against deadlocks is generally to avoid them by being certain that all applications using a database acquire locks on multiple objects in a consistent order."

Is there a guaranteed way to do bulk update/delete operations in a particular order so that we can ensure deadlocks won't occur? Or are there any other tricks to avoid deadlocks in this situation?

Thank you in advance,
Sanjaya

pgsql-general by date:

Previous
From: zach cruise
Date:
Subject: Re: better architecture?
Next
From: Gavin Flower
Date:
Subject: Re: Avoiding deadlocks when performing bulk update and delete operations