Re: A case for UPDATE DISTINCT attribute - Mailing list pgsql-hackers

From Alexey Bashtanov
Subject Re: A case for UPDATE DISTINCT attribute
Date
Msg-id 7afde60a-ec30-3c62-bbe2-6f73845a2cac@imap.cc
Whole thread Raw
In response to A case for UPDATE DISTINCT attribute  (Gajus Kuizinas <gajus@gajus.com>)
List pgsql-hackers
Hello Gajus,

I have observed that the following pattern is repeating in our data management programs:

UPDATE
  event
SET
  fuid = ${fuid},
  venue_id = ${venueId},
  url = ${url}
WHERE
  id = ${id} AND
  fuid IS != ${fuid} AND
  venue_id IS != ${venueId} AND
  url IS DISTINCT FROM ${url};

...
Meanwhile, a WHERE condition that excludes rows with matching values makes this into a noop in case of matching target column values.

For this to hold, you need your conditions in WHERE to be ORed, not ANDed.


UPDATE DISTINCT
  event
SET
  fuid = ${fuid},
  venue_id = ${venueId},
  url = ${url}
WHERE
  id = ${id}

would encourage greater adoption of such pattern.

Is there a technical reason this does not existing already?


At least a bunch of questions and concerns arise. Like these:

1) We cannot treat it as a syntactic sugar only and just expand it on parsing stage,
as the expression to generate the value assigned may be volatile, like UPDATE ... SET ... = random();
2) How should this interact with triggers? E.g. when NEW and OLD were the same
before BEFORE UPDATE trigger execution, but would be different after. Or visa versa.
Should they be included into transition tables?
3) Should RETURNING clause return the non-updated rows?
4) It must be not easy to guarantee anything if there is a FROM clause, a target row is present in the join more than once.
5) We need to fail correctly if one of the column data types doesn't have an equality operator.

Best regards,
  Alexey

pgsql-hackers by date:

Previous
From: "Bossart, Nathan"
Date:
Subject: Re: A few new options for vacuumdb
Next
From: Tom Lane
Date:
Subject: Re: Switching to 64-bit Bitmapsets