Home > mailing lists

Re: making update/delete of inheritance trees scale better - Mailing list pgsql-hackers

From	Heikki Linnakangas
Subject	Re: making update/delete of inheritance trees scale better
Date	October 30, 2020 20:35:48
Msg-id	2e50d782-36f9-e723-0c4b-d133e63c6127@iki.fi Whole thread Raw
In response to	Re: making update/delete of inheritance trees scale better (Amit Langote <amitlangote09@gmail.com>)
Responses	Re: making update/delete of inheritance trees scale better
List	pgsql-hackers

Tree view

On 29/10/2020 15:03, Amit Langote wrote:
> On Sun, Oct 4, 2020 at 11:44 AM Amit Langote <amitlangote09@gmail.com> wrote:
>> On Fri, Sep 11, 2020 at 7:20 PM Amit Langote <amitlangote09@gmail.com> wrote:
>>> Here are the commit messages of the attached patches:
>>>
>>> [PATCH v3 1/3] Overhaul how updates compute a new tuple
>>
>> I tried to assess the performance impact of this rejiggering of how
>> updates are performed.  As to why one may think there may be a
>> negative impact, consider that ExecModifyTable() now has to perform an
>> extra fetch of the tuple being updated for filling in the unchanged
>> values of the update's NEW tuple, because the plan itself will only
>> produce the values of changed columns.
>>
> ...
>> It seems clear that the saving on the target list computation overhead
>> that we get from the patch is hard to ignore in this case.
>>
>> I've attached updated patches, because as Michael pointed out, the
>> previous version no longer applies.
> 
> Rebased over the recent executor result relation related commits.

I also did some quick performance testing with a simple update designed 
as a worst-case scenario:

create unlogged table tab (a int4, b int4);
insert into tab select g, g from generate_series(1, 10000000) g;

\timing on
vacuum tab; update tab set b = b, a = a;

Without the patch, the update takes about 7.3 s on my laptop, and about 
8.3 s with the patch.

In this case, the patch fetches the old tuple, but it wouldn't really 
need to, because all the columns are updated. Could we optimize that 
special case?

In principle, it would sometimes also make sense to add the old columns 
to the targetlist like we used to, to avoid the fetch. But estimating 
when that's cheaper would be complicated.

Despite that, I like this new approach a lot. It's certainly much nicer 
than inheritance_planner().

- Heikki

pgsql-hackers by date:

From: Justin Pryzby
Date: 30 October 2020, 20:01:00
Subject: Re: Assertion failure when ATTACH partition followed by CREATE PARTITION.

From: Tomas Vondra
Date: 30 October 2020, 20:37:30
Subject: Re: Parallel copy

Re: making update/delete of inheritance trees scale better - Mailing list pgsql-hackers

Previous

Next