Re: Buffer locking is special (hints, checksums, AIO writes) - Mailing list pgsql-hackers

From Heikki Linnakangas
Subject Re: Buffer locking is special (hints, checksums, AIO writes)
Date
Msg-id 68e89de8-5f6c-4eaf-a800-e16a5e487667@iki.fi
In response to Re: Buffer locking is special (hints, checksums, AIO writes)  (Andres Freund <andres@anarazel.de>)
Responses Re: Buffer locking is special (hints, checksums, AIO writes)
List pgsql-hackers
On 03/02/2026 00:33, Andres Freund wrote:
> - The way MarkBufferDirtyHint() operates was copied into
>    heap_inplace_update_and_unlock(). Now that MarkBufferDirtyHint() won't work
>    that way anymore, it seems better to go with the alternative approach the
>    comments already outlined, namely to only delay updating of the buffer
>    contents.
> 
>    I've done this in a prequisite commit, as it doesn't actually depend on any
>    of the other changes.  Noah, any chance you could take a look at this?

Patch 0001 looks correct to me. However:

>      * ["D" is a VACUUM (ONLY_DATABASE_STATS)]
>      * ["R" is a VACUUM tbl]
>      * D: vac_update_datfrozenxid() -> systable_beginscan(pg_class)
>      * D: systable_getnext() returns pg_class tuple of tbl
>      * R: memcpy() into pg_class tuple of tbl
>      * D: raise pg_database.datfrozenxid, XLogInsert(), finish
>      * [crash]
>      * [recovery restores datfrozenxid w/o relfrozenxid]
>      *
>      * As we hold an exclusive lock - preventing the buffer from being written
>      * out once dirty - we can work around this as follows: MarkBufferDirty(),
>      * XLogInsert(), memcpy().

That last reference to 'memcpy' is a little orphaned now. The comment 
used to talk about the stack copy of the page, but now there's no 
mention of that except for this reference to memcpy(). To make things 
worse, the steps have "memcpy() into pg_class tuple of tbl", so one 
could think that the "memcpy" refers to that.

How about this:

     * We avoid that by using a temporary copy of the buffer to hide our
     * change from other backends until it's been WAL-logged. We apply our
     * change to the temporary copy and WAL-log it before modifying the real
     * page. That way any action a reader of the in-place-updated value takes
     * will be WAL-logged after this change.
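
For illustration, the ordering that the proposed comment describes can be sketched as a toy in plain C. This is not PostgreSQL code: `shared_page`, `toy_wal_insert()` and `toy_inplace_update()` are hypothetical stand-ins invented for this sketch, and locking, critical sections and buffer dirtying are left out on purpose. It only shows that the change goes into a private copy, the copy is "logged", and the shared page is touched last.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define TOY_PAGE_SIZE 64

/* Hypothetical stand-ins; none of these are PostgreSQL APIs. */
static char shared_page[TOY_PAGE_SIZE];        /* page that other backends read */
static char wal_image[TOY_PAGE_SIZE];          /* image captured by the pretend WAL record */
static char shared_at_wal_time[TOY_PAGE_SIZE]; /* what readers could see while we logged */
static int  wal_records = 0;

/* Pretend WAL insertion: remember the image we were handed, and also
 * snapshot the shared page so the ordering can be checked afterwards. */
static void
toy_wal_insert(const char *page_image)
{
    memcpy(wal_image, page_image, TOY_PAGE_SIZE);
    memcpy(shared_at_wal_time, shared_page, TOY_PAGE_SIZE);
    wal_records++;
}

/* Apply an in-place change of len bytes at offset off. */
static void
toy_inplace_update(size_t off, const char *newval, size_t len)
{
    char copy[TOY_PAGE_SIZE];

    assert(off + len <= TOY_PAGE_SIZE);

    /* 1. Work on a temporary copy; readers still see the old value. */
    memcpy(copy, shared_page, TOY_PAGE_SIZE);
    memcpy(copy + off, newval, len);

    /* 2. Log the change from the copy. */
    toy_wal_insert(copy);

    /* 3. Only now make the change visible in the real page. */
    memcpy(shared_page + off, newval, len);
}
```

In this toy, a reader that sees the new value can only do so after step 2 has completed, so anything it logs in response comes later in the pretend WAL than the in-place change.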

- Heikki


