Home > mailing lists

Re: Improve WALRead() to suck data directly from WAL buffers when possible - Mailing list pgsql-hackers

From	Jeff Davis
Subject	Re: Improve WALRead() to suck data directly from WAL buffers when possible
Date	February 14 04:29:47
Msg-id	381ee9523ac78994ca6a8c2b3795cd303c99cebc.camel@j-davis.com Whole thread Raw
In response to	Re: Improve WALRead() to suck data directly from WAL buffers when possible (Jeff Davis <pgsql@j-davis.com>)
Responses	Re: Improve WALRead() to suck data directly from WAL buffers when possible
List	pgsql-hackers

Tree view

Attached 2 patches.

Per Andres's suggestion, 0001 adds an:
  Assert(startptr + count <= LogwrtResult.Write)

Though if we want to allow the caller (e.g. in an extension) to
determine the valid range, perhaps using WaitXLogInsertionsToFinish(),
then the check is wrong. Maybe we should just get rid of that code
entirely and trust the caller to request a reasonable range?

On Mon, 2024-02-12 at 17:33 -0800, Jeff Davis wrote:
> That makes me wonder whether my previous idea[1] might matter: when
> some buffers have been evicted, should WALReadFromBuffers() keep
> going
> through the loop and return the end portion of the requested data
> rather than the beginning?
> [1]
> https://www.postgresql.org/message-id/2b36bf99e762e65db0dafbf8d338756cf5fa6ece.camel@j-davis.com

0002 is to illustrate the above idea. It's a strange API so I don't
intend to commit it in this form, but I think we will ultimately need
to do something like it when we want to replicate unflushed data.

The idea is that data past the Write pointer is always (and only)
available in the WAL buffers, so WALReadFromBuffers() should always
return it. That way we can always safely fall through to ordinary
WALRead(), which can only see before the Write pointer. There's also
data before the Write pointer that could be in the WAL buffers, and we
might as well copy that, too, if it's not evicted.

If some buffers are evicted, it will fill in the *end* of the buffer,
leaving a gap at the beginning. The nice thing is that if there is any
gap, it will be before the Write pointer, so we can always fall back to
WALRead() to fill the gap and it should always succeed.

Regards,
    Jeff Davis

Attachment

pgsql-hackers by date:

From: Yugo NAGATA
Date: 14 February, 03:53:45
Subject: Re: Small fix on query_id_enabled

From: Jeff Davis
Date: 14 February, 04:32:30
Subject: Re: Improve WALRead() to suck data directly from WAL buffers when possible

Re: Improve WALRead() to suck data directly from WAL buffers when possible - Mailing list pgsql-hackers

Attachment

Previous

Next