Thread: autoprewarm_dump_now

autoprewarm_dump_now

From

Дарья Шанина

Date:

04 April, 16:40:57

Hello everyone!
I have a question.

What would be better for the function autoprewarm_dump_now in case when we need to allocate memory that exceeds 1 GB:
1) allocate enough memory for the entire shared_buffer array (1..NBuffers) using palloc_extended;
2) allocate the maximum of currently possible memory (1 GB) using an ordinary palloc.

Thank you for your attention!

--
Best regards,
Daria Shanina

Re: autoprewarm_dump_now

From

Heikki Linnakangas

Date:

04 April, 17:04:18

On 04/04/2025 16:40, Дарья Шанина wrote:
> Hello everyone!
> I have a question.
> 
> What would be better for the function autoprewarm_dump_now in case when 
> we need to allocate memory that exceeds 1 GB:

Hmm, so if I counted right, sizeof(BlockInfoRecord) == 20 bytes, which 
means that you can fit about 409 GB worth of buffers in a 1 GB 
allocation. So autoprewarm will currently not work with shared_buffers > 
409 GB. That's indeed quite unfortunate.

> 1) allocate enough memory for the entire shared_buffer array 
> (1..NBuffers) using palloc_extended;

That would be a pretty straightforward fix.

> 2) allocate the maximum of currently possible memory (1 GB) using an 
> ordinary palloc.

That'd put an upper limit on how much is prewarmed. It'd be a weird 
limitation. And prewarming matters the most with large shared_buffers.

3) Don't pre-allocate the array, write it out in a streaming fashion.

Unfortunately the file format doesn't make that easy: the number of 
entries is at the beginning of the file. You could count the entries 
beforehand, but the buffers can change concurrently. You could write a 
placeholder first, and seek back to the beginning of the file to fill in 
the real number at the end. The problem with that is that the number of 
bytes needed for the count itself varies. I suppose we could write some 
spaces as placeholders to accommodate the max count.

In apw_load_buffers(), we also load the file into (DSM) memory. There's 
no similar 1 GB limit in dsm_create(), but I think it's a bit 
unfortunate that the array needs to be allocated upfront upon loading.

In short, ISTM the easy answer here is "use palloc_extended". But 
there's a lot of room for further optimizations.

-- 
Heikki Linnakangas
Neon (https://neon.tech)

Re: autoprewarm_dump_now

From

Melanie Plageman

Date:

04 April, 19:17:19

On Fri, Apr 4, 2025 at 10:04 AM Heikki Linnakangas <hlinnaka@iki.fi> wrote:
>
> In apw_load_buffers(), we also load the file into (DSM) memory. There's
> no similar 1 GB limit in dsm_create(), but I think it's a bit
> unfortunate that the array needs to be allocated upfront upon loading.

Unrelated to this problem, but I wondered why autoprewarm doesn''t
launch background workers for each database simultaneously instead of
waiting for each one to finish a db before moving onto the next one.
Is it simply to limit the number of bgworkers taking up resources?

- Melanie

Re: autoprewarm_dump_now

From

Robert Haas

Date:

04 April, 19:36:33

On Fri, Apr 4, 2025 at 12:17 PM Melanie Plageman
<melanieplageman@gmail.com> wrote:
> Unrelated to this problem, but I wondered why autoprewarm doesn''t
> launch background workers for each database simultaneously instead of
> waiting for each one to finish a db before moving onto the next one.
> Is it simply to limit the number of bgworkers taking up resources?

That's probably part of it, but also (1) a system that allowed for
multiple workers would be somewhat more complex to implement and (2)
I'm not sure how beneficial it would be. We go to some trouble to make
the I/O as sequential as possible, and this would detract from that. I
also don't know how long prewarming normally takes -- if it's fast
enough already, then maybe this doesn't matter. But if somebody is
having a problem with autoprewarm being slow and wants to implement a
multi-worker system to make it faster, cool.

--
Robert Haas
EDB: http://www.enterprisedb.com

Re: autoprewarm_dump_now

From

Daria Shanina

Date:

29 May, 16:16:13

Hello!

I have made a patch, now we can allocate more than 1 GB of memory for the autoprewarm_dump_now function.

Best regards,

Daria Shanina

пт, 4 апр. 2025 г. в 19:36, Robert Haas <robertmhaas@gmail.com>:

On Fri, Apr 4, 2025 at 12:17 PM Melanie Plageman
<melanieplageman@gmail.com> wrote:
> Unrelated to this problem, but I wondered why autoprewarm doesn''t
> launch background workers for each database simultaneously instead of
> waiting for each one to finish a db before moving onto the next one.
> Is it simply to limit the number of bgworkers taking up resources?

That's probably part of it, but also (1) a system that allowed for
multiple workers would be somewhat more complex to implement and (2)
I'm not sure how beneficial it would be. We go to some trouble to make
the I/O as sequential as possible, and this would detract from that. I
also don't know how long prewarming normally takes -- if it's fast
enough already, then maybe this doesn't matter. But if somebody is
having a problem with autoprewarm being slow and wants to implement a
multi-worker system to make it faster, cool.

--
Robert Haas
EDB: http://www.enterprisedb.com

Attachment

v1-0001-PGPRO-9971-Allocate-enough-memory-with-huge-share.patch

Re: autoprewarm_dump_now

From

Tom Lane

Date:

29 May, 16:21:48

Daria Shanina <vilensipkdm@gmail.com> writes:
> I have made a patch, now we can allocate more than 1 GB of memory for the
> autoprewarm_dump_now function.

Is that solving a real-world problem?  If it is, shouldn't we be
looking for a different approach that doesn't require such a huge
amount of memory?

            regards, tom lane

Re: autoprewarm_dump_now

From

Daria Shanina

Date:

30 May, 13:07:37

Some of our clients encountered a problem — they needed to allocate shared_buffers = 700 GB on a server with 1.5 TB RAM, and the error "invalid memory alloc request size 1835008000" occurred. That is, these are not mental exercises.

Best regards,

Daria Shanina

чт, 29 мая 2025 г. в 16:21, Tom Lane <tgl@sss.pgh.pa.us>:

Daria Shanina <vilensipkdm@gmail.com> writes:
> I have made a patch, now we can allocate more than 1 GB of memory for the
> autoprewarm_dump_now function.

Is that solving a real-world problem? If it is, shouldn't we be
looking for a different approach that doesn't require such a huge
amount of memory?

regards, tom lane

С уважением,

Шанина Дарья Александровна

Re: autoprewarm_dump_now

From

Robert Haas

Date:

03 June, 20:17:43

On Fri, May 30, 2025 at 6:07 AM Daria Shanina <vilensipkdm@gmail.com> wrote:
> Some of our clients encountered a problem — they needed to allocate shared_buffers = 700 GB on a server with 1.5 TB
RAM,and the error "invalid memory alloc request size 1835008000" occurred. That is, these are not mental exercises. 

I think the proposed patch should be committed and back-patched, after
fixing it so that it's pgindent-clean and adding a comment.  Does
anyone have strong objection to that?

--
Robert Haas
EDB: http://www.enterprisedb.com

Re: autoprewarm_dump_now

From

Tom Lane

Date:

03 June, 20:24:16

Robert Haas <robertmhaas@gmail.com> writes:
> I think the proposed patch should be committed and back-patched, after
> fixing it so that it's pgindent-clean and adding a comment.  Does
> anyone have strong objection to that?

Not here.  I do wonder if we can't find a more memory-efficient way,
but I concur that any such change would likely not be back-patch
material.

            regards, tom lane

Re: autoprewarm_dump_now

From

Andres Freund

Date:

03 June, 21:58:42

Hi,

On 2025-06-03 13:17:43 -0400, Robert Haas wrote:
> On Fri, May 30, 2025 at 6:07 AM Daria Shanina <vilensipkdm@gmail.com> wrote:
> > Some of our clients encountered a problem — they needed to allocate shared_buffers = 700 GB on a server with 1.5 TB
RAM,and the error "invalid memory alloc request size 1835008000" occurred. That is, these are not mental exercises.
 
> 
> I think the proposed patch should be committed and back-patched, after
> fixing it so that it's pgindent-clean and adding a comment.  Does
> anyone have strong objection to that?

No, seems like a thing that pretty obviously should be fixed.

Greetings,

Andres Freund

Re: autoprewarm_dump_now

From

Robert Haas

Date:

06 June, 16:18:48

On Tue, Jun 3, 2025 at 2:58 PM Andres Freund <andres@anarazel.de> wrote:
> > I think the proposed patch should be committed and back-patched, after
> > fixing it so that it's pgindent-clean and adding a comment.  Does
> > anyone have strong objection to that?
>
> No, seems like a thing that pretty obviously should be fixed.

Done.

*quakes in fear of incoming buildfarm results*

--
Robert Haas
EDB: http://www.enterprisedb.com