Thread: Memory footprint diff between 9.5 and 12

Memory footprint diff between 9.5 and 12

From
Tory M Blue
Date:
I hadn't noticed this until today, but a running 9.5 system with buffers at 10GB starts and has been running years without issues. (15GB available)

Postgres 12 will not start with that configuration, complaining about memory availability.  So Postgres12 won't start until shared buffers is 6GB, but even with that, my DB servers , postgres queries started complaining about being unable to allocate memory "unable to allocate".

So dropping them to 4GB (on a 15GB system), may help, but did I miss a huge note about significant memory changes between 9.5 to 12?

Is there something else I'm missing that on busy systems is important, something introduced in 10 or 11 as again I'm not seeing anything noted in 12.
Thanks
Tory

Re: Memory footprint diff between 9.5 and 12

From
Alan Hodgson
Date:
On Thu, 2020-05-07 at 13:33 -0700, Tory M Blue wrote:
I hadn't noticed this until today, but a running 9.5 system with buffers at 10GB starts and has been running years without issues. (15GB available)

Postgres 12 will not start with that configuration, complaining about memory availability.  So Postgres12 won't start until shared buffers is 6GB, but even with that, my DB servers , postgres queries started complaining about being unable to allocate memory "unable to allocate".

So dropping them to 4GB (on a 15GB system), may help, but did I miss a huge note about significant memory changes between 9.5 to 12?

Is there something else I'm missing that on busy systems is important, something introduced in 10 or 11 as again I'm not seeing anything noted in 12.
Thanks

Is this running on an otherwise identical system? Or do you have a different kernel, overcommit settings, or swap configuration?

Re: Memory footprint diff between 9.5 and 12

From
Laurenz Albe
Date:
On Thu, 2020-05-07 at 13:33 -0700, Tory M Blue wrote:
> I hadn't noticed this until today, but a running 9.5 system with buffers at 10GB starts
> and has been running years without issues. (15GB available)
> 
> Postgres 12 will not start with that configuration, complaining about memory availability.
> So Postgres12 won't start until shared buffers is 6GB, but even with that, my DB servers ,
> postgres queries started complaining about being unable to allocate memory "unable to allocate".
> 
> So dropping them to 4GB (on a 15GB system), may help, but did I miss a huge note about
> significant memory changes between 9.5 to 12?
> 
> Is there something else I'm missing that on busy systems is important, something introduced
> in 10 or 11 as again I'm not seeing anything noted in 12.

There must be something else running on the machine that allocates memory.

Did you perchance run the 9.5 and the v12 server on the same machine?

Yours,
Laurenz Albe
-- 
Cybertec | https://www.cybertec-postgresql.com




Re: Memory footprint diff between 9.5 and 12

From
Tory M Blue
Date:


On Thu, May 7, 2020 at 11:39 PM Laurenz Albe <laurenz.albe@cybertec.at> wrote:
On Thu, 2020-05-07 at 13:33 -0700, Tory M Blue wrote:
> I hadn't noticed this until today, but a running 9.5 system with buffers at 10GB starts
> and has been running years without issues. (15GB available)
>
> Postgres 12 will not start with that configuration, complaining about memory availability.
> So Postgres12 won't start until shared buffers is 6GB, but even with that, my DB servers ,
> postgres queries started complaining about being unable to allocate memory "unable to allocate".
>
> So dropping them to 4GB (on a 15GB system), may help, but did I miss a huge note about
> significant memory changes between 9.5 to 12?
>
> Is there something else I'm missing that on busy systems is important, something introduced
> in 10 or 11 as again I'm not seeing anything noted in 12.

There must be something else running on the machine that allocates memory.

Did you perchance run the 9.5 and the v12 server on the same machine?

Yours,
Laurenz Albe
--


I guess the one thing I can come up with, is that my older config has commands that are no longer valid or have been changed. or the defaults that I'm not overwriting have changed significantly.

I'm using the stock postgresql.conf file (not edited, other than to add the include at the bottom for my config file). the include file is our local config and same one I've been using between 9.5 and 12, and 9.5 will start with shared buffers of 10GB but 12 will not.

This box has 15GB of available memory.

listen_addresses = '*'
#
max_connections = 300  
#
log_destination 'stderr'
#
log_directory = '/pgsql/logs'
#
logging_collector = on
#
log_filename = 'pgsql-%m-%d.log' # log file name pattern,
#
log_min_duration_statement = 80ms # -1 is disabled, 0 logs all statements
#
log_lock_waits = on # log lock waits >= deadlock_timeout
#
log_timezone = 'US/Pacific'
#
autovacuum_max_workers = 3 # max number of autovacuum subprocesses
#
autovacuum_vacuum_threshold = 10000 # min number of row updates before
#
autovacuum_analyze_threshold = 3000 # min number of row updates before
#
timezone = 'US/Pacific'
#
deadlock_timeout = 2s
#
autovacuum_work_mem = -1                # min 1MB, or -1 to use
#
max_stack_depth = 2MB                  # min 100kB
#
dynamic_shared_memory_type = posix      # the default is the first option
#
shared_buffers = 5GB
#
effective_cache_size = 10GB
#
work_mem = 256MB
#
maintenance_work_mem = 256MB
#
# min_wal_size = 100MB
#
# max_wal_size = 2GB
#
checkpoint_completion_target = 0.9
#
wal_buffers = 16MB
#
default_statistics_target = 100
#
default_text_search_config = 'pg_catalog.simple'
#
synchronous_commit = off
#
log_line_prefix = '< %m %h >' 

Is there a significant difference in Memory settings between 9.5 and 12

From
Tory M Blue
Date:

Upgraded from 9.5 to 12 and 12 would not start with the current configured Shared Buffers.

Same hardware, same config file.

Which tells me something has changed, are there new default settings in the 12 postgresql.conf file that are not being called out in my 9.5 config file that could be the cause or?

9.5
shared_buffers 10GB

12 
shared_buffers 5GB

12 will not start at 10GB, even though it's the same hardware, same config file, same physical box, same everything, just version 12 vs 9.5

So this tells me that maybe i'm missing a new memory setting in 12 , that is not being overwritten in my local config file (I run an include and my own settings), this worked fine in 9.5 but I'm guessing there is something considerably different between 9.5 and 12 (I'm just not seeing it). Anyone with insight into what memory settings may have been added in 10/11/12 that are significantly different than 9.5?

Thanks
Tory

CentOS 7
Postgresql 9.5.x
Postgresql 12.2


Tory M Blue <tmblue@gmail.com> writes:
> 12 will not start at 10GB, even though it's the same hardware, same config
> file, same physical box, same everything, just version 12 vs 9.5

For me, using all-default settings (in particular, shared_buffers =
128MB), the shared memory block is about 141.6MB using 9.5 and 142.1MB
using 12.  So there's half a meg or so of additional data in v12, but
certainly not gigabytes worth.

Are you trying to start both postmasters concurrently?  Maybe you're
hitting some kernel limit on the total amount of shared memory in the
system.

            regards, tom lane





On Mon, May 11, 2020 at 1:36 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
Tory M Blue <tmblue@gmail.com> writes:
> 12 will not start at 10GB, even though it's the same hardware, same config
> file, same physical box, same everything, just version 12 vs 9.5

For me, using all-default settings (in particular, shared_buffers =
128MB), the shared memory block is about 141.6MB using 9.5 and 142.1MB
using 12.  So there's half a meg or so of additional data in v12, but
certainly not gigabytes worth.

Are you trying to start both postmasters concurrently?  Maybe you're
hitting some kernel limit on the total amount of shared memory in the
system.

                        regards, tom lane

Hey Tom

Nope, just a single one that is why i'm flummoxed :) I've even rebooted, but I can't start Postgres 12 with my current setting of 10GB, I can start 9.5 with 10GB configured.   I've tried, shutting down 9.5 and rebooting so nothing is running and attempting to start 12 and nada, it won't unless I drop the Shared Buffers down to 5GB (half)..  But these are dedicated postgresql servers. And in fact my latest migrations, don't even have 9.5 binaries anymore and 12 will not start with my 9.5 configuration of 10GB buffers. So something feels really different.

It's very possible that there are new defaults , new memory settings that I'm not finding in the default postgresql 12 .conf file, and my include is not overwriting it. But really I just can't fathom what that could be.. Buffers, work mem, effective cache, what would they have added?

I am going to pull the settings from postgres itself and compare 12 and 9.5 to see if there is something glaring.

Thanks! :)
Tory 


On Mon, May 11, 2020 at 1:36 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
Tory M Blue <tmblue@gmail.com> writes:
> 12 will not start at 10GB, even though it's the same hardware, same config
> file, same physical box, same everything, just version 12 vs 9.5

For me, using all-default settings (in particular, shared_buffers =
128MB), the shared memory block is about 141.6MB using 9.5 and 142.1MB
using 12.  So there's half a meg or so of additional data in v12, but
certainly not gigabytes worth.

Are you trying to start both postmasters concurrently?  Maybe you're
hitting some kernel limit on the total amount of shared memory in the
system.

                        regards, tom lane

Okay the one difference I see in settings is this little gem in 12.. 

shared_memory_typemmap

Otherwise i'm not seeing a ton of other settings not common between them,.

This is the only major difference I'm seeing, as it's really not an option in 9.5..... Appears 9.5 was using

shared_memory_type (enum)

Specifies the shared memory implementation that the server should use for the main shared memory region that holds PostgreSQL's shared buffers and other shared data. Possible values are mmap (for anonymous shared memory allocated using mmap), sysv (for System V shared memory allocated via shmget) and windows (for Windows shared memory). Not all values are supported on all platforms; the first supported option is the default for that platform. The use of the sysv option, which is not the default on any platform, is generally discouraged because it typically requires non-default kernel settings to allow for large allocations (see Section 18.4.1).

 

Re: Is there a significant difference in Memory settings between 9.5and 12

From
Adrian Klaver
Date:
On 5/11/20 1:42 PM, Tory M Blue wrote:
> 
> 
> On Mon, May 11, 2020 at 1:36 PM Tom Lane <tgl@sss.pgh.pa.us 
> <mailto:tgl@sss.pgh.pa.us>> wrote:
> 
>     Tory M Blue <tmblue@gmail.com <mailto:tmblue@gmail.com>> writes:
>      > 12 will not start at 10GB, even though it's the same hardware,
>     same config
>      > file, same physical box, same everything, just version 12 vs 9.5
> 
>     For me, using all-default settings (in particular, shared_buffers =
>     128MB), the shared memory block is about 141.6MB using 9.5 and 142.1MB
>     using 12.  So there's half a meg or so of additional data in v12, but
>     certainly not gigabytes worth.
> 
>     Are you trying to start both postmasters concurrently?  Maybe you're
>     hitting some kernel limit on the total amount of shared memory in the
>     system.
> 
>                              regards, tom lane
> 
> 
> Hey Tom
> 
> Nope, just a single one that is why i'm flummoxed :) I've even rebooted, 
> but I can't start Postgres 12 with my current setting of 10GB, I can 

So what is the error output from console, Postgres log and/or system log?

> start 9.5 with 10GB configured.   I've tried, shutting down 9.5 and 
> rebooting so nothing is running and attempting to start 12 and nada, it 
> won't unless I drop the Shared Buffers down to 5GB (half)..  But these 
> are dedicated postgresql servers. And in fact my latest migrations, 
> don't even have 9.5 binaries anymore and 12 will not start with my 9.5 
> configuration of 10GB buffers. So something feels really different.
> 
> It's very possible that there are new defaults , new memory settings 
> that I'm not finding in the default postgresql 12 .conf file, and my 
> include is not overwriting it. But really I just can't fathom what that 
> could be.. Buffers, work mem, effective cache, what would they have added?
> 
> I am going to pull the settings from postgres itself and compare 12 and 
> 9.5 to see if there is something glaring.
> 
> Thanks! :)
> Tory


-- 
Adrian Klaver
adrian.klaver@aklaver.com



Tory M Blue <tmblue@gmail.com> writes:
> Okay the one difference I see in settings is this little gem in 12..
> shared_memory_type mmap

Well, v12 is just exposing a switch for something that was hard-wired
before.  But now I wonder if your 9.5 installation could've been compiled
to force it to use SysV shmem instead of POSIX.  It would be pretty
unusual to have a system where the SysV shmem limits were higher than
the POSIX limits --- usually it's the other way 'round.  But this'd
explain why you're seeing a difference.

Does v12 start with the higher shared_buffers setting if you
set shared_memory_type = sysv?

            regards, tom lane





On Mon, May 11, 2020 at 2:08 PM Tory M Blue <tmblue@gmail.com> wrote:


On Mon, May 11, 2020 at 1:36 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
Tory M Blue <tmblue@gmail.com> writes:
> 12 will not start at 10GB, even though it's the same hardware, same config
> file, same physical box, same everything, just version 12 vs 9.5

For me, using all-default settings (in particular, shared_buffers =
128MB), the shared memory block is about 141.6MB using 9.5 and 142.1MB
using 12.  So there's half a meg or so of additional data in v12, but
certainly not gigabytes worth.

Are you trying to start both postmasters concurrently?  Maybe you're
hitting some kernel limit on the total amount of shared memory in the
system.

                        regards, tom lane

Okay the one difference I see in settings is this little gem in 12.. 

shared_memory_typemmap

Otherwise i'm not seeing a ton of other settings not common between them,.

This is the only major difference I'm seeing, as it's really not an option in 9.5..... Appears 9.5 was using

shared_memory_type (enum)

Specifies the shared memory implementation that the server should use for the main shared memory region that holds PostgreSQL's shared buffers and other shared data. Possible values are mmap (for anonymous shared memory allocated using mmap), sysv (for System V shared memory allocated via shmget) and windows (for Windows shared memory). Not all values are supported on all platforms; the first supported option is the default for that platform. The use of the sysv option, which is not the default on any platform, is generally discouraged because it typically requires non-default kernel settings to allow for large allocations (see Section 18.4.1).

 

That didn't help.

ay 11 19:46:13 qdb03.prod.ca. postmaster[31048]: < 2020-05-11 19:46:13.026 PDT  >FATAL:  could not create shared memory segment: Cannot allocate memory
May 11 19:46:13 qdb03.prod.ca postmaster[31048]: < 2020-05-11 19:46:13.026 PDT  >DETAIL:  Failed system call was shmget(key=5432001, size=11026235392, 03600).
May 11 19:46:13 qdb03.prod.ca postmaster[31048]: < 2020-05-11 19:46:13.026 PDT  >HINT:  This error usually means that PostgreSQL's request for a shared memory segm
May 11 19:46:13 qdb03.prod.ca postmaster[31048]: The PostgreSQL documentation contains more information about shared memory configuration. 

Attempted to change ;
#shared_memory_type = 'sysv'

It took the change but didn't help. So 10GB of shared_buffers in 12 is still a no go. I'm down to 5GB and it works, but this is the same hardware, the same exact 9.5 configuration. So I'm missing something. WE have not had to mess with kernel memory settings since 9.4, so this is an odd one.

I'll keep digging, but i'm hesitant to do my multiple TB db's with half of their shared buffer configs, until I understand what 12 is doing differently than 9.5

Thanks again for the ideas

Tory

Re: Is there a significant difference in Memory settings between 9.5and 12

From
"David G. Johnston"
Date:
On Monday, May 11, 2020, Tory M Blue <tmblue@gmail.com> wrote:
I'll keep digging, but i'm hesitant to do my multiple TB db's with half of their shared buffer configs, until I understand what 12 is doing differently than 9.5

Maybe run your test suite on 9.6, 10, and 11 to see if it is indeed new to 12 or at least appears on other versions?

David J. 
On Tue, May 12, 2020 at 2:52 PM Tory M Blue <tmblue@gmail.com> wrote:
> It took the change but didn't help. So 10GB of shared_buffers in 12 is still a no go. I'm down to 5GB and it works,
butthis is the same hardware, the same exact 9.5 configuration. So I'm missing something. WE have not had to mess with
kernelmemory settings since 9.4, so this is an odd one. 
>
> I'll keep digging, but i'm hesitant to do my multiple TB db's with half of their shared buffer configs, until I
understandwhat 12 is doing differently than 9.5 

Which exact version of 9.5.x are you coming from?  What's the exact
error message on 12 (you showed the shared_memory_type=sysv error, but
with the default  value (mmap) how does it look)?  What's your
huge_pages setting?

Can you reproduce the problem with a freshly created test cluster?  As
a regular user, assuming regular RHEL packaging, something like
/usr/pgsql-12/bin/initdb -D test_pgdata, and then
/usr/pgsql-12/bin/postgres -D test_pgdata -c shared_buffers=10GB (then
^C to stop it).  If that fails to start in the same way, it'd be
interesting to see the output of the second command with strace in
front of it, in the part where it allocates shared memory.  And
perhaps it'd be interesting to see the same output with
/usr/pgsql-9.5/bin/XXX (if you still have the packages).  For example,
on my random dev laptop that looks like:

openat(AT_FDCWD, "/proc/meminfo", O_RDONLY) = 6
fstat(6, {st_mode=S_IFREG|0444, st_size=0, ...}) = 0
read(6, "MemTotal:       16178852 kB\nMemF"..., 1024) = 1024
read(6, ":    903168 kB\nShmemHugePages:  "..., 1024) = 311
close(6)                                = 0
mmap(NULL, 11016339456, PROT_READ|PROT_WRITE,
MAP_SHARED|MAP_ANONYMOUS|MAP_HUGETLB, -1, 0) = -1 ENOMEM (Cannot
allocate memory)
mmap(NULL, 11016003584, PROT_READ|PROT_WRITE,
MAP_SHARED|MAP_ANONYMOUS, -1, 0) = 0x7ff74e579000
shmget(0x52e2c1, 56, IPC_CREAT|IPC_EXCL|0600) = 3244038
shmat(3244038, NULL, 0)                 = 0x7ff9df5ad000

The output is about the same on REL9_5_STABLE and REL_12_STABLE for
me, only slightly different sizes.  If that doesn't fail in the same
way on your system with 12, perhaps there are some more settings from
your real clusters required to make it fail.  You could add them one
by one with -c foo=bar or in the throw away
test_pgdata/postgresql.conf, and perhaps that process might shed some
light?

I was going to ask if it might be a preloaded extension that is asking
for gobs of extra memory in 12, but we can see from your "Failed
system call was shmget(key=5432001, size=11026235392, 03600)" that
it's in the same ballpark as my total above for shared_buffers=10GB.





On Mon, May 11, 2020 at 7:57 PM David G. Johnston <david.g.johnston@gmail.com> wrote:
On Monday, May 11, 2020, Tory M Blue <tmblue@gmail.com> wrote:
I'll keep digging, but i'm hesitant to do my multiple TB db's with half of their shared buffer configs, until I understand what 12 is doing differently than 9.5

Maybe run your test suite on 9.6, 10, and 11 to see if it is indeed new to 12 or at least appears on other versions?

David J. 

That may be the next step in the lab, but was hoping someone knew of a significant difference. 

Thanks
Tory
Tory M Blue <tmblue@gmail.com> writes:
> That may be the next step in the lab, but was hoping someone knew of a
> significant difference.

I think we've made it perfectly clear that we don't.  There's something
odd about your situation.

            regards, tom lane





On Mon, May 11, 2020 at 9:01 PM Thomas Munro <thomas.munro@gmail.com> wrote:
On Tue, May 12, 2020 at 2:52 PM Tory M Blue <tmblue@gmail.com> wrote:
> It took the change but didn't help. So 10GB of shared_buffers in 12 is still a no go. I'm down to 5GB and it works, but this is the same hardware, the same exact 9.5 configuration. So I'm missing something. WE have not had to mess with kernel memory settings since 9.4, so this is an odd one.
>
> I'll keep digging, but i'm hesitant to do my multiple TB db's with half of their shared buffer configs, until I understand what 12 is doing differently than 9.5

Which exact version of 9.5.x are you coming from?  What's the exact
error message on 12 (you showed the shared_memory_type=sysv error, but
with the default  value (mmap) how does it look)?  What's your
huge_pages setting?

9.5-20
postgresql95-9.5.20-2PGDG.rhel7.x86_64
postgresql95-contrib-9.5.20-2PGDG.rhel7.x86_64
postgresql95-libs-9.5.20-2PGDG.rhel7.x86_64
postgresql95-server-9.5.20-2PGDG.rhel7.x86_64 

I don't use huge_pages

And this error is actually from the default mmap

May 08 12:33:58 qdb01.prod.ca postmaster[8790]: < 2020-05-08 12:33:58.324 PDT  >HINT:  This error usually means that PostgreSQL's request for a shared memory segment exceeded available memory, swap space, or huge pages. To reduce the request size (currently 11026235392 bytes), reduce PostgreSQL's shared memory usage, perhaps by reducing shared_buffers or max_connections.

The above error is with 12 trying to start with shared_buffers = 10GB...

9.5 starts fine with the same configuration file.   That kind of started me down this path.

And just to repeat. Same exact hardware, same kernel, nothing more than installing the latest postgres12, copying my config files from 9.5 to 12 and running the pg_upgrade.

9.5 has been running for years with the same configuration file, so something changed somewhere along the line that is preventing 12 to start with the same config file.  And the allocation error is with either the sysv or mman on 12. (will start with 5GB allocated, but not 10GB, on a 15GB box (dedicated postgres server).


Can you reproduce the problem with a freshly created test cluster?  As
a regular user, assuming regular RHEL packaging, something like
/usr/pgsql-12/bin/initdb -D test_pgdata, and then
/usr/pgsql-12/bin/postgres -D test_pgdata -c shared_buffers=10GB (then
^C to stop it).  If that fails to start in the same way, it'd be
interesting to see the output of the second command with strace in
front of it, in the part where it allocates shared memory.  And
perhaps it'd be interesting to see the same output with
/usr/pgsql-9.5/bin/XXX (if you still have the packages).  For example,
on my random dev laptop that looks like:

openat(AT_FDCWD, "/proc/meminfo", O_RDONLY) = 6
fstat(6, {st_mode=S_IFREG|0444, st_size=0, ...}) = 0
read(6, "MemTotal:       16178852 kB\nMemF"..., 1024) = 1024
read(6, ":    903168 kB\nShmemHugePages:  "..., 1024) = 311
close(6)                                = 0
mmap(NULL, 11016339456, PROT_READ|PROT_WRITE,
MAP_SHARED|MAP_ANONYMOUS|MAP_HUGETLB, -1, 0) = -1 ENOMEM (Cannot
allocate memory)
mmap(NULL, 11016003584, PROT_READ|PROT_WRITE,
MAP_SHARED|MAP_ANONYMOUS, -1, 0) = 0x7ff74e579000
shmget(0x52e2c1, 56, IPC_CREAT|IPC_EXCL|0600) = 3244038
shmat(3244038, NULL, 0)                 = 0x7ff9df5ad000

The output is about the same on REL9_5_STABLE and REL_12_STABLE for
me, only slightly different sizes.  If that doesn't fail in the same
way on your system with 12, perhaps there are some more settings from
your real clusters required to make it fail.  You could add them one
by one with -c foo=bar or in the throw away
test_pgdata/postgresql.conf, and perhaps that process might shed some
light?

I was going to ask if it might be a preloaded extension that is asking
for gobs of extra memory in 12, but we can see from your "Failed
system call was shmget(key=5432001, size=11026235392, 03600)" that
it's in the same ballpark as my total above for shared_buffers=10GB.

Be more than happy to test this out. I'll see what I can pull tomorrow and provide some dataz :)   I know it's not ideal to use the same config file, I know that various things are added or changed (usually added) but the defaults are typically safe. But after sometime dialing in the settings for our use case, I've just kind of kept moving them forward.

But  let me do some more testing tomorrow (since I'm trying to get to the bottom of this, before I attempt my big DB upgrades).  So I'll spend some time testing and see if I can't get similar "failures/challenges"? and go from there.

Appreciate the ideas!

Tory


On Mon, May 11, 2020 at 9:57 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:
Tory M Blue <tmblue@gmail.com> writes:
> That may be the next step in the lab, but was hoping someone knew of a
> significant difference.

I think we've made it perfectly clear that we don't.  There's something
odd about your situation.

                        regards, tom lane

totally, and i'll try to provide some more data tomorrow.

Thanks everyone.

Tory 

Re: Is there a significant difference in Memory settings between 9.5and 12

From
"David G. Johnston"
Date:
Repost, edited subject by mistake...

On Monday, May 11, 2020, Tory M Blue <tmblue@gmail.com> wrote:
And just to repeat. Same exact hardware, same kernel, nothing more than installing the latest postgres12, copying my config files from 9.5 to 12 and running the pg_upgrade.

You’ll want to remove the pg_upgrade from the equation and try v12 

David J.

Re: Is there a significant difference in Memory settings between 9.5and 12

From
"David G. Johnston"
Date:
On Monday, May 11, 2020, David G. Johnston <david.g.johnston@gmail.com> wrote:
Repost, edited subject by mistake...

On Monday, May 11, 2020, Tory M Blue <tmblue@gmail.com> wrote:
And just to repeat. Same exact hardware, same kernel, nothing more than installing the latest postgres12, copying my config files from 9.5 to 12 and running the pg_upgrade.

You’ll want to remove the pg_upgrade from the equation and try v12 

Sorry...if you copied the config to v12 before the upgrade and the upgrade worked that suggests that v12 booted up at some point with the configuration, no?  Does pg_upgrade do something special?

David J.


On Mon, May 11, 2020 at 11:09 PM David G. Johnston <david.g.johnston@gmail.com> wrote:
On Monday, May 11, 2020, David G. Johnston <david.g.johnston@gmail.com> wrote:
Repost, edited subject by mistake...

On Monday, May 11, 2020, Tory M Blue <tmblue@gmail.com> wrote:
And just to repeat. Same exact hardware, same kernel, nothing more than installing the latest postgres12, copying my config files from 9.5 to 12 and running the pg_upgrade.

You’ll want to remove the pg_upgrade from the equation and try v12 

Sorry...if you copied the config to v12 before the upgrade and the upgrade worked that suggests that v12 booted up at some point with the configuration, no?  Does pg_upgrade do something special?

David J

Not entirely sure I follow, and it may be that I confused the issue.

9.5 running for years, run the upgrade, and migrate my config files. 12 won't start without bumping the shared_buffers down.

12 won't start with "my" original config files.

I'm going to do native 9.5 and 12 installs on the same piece of hardware, no data migration, no pg_upgrade just to see if I can get 12 to start with my current configuration. I'll try your suggestions from last night as well, see if setting via command line will give us more dataz.

Tory 
 


On Mon, May 11, 2020 at 10:55 PM Tory M Blue <tmblue@gmail.com> wrote:


On Mon, May 11, 2020 at 9:01 PM Thomas Munro <thomas.munro@gmail.com> wrote:
On Tue, May 12, 2020 at 2:52 PM Tory M Blue <tmblue@gmail.com> wrote:
> It took the change but didn't help. So 10GB of shared_buffers in 12 is still a no go. I'm down to 5GB and it works, but this is the same hardware, the same exact 9.5 configuration. So I'm missing something. WE have not had to mess with kernel memory settings since 9.4, so this is an odd one.
>
> I'll keep digging, but i'm hesitant to do my multiple TB db's with half of their shared buffer configs, until I understand what 12 is doing differently than 9.5

Which exact version of 9.5.x are you coming from?  What's the exact
error message on 12 (you showed the shared_memory_type=sysv error, but
with the default  value (mmap) how does it look)?  What's your
huge_pages setting?

9.5-20
postgresql95-9.5.20-2PGDG.rhel7.x86_64
postgresql95-contrib-9.5.20-2PGDG.rhel7.x86_64
postgresql95-libs-9.5.20-2PGDG.rhel7.x86_64
postgresql95-server-9.5.20-2PGDG.rhel7.x86_64 

I don't use huge_pages

And this error is actually from the default mmap

May 08 12:33:58 qdb01.prod.ca postmaster[8790]: < 2020-05-08 12:33:58.324 PDT  >HINT:  This error usually means that PostgreSQL's request for a shared memory segment exceeded available memory, swap space, or huge pages. To reduce the request size (currently 11026235392 bytes), reduce PostgreSQL's shared memory usage, perhaps by reducing shared_buffers or max_connections.

The above error is with 12 trying to start with shared_buffers = 10GB...

9.5 starts fine with the same configuration file.   That kind of started me down this path.

And just to repeat. Same exact hardware, same kernel, nothing more than installing the latest postgres12, copying my config files from 9.5 to 12 and running the pg_upgrade.

9.5 has been running for years with the same configuration file, so something changed somewhere along the line that is preventing 12 to start with the same config file.  And the allocation error is with either the sysv or mman on 12. (will start with 5GB allocated, but not 10GB, on a 15GB box (dedicated postgres server).


Can you reproduce the problem with a freshly created test cluster?  As
a regular user, assuming regular RHEL packaging, something like
/usr/pgsql-12/bin/initdb -D test_pgdata, and then
/usr/pgsql-12/bin/postgres -D test_pgdata -c shared_buffers=10GB (then
^C to stop it).  If that fails to start in the same way, it'd be
interesting to see the output of the second command with strace in
front of it, in the part where it allocates shared memory.  And
perhaps it'd be interesting to see the same output with
/usr/pgsql-9.5/bin/XXX (if you still have the packages).  For example,
on my random dev laptop that looks like:

openat(AT_FDCWD, "/proc/meminfo", O_RDONLY) = 6
fstat(6, {st_mode=S_IFREG|0444, st_size=0, ...}) = 0
read(6, "MemTotal:       16178852 kB\nMemF"..., 1024) = 1024
read(6, ":    903168 kB\nShmemHugePages:  "..., 1024) = 311
close(6)                                = 0
mmap(NULL, 11016339456, PROT_READ|PROT_WRITE,
MAP_SHARED|MAP_ANONYMOUS|MAP_HUGETLB, -1, 0) = -1 ENOMEM (Cannot
allocate memory)
mmap(NULL, 11016003584, PROT_READ|PROT_WRITE,
MAP_SHARED|MAP_ANONYMOUS, -1, 0) = 0x7ff74e579000
shmget(0x52e2c1, 56, IPC_CREAT|IPC_EXCL|0600) = 3244038
shmat(3244038, NULL, 0)                 = 0x7ff9df5ad000

The output is about the same on REL9_5_STABLE and REL_12_STABLE for
me, only slightly different sizes.  If that doesn't fail in the same
way on your system with 12, perhaps there are some more settings from
your real clusters required to make it fail.  You could add them one
by one with -c foo=bar or in the throw away
test_pgdata/postgresql.conf, and perhaps that process might shed some
light?

I was going to ask if it might be a preloaded extension that is asking
for gobs of extra memory in 12, but we can see from your "Failed
system call was shmget(key=5432001, size=11026235392, 03600)" that
it's in the same ballpark as my total above for shared_buffers=10GB.

Be more than happy to test this out. I'll see what I can pull tomorrow and provide some dataz :)   I know it's not ideal to use the same config file, I know that various things are added or changed (usually added) but the defaults are typically safe. But after sometime dialing in the settings for our use case, I've just kind of kept moving them forward.

But  let me do some more testing tomorrow (since I'm trying to get to the bottom of this, before I attempt my big DB upgrades).  So I'll spend some time testing and see if I can't get similar "failures/challenges"? and go from there.

Appreciate the ideas!

Tory

Well that is interesting. Built a new system, installed 9.5 and 12, moved my config file in, added the include line to the standard postgresql.conf file in each version and location
/pgsql/9.5
/pgsql/12

Edited/created custom systemctl  files for each version.

And it starts.There are no errors,  I can start 9.5 and 12 with my config file that i'm attempting to use in the upgraded system.

So maybe, the upgrade is actually doing something funky. I'll do a mock upgrade now . Loaded my data

OKAY I see something but don't understand why..

I loaded my data into 9.5 and 12 both started fine, using my 9.5 data.   I then destroyed the 12 data and ran a clean init and then performed a link upgrade, postgrres 12 started with no issues at all, same shared_buffers 10GB.. I started scratching my head, then I remember I force some stuff via sysctl so I added those and boom, postgres12 will no longer start with the Shared_buffers of 10GB, but 9.5 starts.

May 12 10:59:49 ip-100-98-136-145.ca. postmaster[9975]: < 2020-05-12 10:59:49.719 PDT  >FATAL:  could not map anonymous shared memory: Cannot

So this appears not to be directly related to the upgrade but something with my existing sysctl settings and postgres 12

Anyone know why these settings are causing an issue with 12?

vm.overcommit_memory = 2
    This is the culprit I think. But makes no sense why postgres9.5 allowed it and 12 does not.
vm.overcommit_memory to 2, the vm.overcommit_ratio value becomes relevant. By default, this value is set to 50, which means the system       would only allocate up to 50% of your RAM (plus swap).  (so 15GB system, 10GB request, is more than 50% (but 9.5 worked). Setting to 1 allows it again, but I'm a tad confused on why this is causing an issue in 12 but not 9.5 
vm.dirty_background_ratio = 5
vm.dirty_ratio = 10
vm.swappiness = 0

At least I have an answer as to what, I just am not clear why.

Thanks again for the ideas!

Tory

Tory