Thread: Losing data because of problematic configuration?

Losing data because of problematic configuration?

From
"Holtgrewe, Manuel"
Date:

Hi,


I have a database that is meant to deliver high performance for bulk insert operations. I've attached my postgres.conf file.


However, I'm seeing the following behaviour. At around 12:04, I started the database. I then did a bulk insert, which completed. At 12:33, I killed the postgres processes with a SIGSEGV signal. I'm getting the following log excerpt:


< 2021-06-15 12:33:03.656 CEST > LOG:  database system was interrupted; last known up at 2021-06-15 12:04:13 CEST
< 2021-06-15 12:33:03.656 CEST > DEBUG:  removing all temporary WAL segments
< 2021-06-15 12:33:04.525 CEST > DEBUG:  checkpoint record is at 60/7C377C78
[snip]
< 2021-06-15 12:33:04.537 CEST > DEBUG:  resetting unlogged relations: cleanup 1 init 0
< 2021-06-15 12:33:04.553 CEST > DEBUG:  unlinked file "base/16384/107877"
[... snip ... ]
< 2021-06-15 12:33:27.556 CEST > DEBUG:  copying base/16384/80948_init to base/16384/80948
< 2021-06-15 12:33:36.705 CEST > DEBUG:  performing replication slot checkpoint
< 2021-06-15 12:33:38.394 CEST > DEBUG:  attempting to remove WAL segments older than log file 00000000000000600000007C
< 2021-06-15 12:33:38.394 CEST > DEBUG:  removing write-ahead log file "00000001000000600000007C"
< 2021-06-15 12:33:38.403 CEST > DEBUG:  MultiXactId wrap limit is 2147483648, limited by database with OID 1
< 2021-06-15 12:33:38.419 CEST > DEBUG:  oldest MultiXactId member is at offset 1
< 2021-06-15 12:33:38.419 CEST > DEBUG:  MultiXact member stop limit is now 4294914944 based on MultiXact 1
< 2021-06-15 12:33:38.428 CEST > DEBUG:  shmem_exit(0): 1 before_shmem_exit callbacks to make
< 2021-06-15 12:33:38.428 CEST > DEBUG:  shmem_exit(0): 5 on_shmem_exit callbacks to make


So it looks as if the database jumps back "half an hour" to ensure consistent data. Everything in between is lost.


Which configuration settings should I look into to make this more stable? Is there a way to "force flush state to disk" from the application, through the postgres API or SQL?
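The only candidate I am aware of is a manual checkpoint; a rough sketch of what I mean (assuming superuser privileges), though I am not sure it addresses the problem:

    -- Flush all dirty buffers and WAL to disk right now (superuser only).
    CHECKPOINT;

    -- Per-session durability: make each COMMIT wait for its WAL flush
    -- (this is the default setting).
    SET synchronous_commit = on;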


Thank you,

Manuel


Re: Losing data because of problematic configuration?

From
Ron
Date:
On 6/15/21 5:42 AM, Holtgrewe, Manuel wrote:

Hi,


I have a database that is meant to deliver high performance for bulk insert operations. I've attached my postgres.conf file.


However, I'm seeing the following behaviour. At around 12:04, I started the database. I then did a bulk insert, which completed.


What kind of transaction did you use?
Did you commit the transaction?

At 12:33, I killed the postgres processes with a SIGSEGV signal.


Why?

Did you CHECKPOINT beforehand?

(I'm hypothesizing that data didn't get flushed to disk, and so Pg "cleaned itself up" after the crash.)



--
Angular momentum makes the world go 'round.

Re: [ext] Re: Losing data because of problematic configuration?

From
"Holtgrewe, Manuel"
Date:

Hi,


thanks for your answer.


Let me give some background. I have a postgres instance that serves as the data storage for a web-based data analytics application. For some queries, I'm seeing postgres go out of memory because the query's memory use grows too large, and the Linux kernel then kills the postgres process. Afterwards I observe similar log messages and data loss. I'm now trying to reproduce this behaviour more deterministically to find the root cause and resolve the issue.


The bulk import does not run inside a transaction, for performance reasons. My understanding is that in case of a crash I might end up with partial data (which I take care of in the application). However, I would not expect rollback behaviour a few minutes after the bulk import completed correctly.
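For illustration, the load is roughly of this shape (a simplified sketch; the table and file names are made up):

    -- Each COPY runs in its own implicit transaction and commits on its
    -- own, so a crash mid-load leaves earlier batches in place (the
    -- "partial data" that the application takes care of).
    COPY mytable FROM '/data/batch_0001.tsv' WITH (FORMAT text);
    COPY mytable FROM '/data/batch_0002.tsv' WITH (FORMAT text);
    -- ... further batches ...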


FWIW, the analytics application also allows users to annotate the data and these annotations are written to the database in transactions.


So to answer your questions:


> What kind of transaction did you use?


No transaction for the bulk import. Also, the bulk import completed minutes before the kill. After the bulk import, a number of transactions touching different tables are performed.


> Did you commit the transaction?


The bulk import was not done in a transaction. The other transactions were committed through the database access framework I'm using in my (Python/Django) application.


> Why?


To reproduce the problematic behaviour that I'm seeing in my application.


Does this help? Where could I look to understand this better?


Thanks,

Manuel




Re: Losing data because of problematic configuration?

From
Tom Lane
Date:
"Holtgrewe, Manuel" <manuel.holtgrewe@bih-charite.de> writes:
> So it looks as if the database jumps back "half an hour" to ensure consistent data. Everything in between is lost.

Postgres does not lose committed data --- if it did, we'd consider that a
fairly serious bug.  (Well, there are caveats of course.  But most of them
have to do with operating-system crashes or power loss, neither of which
are at stake here.)

I am wondering about this though:

> < 2021-06-15 12:33:04.537 CEST > DEBUG:  resetting unlogged relations: cleanup 1 init 0

Are you perhaps keeping your data in an UNLOGGED table?  If so, resetting
it to empty after a crash is exactly what's supposed to happen.  The
entire point of UNLOGGED is that the performance benefits come at the
cost of losing the data on crash.
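
For instance (an illustrative sketch; the table name here is invented):

    -- UNLOGGED skips WAL for this table: loads are fast, but crash
    -- recovery resets the table to empty.
    CREATE UNLOGGED TABLE demo (id int PRIMARY KEY, payload text);

    -- 'u' marks unlogged relations; note that the index is unlogged too.
    SELECT relname, relpersistence
    FROM pg_class
    WHERE relname IN ('demo', 'demo_pkey');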

            regards, tom lane



Re: [ext] Re: Losing data because of problematic configuration?

From
"Holtgrewe, Manuel"
Date:

>> < 2021-06-15 12:33:04.537 CEST > DEBUG:  resetting unlogged relations: cleanup 1 init 0
>
> Are you perhaps keeping your data in an UNLOGGED table?  If so, resetting
> it to empty after a crash is exactly what's supposed to happen.  The
> entire point of UNLOGGED is that the performance benefits come at the
> cost of losing the data on crash.


D'oh!


Yes, that is the case. I used UNLOGGED once, but I was convinced that I had reverted this in some version. It is not in my main deployment, though. Now I have to find out (a) why I have diverging deployments and (b) how that bug came to be.


Thanks a lot!




Re: [ext] Re: Losing data because of problematic configuration?

From
"Holtgrewe, Manuel"
Date:

Hi again,


thank you for pinpointing the issue.


I have now updated the table with "ALTER TABLE $table SET LOGGED" (it's actually a partitioned table, so I've altered both the main table and the partitions).
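In case it matters, the partition updates were along these lines (a sketch; "mytable" stands in for the real parent table):

    -- Flip the parent, then each partition; SET LOGGED did not appear
    -- to recurse to the partitions on its own.
    ALTER TABLE mytable SET LOGGED;

    DO $$
    DECLARE
      part regclass;
    BEGIN
      FOR part IN
        SELECT inhrelid::regclass
        FROM pg_inherits
        WHERE inhparent = 'mytable'::regclass
      LOOP
        EXECUTE format('ALTER TABLE %s SET LOGGED', part);
      END LOOP;
    END $$;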


I wanted to double-check the result, and using "select relpersistence, relname from pg_class" I found that, e.g., ${table_name}_383_pkey still has its relpersistence set to "u", whereas ${table_name}_383 now has it set to "p".
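The check looks roughly like this (sketch; it lists everything that is still unlogged):

    -- Anything listed here kept relpersistence 'u' after the SET LOGGED
    -- pass; relkind 'r' = table, 'i' = index.
    SELECT relname, relkind, relpersistence
    FROM pg_class
    WHERE relpersistence = 'u'
    ORDER BY relname;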


Does anyone have an idea what I'm doing wrong here?


Thank you!


Manuel




Re: [ext] Re: Losing data because of problematic configuration?

From
"Holtgrewe, Manuel"
Date:

Please ignore my previous email. I could not reproduce the issue after all.


--
Dr. Manuel Holtgrewe, Dipl.-Inform.
Bioinformatician
Core Unit Bioinformatics – CUBI
Berlin Institute of Health / Max Delbrück Center for Molecular Medicine in the Helmholtz Association / Charité – Universitätsmedizin Berlin

Visiting Address: Invalidenstr. 80, 3rd Floor, Room 03 028, 10117 Berlin
Postal Address: Charitéplatz 1, 10117 Berlin

E-Mail: manuel.holtgrewe@bihealth.de
Phone: +49 30 450 543 607
Fax: +49 30 450 7 543 901
Web: cubi.bihealth.org  www.bihealth.org  www.mdc-berlin.de  www.charite.de
