[GENERAL] standby database crash - Mailing list pgsql-general

From	Seong Son (US)
Subject	[GENERAL] standby database crash
Date	August 1, 2017 00:15:18
Msg-id	BY2PR17MB032844BB5004388A0D6632E784B20@BY2PR17MB0328.namprd17.prod.outlook.com
Responses	Re: [GENERAL] standby database crash
List	pgsql-general

I have a client who has streaming replication set up with the primary in one city and the standby in another. Both are identical servers running PostgreSQL 9.6 on Windows Server 2012.

They have some network issues, which cause the connection from the primary to the standby to drop sometimes. Recently the standby crashed with the following log, and it could not be restarted.

2017-07-18 09:21:13 UTC FATAL: invalid memory alloc request size 4148830208

2017-07-18 09:21:14 UTC LOG: startup process (PID 5608) exited with exit code 1

2017-07-18 09:21:14 UTC LOG: terminating any other active server processes

2017-07-18 09:21:14 UTC LOG: database system is shut down

The last entry from pg_xlogdump shows the following:

pg_xlogdump: FATAL: error in WAL record at D5/D1BD5FD0: unexpected pageaddr D1/E7BD6000 in log segment 00000000000000D5000000D1, offset 12410880
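
For readers decoding the addresses in that message, here is a minimal sketch of the arithmetic, assuming the default 16 MB WAL segment size and 8 kB WAL page size (neither is stated in this post). The helper names `parse_lsn`, `format_lsn`, `locate` and `segment_suffix` are hypothetical and are not part of any PostgreSQL tool.

```python
# Assumed defaults; the post does not say how the servers were built.
WAL_SEGMENT_SIZE = 16 * 1024 * 1024   # bytes per WAL segment file
WAL_PAGE_SIZE = 8192                  # bytes per WAL page


def parse_lsn(text):
    """Turn an address such as 'D5/D1BD5FD0' into a 64-bit integer."""
    high, low = text.split("/")
    return (int(high, 16) << 32) | int(low, 16)


def format_lsn(lsn):
    """Format a 64-bit address back into the 'X/X' notation."""
    return "%X/%X" % (lsn >> 32, lsn & 0xFFFFFFFF)


def locate(lsn):
    """Return (log id, segment within log, byte offset within segment)."""
    segments_per_log = 0x100000000 // WAL_SEGMENT_SIZE
    segment_number = lsn // WAL_SEGMENT_SIZE
    offset = lsn % WAL_SEGMENT_SIZE
    return segment_number // segments_per_log, segment_number % segments_per_log, offset


def segment_suffix(lsn):
    """Last 16 hex digits of the segment file name (the timeline prefix is omitted)."""
    log_id, segment, _ = locate(lsn)
    return "%08X%08X" % (log_id, segment)


record = parse_lsn("D5/D1BD5FD0")   # where the failing record starts
found = parse_lsn("D1/E7BD6000")    # page address actually read from disk

# First page boundary after the start of the failing record.
next_page = (record // WAL_PAGE_SIZE + 1) * WAL_PAGE_SIZE

print(segment_suffix(record), locate(record)[2])   # segment and offset of the record
print(format_lsn(next_page), locate(next_page)[2]) # page the reader expected next
print(segment_suffix(found), locate(found)[2])     # where the page found claims to belong
```

Under those assumptions, the record starts 48 bytes before the page boundary at offset 12410880 of segment ...D5000000D1, and the page header found there carries an address with the same in-segment offset but belonging to the older segment ...D1000000E7. That pattern is consistent with leftover contents of a recycled segment file, although it does not by itself explain the memory allocation failure.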

So my questions are: could an old WAL segment being resent over the network cause a crash like this? Shouldn’t PostgreSQL be able to handle out-of-order WAL segments instead of just crashing?

And what would be the best way to recover the standby server? Resyncing the entire database seems too time-consuming.

Thanks in advance for any info.

-Seong