Re: BUG #7500: hot-standby replica crash after an initial rsync - Mailing list pgsql-bugs

From Maxim Boguk
Subject Re: BUG #7500: hot-standby replica crash after an initial rsync
Date
Msg-id CAK-MWwQy0RNzkDSn_JdtKEZVATBYT0K_V3aFgXXo9iSPQaeL4g@mail.gmail.com
Whole thread Raw
In response to Re: BUG #7500: hot-standby replica crash after an initial rsync  (Andres Freund <andres@2ndquadrant.com>)
Responses Re: BUG #7500: hot-standby replica crash after an initial rsync  (Maxim Boguk <maxim.boguk@gmail.com>)
List pgsql-bugs
On Wed, Aug 22, 2012 at 6:08 AM, Andres Freund <andres@2ndquadrant.com>wrote:

> On Tuesday, August 21, 2012 03:30:44 PM Maxim Boguk wrote:
> > Hi Andres,
>
> I would add something akin to
>
> elog(WARNING, "pid of startup is: %d, sleeping for 10s", getpid());
> sleep(10);
>

Hi Andres,

There is it:

gdb backtrace:

#0  0x0000000800bd1ab0 in exit () from /lib/libc.so.7
#1  0x00000000005d77f6 in proc_exit (code=1) at ipc.c:137
#2  0x0000000000697948 in errfinish (dummy=<optimized out>) at elog.c:501
#3  0x0000000000488956 in SlruReportIOError (ctl=<optimized out>,
pageno=<optimized out>, xid=361788644) at slru.c:857
#4  0x000000000048956a in SimpleLruReadPage (ctl=0x95bf80, pageno=176654,
write_ok=1 '\001', xid=361788644) at slru.c:427
#5  0x0000000000489f7f in RecordNewMultiXact (multi=361788644,
offset=822580965, nxids=2, xids=0x800e5302c) at multixact.c:764
#6  0x000000000048a126 in multixact_redo (lsn=..., record=0x800e53000) at
multixact.c:2017
#7  0x0000000000484bea in StartupXLOG () at xlog.c:6277
#8  0x00000000004870a8 in StartupProcessMain () at xlog.c:9335
#9  0x000000000049cab2 in AuxiliaryProcessMain (argc=2,
argv=0x7fffffffe450) at bootstrap.c:412
#10 0x00000000005ba926 in StartChildProcess (type=StartupProcess) at
postmaster.c:4442
#11 0x00000000005bccb2 in PostmasterMain (argc=5, argv=0x7fffffffea38) at
postmaster.c:1093
#12 0x0000000000566f3b in main (argc=5, argv=0x7fffffffea38) at main.c:188

relevant database log:
2012-08-22 07:47:21.330 MSK 18155 @ from  [vxid:1/0 txid:0] []LOG:  00000:
redo starts at 925/2C0004C8
2012-08-22 07:47:21.330 MSK 18155 @ from  [vxid:1/0 txid:0] []LOCATION:
StartupXLOG, xlog.c:6202
2012-08-22 07:47:21.384 MSK 18155 @ from  [vxid:1/0 txid:0] []FATAL:
XX000: could not access status of transaction 361788644
2012-08-22 07:47:21.384 MSK 18155 @ from  [vxid:1/0 txid:0] []DETAIL:
Could not read from file "pg_multixact/offsets/1590" at offset 114688:
Unknown error: 0.
2012-08-22 07:47:21.384 MSK 18155 @ from  [vxid:1/0 txid:0] []CONTEXT:
xlog redo create multixact 361788644 offset 822580965: 1768508061 1768508078
2012-08-22 07:47:21.384 MSK 18155 @ from  [vxid:1/0 txid:0] []LOCATION:
SlruReportIOError, slru.c:840


Kind Regards,
Maksym

pgsql-bugs by date:

Previous
From: Robert Haas
Date:
Subject: Re: BUG #4958: Stats collector hung on WaitForMultipleObjectsEx while attempting to recv a datagram
Next
From: Maxim Boguk
Date:
Subject: Re: BUG #7500: hot-standby replica crash after an initial rsync