BUG #8701: recover process hang on slave - Mailing list pgsql-bugs

From amutu@amutu.com
Subject BUG #8701: recover process hang on slave
Date
Msg-id E1Vw0zD-00042M-PU@wrigleys.postgresql.org
Responses Re: BUG #8701: recover process hang on slave  (Alvaro Herrera <alvherre@2ndquadrant.com>)
Re: BUG #8701: recover process hang on slave  (Sergey Konoplev <gray.ru@gmail.com>)
List pgsql-bugs
The following bug has been logged on the website:

Bug reference:      8701
Logged by:          amutu
Email address:      amutu@amutu.com
PostgreSQL version: 9.1.9
Operating system:   CentOS 6 x86-64
Description:

We have a master and two streaming slaves. We found that the
replay_location of one slave is far behind the other's.


On both slaves, sent_location is BF1/921F6000, and write_location and
flush_location are similar. However, replay_location is BF1/9210DD10 on one
server and 6DE/D958E8 on the other.
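
To put a rough number on the gap, the two replay_location values can be compared as positions in the WAL stream. This is a minimal Python sketch, assuming each location can be treated as one 64-bit position (the part before the slash as the high 32 bits, the part after as the low 32 bits). The helper names are hypothetical. On 9.1 the result is only an estimate: pg_xlog_location_diff() is not available before 9.2, and as far as I know older releases do not use every segment of each log id.

```python
def parse_lsn(text):
    """Split a location such as 'BF1/9210DD10' into (high, low) integers."""
    high, low = text.split("/")
    return int(high, 16), int(low, 16)


def approx_lag_bytes(ahead, behind):
    """Approximate byte distance between two WAL locations.

    Assumption: the location is a flat 64-bit position. This slightly
    overstates the distance on releases that skip part of each log id.
    """
    a_high, a_low = parse_lsn(ahead)
    b_high, b_low = parse_lsn(behind)
    return ((a_high << 32) + a_low) - ((b_high << 32) + b_low)


if __name__ == "__main__":
    lag = approx_lag_bytes("BF1/9210DD10", "6DE/D958E8")
    print("approximate replay lag: %.2f TiB" % (lag / 2.0 ** 40))
```

The high parts differ by 0xBF1 - 0x6DE = 1299 log ids, so the lagging slave is on the order of 5 TiB of WAL behind the healthy one.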


On the abnormal server, top shows a postgres process replaying WAL file
00000001000006DE00000000, and that process is using 100% of a CPU core.
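
The segment being replayed is consistent with the lagging replay_location of 6DE/D958E8. The sketch below shows the mapping from a location to a segment file name, assuming the default 16 MB segment size (not stated in this report) and timeline 1 (taken from the file name). The helper name is hypothetical.

```python
XLOG_SEG_SIZE = 16 * 1024 * 1024  # assumed default WAL segment size


def wal_file_name(lsn, timeline=1):
    """Return the 24-hex-digit WAL segment file name that contains lsn."""
    high, low = lsn.split("/")
    log_id = int(high, 16)
    # Segment number within this log id: byte offset divided by segment size.
    seg_no = int(low, 16) // XLOG_SEG_SIZE
    return "%08X%08X%08X" % (timeline, log_id, seg_no)


if __name__ == "__main__":
    print(wal_file_name("6DE/D958E8"))
```

The offset 0xD958E8 is below 16 MB, so it falls in segment 0 of log id 6DE, which matches the file the startup process is stuck on.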


I tried to restart the slave, but that failed.
I took a core dump of the process; it shows:


Loaded symbols for /lib64/ld-linux-x86-64.so.2
Core was generated by `postgres: startup process   recovering
00000001000006DE00000000'.
#0  0x00000000006264e8 in smgrclose ()
Missing separate debuginfos, use: debuginfo-install
glibc-2.12-1.49.tl1.x86_64
(gdb) bt
#0  0x00000000006264e8 in smgrclose ()
#1  0x00000000006265c8 in smgrcloseall ()
#2  0x0000000000495322 in XLogDropDatabase ()
#3  0x0000000000516253 in dbase_redo ()
#4  0x0000000000492d40 in StartupXLOG ()
#5  0x0000000000495148 in StartupProcessMain ()
#6  0x00000000004ac26f in AuxiliaryProcessMain ()
#7  0x00000000005eb383 in StartChildProcess ()
#8  0x00000000005ef3dc in PostmasterMain ()
#9  0x0000000000590fe8 in main ()
