Hot standby 9.2.6 -> 9.2.6 PANIC: WAL contains references to invalid pages - Mailing list pgsql-bugs
| From | Sergey Konoplev |
|---|---|
| Subject | Hot standby 9.2.6 -> 9.2.6 PANIC: WAL contains references to invalid pages |
| Date | |
| Msg-id | CAL_0b1s4QCkFy_55kk_8XWcJPs7wsgVWf8vn4=jXe6V4R7Hxmg@mail.gmail.com |
| Responses | Re: Hot standby 9.2.6 -> 9.2.6 PANIC: WAL contains references to invalid pages (3 responses) |
| List | pgsql-bugs |
Hi,
Linux 2.6.32
PostgreSQL 9.2.6
The replica was already suffering from this problem on 9.2.4, mostly
over the last couple of weeks, when I had to rebuild it almost every
3 days. I hoped it would be fixed in 9.2.6, but it is not.
Yesterday I upgraded both the master and replica servers to 9.2.6,
ran SET vacuum_freeze_table_age = 0; VACUUM; and then rebuilt
the replica from scratch.
Below are replica logs, pg_controldata output, configuration and the
backtrace of the core dump. I can keep the replica untouched (without
rebuilding it) for several more hours, so please let me know if you
need me to show you something else.
Replica logs:
2013-12-19 20:51:22 MSK 19938 @ from [vxid:1/0 txid:0] [] WARNING: page 14833 of relation base/16436/3321003988 is uninitialized
2013-12-19 20:51:22 MSK 19938 @ from [vxid:1/0 txid:0] [] CONTEXT: xlog redo vacuum: rel 1663/16436/3321003988; blk 38538, lastBlockVacuumed 0
2013-12-19 20:51:22 MSK 19938 @ from [vxid:1/0 txid:0] [] PANIC: WAL contains references to invalid pages
2013-12-19 20:51:22 MSK 19938 @ from [vxid:1/0 txid:0] [] CONTEXT: xlog redo vacuum: rel 1663/16436/3321003988; blk 38538, lastBlockVacuumed 0
2013-12-19 20:51:22 MSK 19935 @ from [vxid: txid:0] [] LOG: startup process (PID 19938) was terminated by signal 6: Aborted
2013-12-19 20:51:22 MSK 19935 @ from [vxid: txid:0] [] LOG: terminating any other active server processes
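For anyone who wants to inspect the page named in the WARNING directly on disk, here is a minimal sketch of how a block number maps to a file and byte offset. It assumes the default 8192-byte block size and 131072 blocks per 1 GB segment file; `locate_block` is an illustrative helper of mine, not a PostgreSQL function.

```python
BLOCK_SIZE = 8192            # assumed default database block size
BLOCKS_PER_SEGMENT = 131072  # assumed default: 1 GB segment files


def locate_block(relpath, blkno):
    """Return (file path, byte offset) of a block in a relation's main fork.

    relpath is relative to the data directory, e.g. "base/16436/3321003988".
    Segments after the first carry a numeric suffix (".1", ".2", ...).
    """
    segno, blk_in_seg = divmod(blkno, BLOCKS_PER_SEGMENT)
    path = relpath if segno == 0 else f"{relpath}.{segno}"
    return path, blk_in_seg * BLOCK_SIZE


# Page reported as uninitialized in the WARNING line.
print(locate_block("base/16436/3321003988", 14833))
# Block named in the "xlog redo vacuum" context line.
print(locate_block("base/16436/3321003988", 38538))
```

Both blocks fall in the first segment file, so something like `dd bs=8192 skip=14833 count=1` against that file on the replica would show the page contents.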
pg_controldata output:
pg_control version number: 922
Catalog version number: 201204301
Database system identifier: 5858109675396804534
Database cluster state: in archive recovery
pg_control last modified: Thu 19 Dec 2013 08:37:31 PM MSK
Latest checkpoint location: 3271/79F16848
Prior checkpoint location: 3270/77E32598
Latest checkpoint's REDO location: 3270/929671E0
Latest checkpoint's TimeLineID: 2
Latest checkpoint's full_page_writes: on
Latest checkpoint's NextXID: 4/1572485244
Latest checkpoint's NextOID: 3349674903
Latest checkpoint's NextMultiXactId: 6730476
Latest checkpoint's NextMultiOffset: 30866630
Latest checkpoint's oldestXID: 1340099111
Latest checkpoint's oldestXID's DB: 16436
Latest checkpoint's oldestActiveXID: 1572038146
Time of latest checkpoint: Thu 19 Dec 2013 07:00:09 PM MSK
Minimum recovery ending location: 3272/666EE368
Backup start location: 0/0
Backup end location: 0/0
End-of-backup record required: no
Current wal_level setting: hot_standby
Current max_connections setting: 550
Current max_prepared_xacts setting: 0
Current max_locks_per_xact setting: 64
Maximum data alignment: 8
Database block size: 8192
Blocks per segment of large relation: 131072
WAL block size: 8192
Bytes per WAL segment: 16777216
Maximum length of identifiers: 64
Maximum columns in an index: 32
Maximum size of a TOAST chunk: 1996
Date/time type storage: 64-bit integers
Float4 argument passing: by value
Float8 argument passing: by value
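As a cross-check, the minimum recovery ending location above can be mapped to a WAL segment file name. This is a minimal sketch assuming the 9.2 naming scheme (timeline, log id, segment number, each as 8 hex digits) and the 16 MB segment size reported above; `wal_file_name` is an illustrative helper, not a server function.

```python
WAL_SEGMENT_SIZE = 16 * 1024 * 1024  # "Bytes per WAL segment" from pg_controldata


def wal_file_name(timeline, lsn):
    """Map a timeline and an 'xlogid/xrecoff' location to a WAL file name."""
    xlogid_hex, xrecoff_hex = lsn.split("/")
    xlogid = int(xlogid_hex, 16)
    segment = int(xrecoff_hex, 16) // WAL_SEGMENT_SIZE
    return f"{timeline:08X}{xlogid:08X}{segment:08X}"


# Minimum recovery ending location on timeline 2.
print(wal_file_name(2, "3272/666EE368"))  # 000000020000327200000066
```

The result is the same segment that the startup process reports it was recovering in the core dump header further down.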
Non-default configuration:
application_name = psql
archive_command = (disabled)
autovacuum_analyze_scale_factor = 0.05
autovacuum_max_workers = 5
autovacuum_naptime = 5
autovacuum_vacuum_cost_delay = 5
autovacuum_vacuum_scale_factor = 0.05
bgwriter_delay = 10
bgwriter_lru_multiplier = 10
checkpoint_completion_target = 0.9
checkpoint_segments = 1024
checkpoint_timeout = 3600
checkpoint_warning = 300
client_encoding = UTF8
DateStyle = ISO, DMY
default_text_search_config = pg_catalog.russian
effective_cache_size = 17432576
effective_io_concurrency = 32
hot_standby = on
hot_standby_feedback = on
lc_collate = ru_RU.UTF-8
lc_ctype = ru_RU.UTF-8
lc_messages = en_US.UTF-8
lc_monetary = ru_RU.UTF-8
lc_numeric = ru_RU.UTF-8
lc_time = ru_RU.UTF-8
listen_addresses = *
log_checkpoints = on
log_directory = /var/log/pgsql
log_file_mode = 0600
log_filename = postgresql-%Y-%m-%d.log
log_line_prefix = %t %p %u@%d from %h [vxid:%v txid:%x] [%i]
log_lock_waits = on
log_rotation_size = 0
log_statement = ddl
log_timezone = W-SU
log_truncate_on_rotation = on
logging_collector = on
maintenance_work_mem = 1048576
max_connections = 550
max_stack_depth = 2048
max_standby_streaming_delay = 300000
max_wal_senders = 3
pg_stat_statements.track = all
port = 6432
server_encoding = UTF8
shared_buffers = 4587520
shared_preload_libraries = pg_stat_statements
synchronous_commit = off
TimeZone = W-SU
track_activity_query_size = 4096
transaction_isolation = read committed
unix_socket_permissions = 0777
wal_buffers = 2048
wal_keep_segments = 2048
wal_level = hot_standby
work_mem = 262144
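The memory-related values in this listing are raw numbers in the server's base units, not byte counts. A minimal sketch for reading them, assuming the units that pg_settings reports on 9.2 (8kB pages for shared_buffers, effective_cache_size and wal_buffers; kB for work_mem and maintenance_work_mem); the unit table is my assumption, not part of the listing:

```python
# Units are assumed from pg_settings.unit on 9.2; they are not in the listing.
UNIT_BYTES = {"8kB": 8192, "kB": 1024}

# name -> (raw value from the listing, assumed unit)
SETTINGS = {
    "shared_buffers": (4587520, "8kB"),
    "effective_cache_size": (17432576, "8kB"),
    "wal_buffers": (2048, "8kB"),
    "work_mem": (262144, "kB"),
    "maintenance_work_mem": (1048576, "kB"),
}


def human(nbytes):
    """Format a byte count using binary multiples (1 kB = 1024 bytes)."""
    size = float(nbytes)
    for unit in ("B", "kB", "MB", "GB"):
        if size < 1024:
            return f"{size:g} {unit}"
        size /= 1024
    return f"{size:g} TB"


def setting_size(name):
    """Return a setting's value as a human-readable size."""
    value, unit = SETTINGS[name]
    return human(value * UNIT_BYTES[unit])


for name in SETTINGS:
    print(f"{name} = {setting_size(name)}")
```

Under those assumptions shared_buffers works out to 35 GB and effective_cache_size to 133 GB.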
Backtrace of the coredump:
Core was generated by `postgres: startup process recovering 000000020000327200000066 '.
Program terminated with signal 6, Aborted.
#0 0x00007f6a5a8818a5 in raise () from /lib64/libc.so.6
Missing separate debuginfos, use: debuginfo-install postgresql92-server-9.2.6-1PGDG.el6.x86_64
(gdb) set print pretty on
(gdb) bt f
#0 0x00007f6a5a8818a5 in raise () from /lib64/libc.so.6
No symbol table info available.
#1 0x00007f6a5a883085 in abort () from /lib64/libc.so.6
No symbol table info available.
#2 0x000000000071c580 in errfinish (dummy=<value optimized out>) at elog.c:534
        edata = 0xb88fc0
        elevel = 22
        oldcontext = 0x134a740
        econtext = 0x0
        __func__ = "errfinish"
#3 0x000000000071d09c in elog_finish (elevel=<value optimized out>, fmt=<value optimized out>) at elog.c:1213
        edata = 0xb88fc0
        oldcontext = 0x134a740
        __func__ = "elog_finish"
#4 0x00000000004a6d6d in log_invalid_page (node=..., forkno=MAIN_FORKNUM, blkno=14833, present=1 '\001') at xlogutils.c:91
        key = {
          node = {
            spcNode = 1663,
            dbNode = 16436,
            relNode = 3321003988
          },
          forkno = 25867822,
          blkno = 0
        }
        hentry = <value optimized out>
        found = <value optimized out>
        __func__ = "log_invalid_page"
#5 0x00000000004a6ea0 in XLogReadBufferExtended (rnode=..., forknum=MAIN_FORKNUM, blkno=14833, mode=RBM_NORMAL) at xlogutils.c:358
        page = <value optimized out>
        lastblock = 38539
        buffer = <value optimized out>
        smgr = <value optimized out>
#6 0x00000000004897a2 in btree_xlog_vacuum (lsn=<value optimized out>, record=0x7f6a4fb20010) at nbtxlog.c:510
        blkno = 14833
        xlrec = 0x7f6a4fb20030
        buffer = <value optimized out>
        page = <value optimized out>
        opaque = <value optimized out>
#7 btree_redo (lsn=<value optimized out>, record=0x7f6a4fb20010) at nbtxlog.c:1062
        info = <value optimized out>
        __func__ = "btree_redo"
#8 0x00000000004a3468 in StartupXLOG () at xlog.c:6931
        recoveryContinue = 1 '\001'
        recoveryApply = <value optimized out>
        errcontext = {
          previous = 0x0,
          callback = 0x49d5a0 <rm_redo_error_callback>,
          arg = 0x7f6a4fb20010
        }
        xtime = <value optimized out>
        rmid = <value optimized out>
        xlogctl = 0x7f6a6de08480
        Insert = <value optimized out>
        checkPoint = {
          redo = {
            xlogid = 12896,
            xrecoff = 4261478784
          },
          ThisTimeLineID = 2,
          fullPageWrites = 1 '\001',
          nextXidEpoch = 4,
          nextXid = 1452297001,
          nextOid = 3344849815,
          nextMulti = 6695025,
          nextMultiOffset = 30699263,
          oldestXid = 1256929431,
          oldestXidDB = 12865,
          time = 1387392623,
          oldestActiveXid = 1452282892
        }
        wasShutdown = <value optimized out>
        reachedStopPoint = 0 '\000'
        haveBackupLabel = <value optimized out>
        RecPtr = <value optimized out>
        checkPointLoc = {
          xlogid = 12896,
          xrecoff = 4271100680
        }
        EndOfLog = <value optimized out>
        endLogId = <value optimized out>
        endLogSeg = <value optimized out>
        record = 0x7f6a4fb20010
        freespace = <value optimized out>
        oldestActiveXID = <value optimized out>
        backupEndRequired = <value optimized out>
        backupFromStandby = <value optimized out>
        dbstate_at_startup = <value optimized out>
        __func__ = "StartupXLOG"
#9 0x00000000006153b8 in StartupProcessMain () at startup.c:220
        __func__ = "StartupProcessMain"
#10 0x00000000004c3837 in AuxiliaryProcessMain (argc=2, argv=0x7fffa02b0100) at bootstrap.c:418
        progname = 0x854a1a "postgres"
        flag = <value optimized out>
        userDoption = 0x0
        __func__ = "AuxiliaryProcessMain"
#11 0x00000000006105f3 in StartChildProcess (type=StartupProcess) at postmaster.c:4522
        pid = <value optimized out>
        av = {0x854a1a "postgres", 0x7fffa02b0150 "-x2", 0x0,
          0x7fffa02f0814 "H\213\025e\376\377\377H+B(\213J8H#B0H\017\257\301\213J<H\323\370D;\"u\245H\003C\b1\322\061\311H\213\063H=\377ɚ;v\033\017\037\204",
          0x7fffa02b0170 "\300\226\064\001", 0x0, 0x7fffa02b0150 "-x2",
          0x7fffa02f0814 "H\213\025e\376\377\377H+B(\213J8H#B0H\017\257\301\213J<H\323\370D;\"u\245H\003C\b1\322\061\311H\213\063H=\377ɚ;v\033\017\037\204",
          0xb41080 ">-\262R",
          0x0}
        ac = 2
        typebuf = "-x2\000\377\177\000\000\274\b/\240\377\177\000\000\005\000\000\000\000\000\000\000\005\000\000\000\000\000\000"
        __func__ = "StartChildProcess"
#12 0x0000000000614c3c in PostmasterMain (argc=<value optimized out>, argv=<value optimized out>) at postmaster.c:1123
        opt = <value optimized out>
        status = <value optimized out>
        userDoption = <value optimized out>
        listen_addr_saved = 1 '\001'
        i = <value optimized out>
        __func__ = "PostmasterMain"
#13 0x00000000005b37e0 in main (argc=5, argv=0x13496a0) at main.c:199
No locals.
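gdb prints the checkpoint fields in frame #8 as decimal integers, which makes them hard to compare with the hexadecimal locations in the pg_controldata output. A minimal sketch of the conversion, assuming the usual xlogid/xrecoff hexadecimal notation and that the time field is a Unix timestamp; `format_lsn` is an illustrative helper:

```python
from datetime import datetime, timezone


def format_lsn(xlogid, xrecoff):
    """Render an (xlogid, xrecoff) pair in hexadecimal 'X/X' notation."""
    return f"{xlogid:X}/{xrecoff:X}"


# Values copied from the checkPoint and checkPointLoc locals in frame #8.
redo_lsn = format_lsn(12896, 4261478784)
checkpoint_lsn = format_lsn(12896, 4271100680)
checkpoint_time = datetime.fromtimestamp(1387392623, tz=timezone.utc)

print("checkPoint.redo:", redo_lsn)
print("checkPointLoc:  ", checkpoint_lsn)
print("checkPoint.time:", checkpoint_time.isoformat())
```

The locations come out as 3260/FE010180 and 3260/FE93D308, so they are older than the latest checkpoint location shown by pg_controldata.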
--
Kind regards,
Sergey Konoplev
PostgreSQL Consultant and DBA
http://www.linkedin.com/in/grayhemp
+1 (415) 867-9984, +7 (901) 903-0499, +7 (988) 888-1979
gray.ru@gmail.com