7.0.2 crash (maybe linux kernel bug??) - Mailing list pgsql-hackers
From | Michael J Schout |
---|---|
Subject | 7.0.2 crash (maybe linux kernel bug??) |
Date | |
Msg-id | Pine.LNX.4.10.10010311309570.20237-100000@galaxy.gkg-com.com Whole thread Raw |
Responses |
Re: 7.0.2 crash (maybe linux kernel bug??)
|
List | pgsql-hackers |
Hi. Ive had a crash in postgresql 7.0.2. Looking at what happened, I actually suspect that this is a filesystem bug, and not a postgresql bug necessarily, but I wanted to report it here and see if anyone else had any opinions. The platform this happened on was linux (redhat 6.2), kernel 2.2.16 (SMP) dual pentium III 500MHz cpus, Mylex DAC960 raid controller running in raid5 mode. During regular activity, I got a kernel oops. Looking at the call trace from the kernel, as well as the EIP, I think maybe there is a bug here int the fs buffer code, and that htis is a linux kernel problem (not a postgresql problem). Bug I'm no expert here.. Does this sould correct looking at the kernel erros below? Sorry if this is off topic. I just want to make sure this is a kernel bug and not a postgresql bug. Mike The oopses: kernel: Unable to handle kernel NULL pointer dereference at virtual address 00000134 kernel: current->tss.cr3 = 1a325000, %%cr3 = 1a325000 kernel: *pde = 00000000 kernel: Oops: 0002 kernel: CPU: 0 kernel: EIP: 0010:[remove_from_queues+169/328] kernel: EFLAGS: 00010206 kernel: eax: 00000100 ebx: 00000002 ecx: df022e40 edx: efba76b8 kernel: esi: df022e40 edi: 00000000 ebp: 00000000 esp: da327ea4 kernel: ds: 0018 es: 0018 ss: 0018 kernel: Process postmaster (pid: 11527, process nr: 51, stackpage=da327000) kernel: Stack: df022e40 c012be79 df022e40 df022e40 00001000 c0142cb8 c0142cc7 df022e40 kernel: ec247140 ffffffea ec0b026c da326000 df022e40 df022e40 df022e40 000a4000 kernel: 00000000 da327f08 00000000 00000000 eff29200 00001000 000000a5 000a5000 kernel: Call Trace: [refile_buffer+77/184] [ext2_file_write+996/1584] [ext2_file_write+1011/1584] [kfree_skbmem+51/64] [__kfree_skb+162/168][lockd:__insmod_lockd_O/lib/modules/2.2.16-3smp/fs/lockd.o_M394EA7+-76392/76] [handle_IRQ_event+90/140] kernel: [sys_write+240/292] [ext2_file_write+0/1584] [system_call+52/56] [startup_32+43/164] kernel: Code: 89 50 34 c7 01 00 00 00 00 89 02 c7 41 34 00 00 00 00 ff 0d kernel: Unable to handle kernel NULL pointer dereference at virtual address 00000100 kernel: current->tss.cr3 = 1ba46000, %%cr3 = 1ba46000 kernel: *pde = 00000000 kernel: Oops: 0000 kernel: CPU: 1 kernel: EIP: 0010:[find_buffer+104/144] kernel: EFLAGS: 00010206 kernel: eax: 00000100 ebx: 00000007 ecx: 00069dae edx: 00000100 kernel: esi: 0000000d edi: 00003006 ebp: 0005ce4b esp: e53a19f4 kernel: ds: 0018 es: 0018 ss: 0018 kernel: Process postmaster (pid: 5545, process nr: 37, stackpage=e53a1000) kernel: Stack: 0005ce4b 00003006 00069dae c012b953 00003006 0005ce4b 00001000 c012bcc6 kernel: 00003006 0005ce4b 00001000 00003006 eff29200 00003006 00004e4b ef18c960 kernel: c0141ee7 00003006 0005ce4b 00001000 0005ce4b e53a1bb0 edc3c660 edc3c660 kernel: Call Trace: [get_hash_table+23/36] [getblk+30/324] [ext2_new_block+2291/2756] [getblk+271/324] [ext2_alloc_block+344/356][block_getblk+305/624] [ext2_getblk+256/524] kernel: [ext2_file_write+1308/1584] [__brelse+19/84] [permission+36/248] [dump_seek+53/104] [dump_seek+53/104] [dump_write+48/84][elf_core_dump+3104/3216] [do_IRQ+82/92] kernel: [tcp_write_xmit+407/472] [__release_sock+36/124] [tcp_do_sendmsg+2125/2144] [inet_sendmsg+0/144] [cprt+1553/20096][cprt+1553/20096] [cprt+1553/20096] [do_signal+458/724] kernel: [force_sig_info+168/180] [force_sig+17/24] [do_general_protection+54/160] [error_code+45/52] [signal_return+20/24] kernel: Code: 8b 00 39 6a 04 75 15 8b 4c 24 20 39 4a 08 75 0c 66 39 7a 0c
pgsql-hackers by date: