Re: 7.4Beta1 hang? - Mailing list pgsql-hackers

From Robert Creager
Subject Re: 7.4Beta1 hang?
Date
Msg-id 20030810104633.46acbdbd.Robert_Creager@LogicalChaos.org
In response to 7.4Beta1 hang?  (Robert Creager <Robert_Creager@LogicalChaos.org>)
List pgsql-hackers
On Sat, 9 Aug 2003 17:48:38 -0600
Robert Creager <Robert_Creager@logicalchaos.org> said something like:

I received another hang, this time without a vacuum occurring.  It occurred during a copy operation.  While there is a
checkpoint process running, I don't believe it was there at the start of the hang, but I'm not sure.
 

I did a <make distclean> before configure/make for this install.

Here is the info you asked for last time.

tassiv=# select relname, pg_locks.* from pg_locks left join pg_class on ( pg_locks.relation = pg_class.relfilenode );
      relname      | relation | database | transaction |  pid  |        mode         | granted
-------------------+----------+----------+-------------+-------+---------------------+---------
 pg_class          |     1259 |    17140 |             | 12804 | AccessShareLock     | t
 pg_locks          |    16759 |    17140 |             | 12804 | AccessShareLock     | t
 groups            |    17835 |    17140 |             | 12671 | AccessShareLock     | t
 groups            |    17835 |    17140 |             | 12671 | RowShareLock        | t
 pairs_pair_id_seq |    17844 |    17140 |             | 12671 | AccessShareLock     | t
 pairs             |    17846 |    17140 |             | 12671 | AccessShareLock     | t
 pairs             |    17846 |    17140 |             | 12671 | RowShareLock        | t
 pairs             |    17846 |    17140 |             | 12671 | RowExclusiveLock    | t
 files_file_id_seq |    17855 |    17140 |             | 12671 | AccessShareLock     | t
 files             |    17857 |    17140 |             | 12671 | AccessShareLock     | t
 files             |    17857 |    17140 |             | 12671 | RowShareLock        | t
 files             |    17857 |    17140 |             | 12671 | RowExclusiveLock    | t
 fits              |    17879 |    17140 |             | 12671 | AccessShareLock     | t
 fits              |    17879 |    17140 |             | 12671 | RowExclusiveLock    | t
 obs_id_seq        |    17912 |    17140 |             | 12671 | AccessShareLock     | t
 temp_obs_i        |    18028 |    17140 |             | 12671 | RowExclusiveLock    | t
 imported          |    18045 |    17140 |             | 12671 | AccessShareLock     | t
 imported          |    18045 |    17140 |             | 12671 | RowExclusiveLock    | t
                   |    18101 |    17140 |             | 12671 | AccessExclusiveLock | t
                   |  3021077 |    17140 |             | 12671 | AccessExclusiveLock | t
                   |          |          |       80133 | 12671 | ExclusiveLock       | t
                   |          |          |       80147 | 12804 | ExclusiveLock       | t

tassiv=# select * from pg_stat_activity;
 datid | datname | procpid | usesysid | usename | current_query | query_start
-------+---------+---------+----------+---------+---------------+-------------
 17140 | tassiv  |   12671 |      100 | robert  |               |
 17140 | tassiv  |   12672 |      100 | robert  |               |
 17140 | tassiv  |   12976 |      100 | robert  |               |

Here are the backtraces.  I've built under debug and have ddd ready to go...

postgres: stats collector process
#0  0x4031fec2 in select () from /lib/i686/libc.so.6
#1  0xbfffefb0 in ?? ()
#2  0x0813711e in pgstat_start () at pgstat.c:402
#3  0x08134093 in PostmasterMain (argc=3, argv=0x82861f8) at postmaster.c:871
#4  0x08109edc in main (argc=3, argv=0xbffff954) at main.c:215
#5  0x4025e7f7 in __libc_start_main () from /lib/i686/libc.so.6

postgres: stats buffer process
(gdb) backtrace
#0  0x4031fec2 in select () from /lib/i686/libc.so.6
#1  0x00000005 in ?? ()
#2  0x08138376 in pgstat_main () at pgstat.c:1279
#3  0x0813711e in pgstat_start () at pgstat.c:402
#4  0x08134093 in PostmasterMain (argc=3, argv=0x82861f8) at postmaster.c:871
#5  0x08109edc in main (argc=3, argv=0xbffff954) at main.c:215
#6  0x4025e7f7 in __libc_start_main () from /lib/i686/libc.so.6

postgres: robert tassiv 192.168.0.250 COPY
(gdb) backtrace
#0  0x403279d7 in semop () from /lib/i686/libc.so.6
#1  0x08133151 in PGSemaphoreLock (sema=0x0, interruptOK=1 '\001')
    at pg_sema.c:419
#2  0x081517cc in LWLockAcquire (lockid=11097, mode=LW_EXCLUSIVE)
    at lwlock.c:312
#3  0x081482fd in LockBuffer (buffer=1, mode=2) at bufmgr.c:1925
#4  0x0808439f in _bt_getbuf (rel=0x457767d8, blkno=4294967292, access=2)
    at nbtpage.c:419
#5  0x080827e4 in _bt_split (rel=0x457767d8, buf=5538, firstright=330,
    newitemoff=510, newitemsz=12, newitem=0x82fadb8, newitemonleft=0 '\0',
    itup_off=0xbfffe2fe, itup_blkno=0xbfffe300) at nbtinsert.c:675
#6  0x08082202 in _bt_insertonpg (rel=0x457767d8, buf=5538, stack=0x8316fa0,
    keysz=1, scankey=0x8316ff0, btitem=0x82fadb8, afteritem=0,
    split_only_page=0 '\0') at nbtinsert.c:492
#7  0x08081ce8 in _bt_doinsert (rel=0x457767d8, btitem=0x82fadb8,
    index_is_unique=0 '\0', heapRel=0x456fc898) at nbtinsert.c:141
#8  0x080857ea in btinsert (fcinfo=0xfffffffc) at nbtree.c:263
#9  0x081b77df in OidFunctionCall6 (functionId=331, arg1=4294967292,
    arg2=4294967292, arg3=4294967292, arg4=4294967292, arg5=4294967292,
    arg6=4294967292) at fmgr.c:1345
#10 0x08081190 in index_insert (indexRelation=0x457767d8, datums=0xbfffe5b0,
    nulls=0xbfffe590 " \004", heap_t_ctid=0x840f494, heapRelation=0x456fc898,
    check_uniqueness=0 '\0') at indexam.c:226
#11 0x080f75f4 in ExecInsertIndexTuples (slot=0x0, tupleid=0x840f494,
    estate=0x834a730, is_vacuum=0 '\0') at execUtils.c:860
#12 0x080cb178 in CopyFrom (rel=0x456fc898, attnumlist=0x836dcf0,
    binary=0 '\0', oids=0 '\0', delim=0x8238f5a "\t",
    null_print=0x820ef58 "\\N") at copy.c:1598
#13 0x080c9d38 in DoCopy (stmt=0x1) at copy.c:830
#14 0x08158752 in PortalRunUtility (portal=0x82eae40, query=0x82e5e28,
    dest=0x82e5cd8, completionTag=0xbfffe950 "") at pquery.c:757
#15 0x081589a8 in PortalRunMulti (portal=0x82eae40, dest=0x82e5cd8,
    altdest=0x82e5cd8, completionTag=0xbfffe950 "") at pquery.c:821
#16 0x081582de in PortalRun (portal=0x82eae40, count=2147483647,
    dest=0x82e5cd8, altdest=0x82e5cd8, completionTag=0xbfffe950 "")
    at pquery.c:483
#17 0x081555ec in exec_simple_query (
    query_string=0x82e5ac8 "COPY temp_obs_I        FROM stdin")
    at postgres.c:875
#18 0x08157602 in PostgresMain (argc=4, argv=0x8287678,
    username=0x8287648 "robert") at postgres.c:2850
#19 0x081362e8 in BackendFork (port=0x82941c0) at postmaster.c:2536
#20 0x08135da3 in BackendStartup (port=0x82941c0) at postmaster.c:2184
#21 0x081346cd in ServerLoop () at postmaster.c:1108
#22 0x081340bd in PostmasterMain (argc=3, argv=0x82861f8) at postmaster.c:886
#23 0x08109edc in main (argc=3, argv=0xbffff954) at main.c:215
#24 0x4025e7f7 in __libc_start_main () from /lib/i686/libc.so.6

postgres: checkpoint subprocess
#0  0x403279d7 in semop () from /lib/i686/libc.so.6
#1  0x08133151 in PGSemaphoreLock (sema=0x0, interruptOK=1 '\001')
    at pg_sema.c:419
#2  0x081517cc in LWLockAcquire (lockid=11097, mode=LW_SHARED) at lwlock.c:312
#3  0x08147454 in BufferSync () at bufmgr.c:786
#4  0x0814785a in FlushBufferPool () at bufmgr.c:990
#5  0x08095007 in CreateCheckPoint (shutdown=0 '\0', force=0 '\0')
    at xlog.c:3282
#6  0x0809900a in BootstrapMain (argc=0, argv=0x16) at bootstrap.c:445
#7  0x08136a69 in SSDataBase (xlop=3) at postmaster.c:2842
#8  0x08134797 in ServerLoop () at postmaster.c:1039
#9  0x081340bd in PostmasterMain (argc=3, argv=0x82861f8) at postmaster.c:886
#10 0x08109edc in main (argc=3, argv=0xbffff954) at main.c:215
#11 0x4025e7f7 in __libc_start_main () from /lib/i686/libc.so.6

-- 10:11:47 up 9 days,  2:57,  4 users,  load average: 0.11, 0.41, 0.96
