Re: 7.4Beta1 hang?

From: Robert Creager <Robert_Creager(at)LogicalChaos(dot)org>
To: PostgreSQL Hackers <pgsql-hackers(at)postgresql(dot)org>
Cc: Tom Lane <tgl(at)sss(dot)pgh(dot)pa(dot)us>
Subject: Re: 7.4Beta1 hang?
Date: 2003-08-10 16:46:33
Message-ID: 20030810104633.46acbdbd.Robert_Creager@LogicalChaos.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-hackers

On Sat, 9 Aug 2003 17:48:38 -0600
Robert Creager <Robert_Creager(at)logicalchaos(dot)org> said something like:

I received another hang, this time without a vacuum occurring. It occurred during a copy operation. While there is a checkpoint process running, I don't believe it was there at the start of the hang, but I'm not sure.

I did a <make distclean> before configure/make for this install.

Here is the info you asked for last time.

tassiv=# select relname, pg_locks.* from pg_locks left join pg_class on ( pg_locks.relation = pg_class.relfilenode );
relname | relation | database | transaction | pid | mode | granted
-------------------+----------+----------+-------------+-------+---------------------+---------
pg_class | 1259 | 17140 | | 12804 | AccessShareLock | t
pg_locks | 16759 | 17140 | | 12804 | AccessShareLock | t
groups | 17835 | 17140 | | 12671 | AccessShareLock | t
groups | 17835 | 17140 | | 12671 | RowShareLock | t
pairs_pair_id_seq | 17844 | 17140 | | 12671 | AccessShareLock | t
pairs | 17846 | 17140 | | 12671 | AccessShareLock | t
pairs | 17846 | 17140 | | 12671 | RowShareLock | t
pairs | 17846 | 17140 | | 12671 | RowExclusiveLock | t
files_file_id_seq | 17855 | 17140 | | 12671 | AccessShareLock | t
files | 17857 | 17140 | | 12671 | AccessShareLock | t
files | 17857 | 17140 | | 12671 | RowShareLock | t
files | 17857 | 17140 | | 12671 | RowExclusiveLock | t
fits | 17879 | 17140 | | 12671 | AccessShareLock | t
fits | 17879 | 17140 | | 12671 | RowExclusiveLock | t
obs_id_seq | 17912 | 17140 | | 12671 | AccessShareLock | t
temp_obs_i | 18028 | 17140 | | 12671 | RowExclusiveLock | t
imported | 18045 | 17140 | | 12671 | AccessShareLock | t
imported | 18045 | 17140 | | 12671 | RowExclusiveLock | t
| 18101 | 17140 | | 12671 | AccessExclusiveLock | t
| 3021077 | 17140 | | 12671 | AccessExclusiveLock | t
| | | 80133 | 12671 | ExclusiveLock | t
| | | 80147 | 12804 | ExclusiveLock | t

tassiv=# select * from pg_stat_activity;
datid | datname | procpid | usesysid | usename | current_query | query_start
-------+---------+---------+----------+---------+---------------+-------------
17140 | tassiv | 12671 | 100 | robert | |
17140 | tassiv | 12672 | 100 | robert | |
17140 | tassiv | 12976 | 100 | robert | |

Here are the backtraces. I've build under debug and have ddd ready to go...

postgres: stats collector process
#0 0x4031fec2 in select () from /lib/i686/libc.so.6
#1 0xbfffefb0 in ?? ()
#2 0x0813711e in pgstat_start () at pgstat.c:402
#3 0x08134093 in PostmasterMain (argc=3, argv=0x82861f8) at postmaster.c:871
#4 0x08109edc in main (argc=3, argv=0xbffff954) at main.c:215
#5 0x4025e7f7 in __libc_start_main () from /lib/i686/libc.so.6

postgres: stats buffer process
(gdb) backtrace
#0 0x4031fec2 in select () from /lib/i686/libc.so.6
#1 0x00000005 in ?? ()
#2 0x08138376 in pgstat_main () at pgstat.c:1279
#3 0x0813711e in pgstat_start () at pgstat.c:402
#4 0x08134093 in PostmasterMain (argc=3, argv=0x82861f8) at postmaster.c:871
#5 0x08109edc in main (argc=3, argv=0xbffff954) at main.c:215
#6 0x4025e7f7 in __libc_start_main () from /lib/i686/libc.so.6

postgres: robert tassiv 192.168.0.250 COPY
(gdb) backtrace
#0 0x403279d7 in semop () from /lib/i686/libc.so.6
#1 0x08133151 in PGSemaphoreLock (sema=0x0, interruptOK=1 '\001')
at pg_sema.c:419
#2 0x081517cc in LWLockAcquire (lockid=11097, mode=LW_EXCLUSIVE)
at lwlock.c:312
#3 0x081482fd in LockBuffer (buffer=1, mode=2) at bufmgr.c:1925
#4 0x0808439f in _bt_getbuf (rel=0x457767d8, blkno=4294967292, access=2)
at nbtpage.c:419
#5 0x080827e4 in _bt_split (rel=0x457767d8, buf=5538, firstright=330,
newitemoff=510, newitemsz=12, newitem=0x82fadb8, newitemonleft=0 '\0',
itup_off=0xbfffe2fe, itup_blkno=0xbfffe300) at nbtinsert.c:675
#6 0x08082202 in _bt_insertonpg (rel=0x457767d8, buf=5538, stack=0x8316fa0,
keysz=1, scankey=0x8316ff0, btitem=0x82fadb8, afteritem=0,
split_only_page=0 '\0') at nbtinsert.c:492
#7 0x08081ce8 in _bt_doinsert (rel=0x457767d8, btitem=0x82fadb8,
index_is_unique=0 '\0', heapRel=0x456fc898) at nbtinsert.c:141
#8 0x080857ea in btinsert (fcinfo=0xfffffffc) at nbtree.c:263
#9 0x081b77df in OidFunctionCall6 (functionId=331, arg1=4294967292,
arg2=4294967292, arg3=4294967292, arg4=4294967292, arg5=4294967292,
arg6=4294967292) at fmgr.c:1345
#10 0x08081190 in index_insert (indexRelation=0x457767d8, datums=0xbfffe5b0,
nulls=0xbfffe590 " \004", heap_t_ctid=0x840f494, heapRelation=0x456fc898,
check_uniqueness=0 '\0') at indexam.c:226
#11 0x080f75f4 in ExecInsertIndexTuples (slot=0x0, tupleid=0x840f494,
estate=0x834a730, is_vacuum=0 '\0') at execUtils.c:860
#12 0x080cb178 in CopyFrom (rel=0x456fc898, attnumlist=0x836dcf0,
binary=0 '\0', oids=0 '\0', delim=0x8238f5a "\t",
null_print=0x820ef58 "\\N") at copy.c:1598
#13 0x080c9d38 in DoCopy (stmt=0x1) at copy.c:830
#14 0x08158752 in PortalRunUtility (portal=0x82eae40, query=0x82e5e28,
dest=0x82e5cd8, completionTag=0xbfffe950 "") at pquery.c:757
#15 0x081589a8 in PortalRunMulti (portal=0x82eae40, dest=0x82e5cd8,
altdest=0x82e5cd8, completionTag=0xbfffe950 "") at pquery.c:821
#16 0x081582de in PortalRun (portal=0x82eae40, count=2147483647,
dest=0x82e5cd8, altdest=0x82e5cd8, completionTag=0xbfffe950 "")
at pquery.c:483
#17 0x081555ec in exec_simple_query (
query_string=0x82e5ac8 "COPY temp_obs_I FROM stdin")
at postgres.c:875
#18 0x08157602 in PostgresMain (argc=4, argv=0x8287678,
username=0x8287648 "robert") at postgres.c:2850
#19 0x081362e8 in BackendFork (port=0x82941c0) at postmaster.c:2536
#20 0x08135da3 in BackendStartup (port=0x82941c0) at postmaster.c:2184
#21 0x081346cd in ServerLoop () at postmaster.c:1108
#22 0x081340bd in PostmasterMain (argc=3, argv=0x82861f8) at postmaster.c:886
#23 0x08109edc in main (argc=3, argv=0xbffff954) at main.c:215
#24 0x4025e7f7 in __libc_start_main () from /lib/i686/libc.so.6

postgres: checkpoint subprocess
#0 0x403279d7 in semop () from /lib/i686/libc.so.6
#1 0x08133151 in PGSemaphoreLock (sema=0x0, interruptOK=1 '\001')
at pg_sema.c:419
#2 0x081517cc in LWLockAcquire (lockid=11097, mode=LW_SHARED) at lwlock.c:312
#3 0x08147454 in BufferSync () at bufmgr.c:786
#4 0x0814785a in FlushBufferPool () at bufmgr.c:990
#5 0x08095007 in CreateCheckPoint (shutdown=0 '\0', force=0 '\0')
at xlog.c:3282
#6 0x0809900a in BootstrapMain (argc=0, argv=0x16) at bootstrap.c:445
#7 0x08136a69 in SSDataBase (xlop=3) at postmaster.c:2842
#8 0x08134797 in ServerLoop () at postmaster.c:1039
#9 0x081340bd in PostmasterMain (argc=3, argv=0x82861f8) at postmaster.c:886
#10 0x08109edc in main (argc=3, argv=0xbffff954) at main.c:215
#11 0x4025e7f7 in __libc_start_main () from /lib/i686/libc.so.6

--
10:11:47 up 9 days, 2:57, 4 users, load average: 0.11, 0.41, 0.96

In response to

Responses

Browse pgsql-hackers by date

  From Date Subject
Next Message Alexey Mahotkin 2003-08-10 19:06:29 Proper Unicode support
Previous Message Andrew Dunstan 2003-08-10 15:46:09 Re: Another day, another SCO Compiler Error...