MOO-cows Mailing List Archive

[Prev][Next][Index][Thread]

crash: checkpointing while file server burps




This isn't a bug--it's more of an un-robust response to a network problem.

MediaMOO was in the middle of checkpointing when NSF lost contact with our
file server for a few minutes.  This caused a panic.  No panic dump was made.
We are running LambdaMOO 1.8.0p5 (unmodified) on a Sun Sparcstation/ipc
running SunOs 4.1.4.  Here's the output from the core file:

microworld:~MediaMOO/db% dbx moo core
Reading symbolic information...
Read 25232 symbols
warning: core file read error: address not in data space
program terminated by signal ABRT (abort)
(dbx) where
warning: core file read error: address not in data space
kill() at 0xf775f040
abort_server(), line 196 in "code/MOO-1.8.0p5/server.c"
panic(), line 207 in "code/MOO-1.8.0p5/server.c"
panic_signal(), line 254 in "code/MOO-1.8.0p5/server.c"
_sigtramp() at 0xf7732c4c
enqueue_output(), line 462 in "code/MOO-1.8.0p5/net_multi.c"
network_send_line(), line 539 in "code/MOO-1.8.0p5/net_multi.c"
send_shutdown_message(), line 175 in "code/MOO-1.8.0p5/server.c"
panic(), line 220 in "code/MOO-1.8.0p5/server.c"
panic_signal(), line 254 in "code/MOO-1.8.0p5/server.c"
_sigtramp() at 0xf7732c4c
libc_xstr() at 0xf77e1c20
libc_xstr() at 0xf77e007c
etext() at 0x4e07c


The partial checkpoint has a time of 11:12.  This is from /var/adm/messages:

Jul 30 11:07:19 microworld vmunix: NFS write failed for server mc: RPC: Timed o
Jul 30 11:07:19 microworld vmunix: NFS write error 60 on host mc fh ab09a64a a00000 c000000 51ff0100 12800000 c000000 2000000 1800000 
Jul 30 11:07:19 microworld vmunix: NFS write failed for server mc: RPC: Timed o
Jul 30 11:07:19 microworld vmunix: NFS write error 60 on host mc fh ab09a64a a00000 c000000 51ff0100 12800000 c000000 2000000 1800000 
Jul 30 11:07:19 microworld vmunix: NFS write failed for server mc: RPC: Timed o-t
Jul 30 11:07:19 microworld vmunix: NFS write error 60 on host mc fh ab09a64a a00000 c000000 51ff0100 12800000 c000000 2000000 1800000 
Jul 30 11:07:19 microworld vmunix: NFS write failed for server mc: RPC: Timed o
Jul 30 11:07:19 microworld vmunix: NFS write error 60 on host mc fh ab09a64a a00000 c000000 51ff0100 12800000 c000000 2000000 1800000 
Jul 30 11:07:19 microworld vmunix: NFS write failed for server mc: RPC: Timed o
Jul 30 11:07:19 microworld vmunix: NFS write error 60 on host mc fh ab09a64a a00000 c000000 51ff0100 12800000 c000000 2000000 1800000 
Jul 30 11:07:39 microworld vmunix: NFS getattr failed for server mc: RPC: Timedout
Jul 30 11:08:01 microworld vmunix: NFS getattr failed for server mc: RPC: Timedout
Jul 30 11:08:12 microworld vmunix: NFS write failed for server mc: RPC: Timed o
Jul 30 11:08:12 microworld vmunix: NFS write error 60 on host mc fh ab09a64a a00000 c000000 dbb80000 20800000 c000000 2000000 1800000 
Jul 30 11:08:22 microworld vmunix: NFS getattr failed for server mc: RPC: Timedout
Jul 30 11:08:43 microworld vmunix: NFS read failed for server mc: RPC: Timed ouJul 30 11:09:04 microworld vmunix: NFS getattr failed for server mc: RPC: Timed out
Jul 30 11:09:04 microworld vmunix: NFS getattr failed for server mc: RPC: Timedout
Jul 30 11:09:24 microworld vmunix: NFS read failed for server mc: RPC: Timed ouJul 30 11:09:25 microworld vmunix: NFS getattr failed for server mc: RPC: Timed out
Jul 30 11:10:06 microworld last message repeated 3 times
Jul 30 11:10:06 microworld vmunix: NFS create failed for server mc: RPC: Timed ut
Jul 30 11:10:27 microworld vmunix: NFS create failed for server mc: RPC: Timed ut
Jul 30 11:10:27 microworld vmunix: NFS getattr failed for server mc: RPC: Timedout
Jul 30 11:10:47 microworld vmunix: NFS getattr failed for server mc: RPC: Timedout
Jul 30 11:11:08 microworld vmunix: NFS read failed for server mc: RPC: Timed ouJul 30 11:11:28 microworld vmunix: NFS getattr failed for server mc: RPC: Timed out

-- Amy


Follow-Ups:

Home | Subject Index | Thread Index