From owner-freebsd-bugs Wed Feb 5 11:10:06 1997
Return-Path:
Received: (from root@localhost) by freefall.freebsd.org (8.8.5/8.8.5) id LAA02647 for bugs-outgoing; Wed, 5 Feb 1997 11:10:06 -0800 (PST)
Received: from mail1.sirius.com (mail1.sirius.com [205.134.253.131]) by freefall.freebsd.org (8.8.5/8.8.5) with SMTP id LAA02614 for ; Wed, 5 Feb 1997 11:09:56 -0800 (PST)
Received: from ds9.sirius.com (ds9.sirius.com [205.134.226.34]) by mail1.sirius.com (8.6.12/960710) with ESMTP id LAA16478 for ; Wed, 5 Feb 1997 11:08:20 -0800
Received: from localhost (dlowe@localhost) by ds9.sirius.com (8.6.12/961127) with SMTP id LAA04554 for ; Wed, 5 Feb 1997 11:09:58 -0800
X-Authentication-Warning: ds9.sirius.com: dlowe owned process doing -bs
Date: Wed, 5 Feb 1997 11:09:57 -0800 (PST)
From: David Lowe
X-Sender: dlowe@ds9
To: freebsd-bugs@freefall.freebsd.org
Subject: disk crash (AHA-2940AU)
Message-ID:
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-bugs@FreeBSD.ORG
X-Loop: FreeBSD.org
Precedence: bulk

Bug-people -

Just had a nasty crash. I'm running FreeBSD 2.2-BETA on a Pentium 133 with 64MB RAM. It has an Adaptec AHA-2940AU SCSI controller and a single 1GB disk.

After weeks of running without a hitch, this morning (with about 7 or 8 console logins: 4 running emacs, one running pine, the others idle) something hiccupped and the machine fell over. The error that appears to have started the problem was "/kernel: ahc0: Someone reset channel A". After that, every attempt to access the disk met with errors (file system full, Input/output error, out of inodes, bad file descriptor...).

When it became obvious (to me, at least) that the machine was hosed, I copied the syslog messages off the console, then ran df and iostat, copying those results as well. iostat caused a panic.

After rebooting, things look fine, but I'm a bit nervous. I seem to recall that the drivers for the 2940AU are not fully tested/supported? Is this just a hardware hiccup?
Is there something that can be done to recover from such a crash?

Thanks much,
David Lowe

syslog (logged to console):
---
/kernel: ahc0: Someone reset channel A
/kernel: pid 933 (emacs), uid 0 on /: file system full
syslogd: /var/log/messages: Input/output error
sendmail[1991]: HAA00231: SYSERR(root): Can't create transcript file xfHAA00231: Input/output error
/kernel: pid 1991 (sendmail), uid 0 on /var: out of inodes
sendmail[1991]: 0: fl=0x0, mode=20666: CHR: dev=4/0, ino=258, nlink=1, u/gid=0/0, size=0
sendmail[1991]: 1: fl=0x1, mode=20666: CHR: dev=4/0, ino=258, nlink=1, u/gid=0/0, size=0
sendmail[1991]: 2: fl=0x1, mode=20666: CHR: dev=4/0, ino=258, nlink=1, u/gid=0/0, size=0
sendmail[1991]: 3: fl=0x0, mode=140000: SOCK [0]->[[UNIX: /var/run/log]]
sendmail[1991]: 4: fl=0x0, mode=100644: dev=4/0, ino=671, nlink=1, u/gid=0/0, size=32768
sendmail[1991]: 5: fl=0x0, mode=100644: dev=4/131077, ino=116804, nlink=1, u/gid=0/0, size=32768
sendmail[1991]: 6: fl=0x4002, mode=100600: dev=4/131076, ino=198, nlink=1, u/gid=0/0, size=551
sendmail[1991]: 7: fl=0x1, mode=20666: CHR: dev=4/0, ino=258, nlink=1, u/gid=0/0, size=0
sendmail[1991]: 8: fl=0x0, mode=100600: dev=4/131076, ino=195, nlink=1, u/gid=0/0, size=0
sendmail[1991]: HAA00231: SYSERR(root): queueup: cannot create queue temp file tfHAA00231, uid=0: Bad file descriptor
sendmail[1991]: MCI@0: NULL
/kernel: pid 153 (bash), uid 1002 on /usr: out of inodes

df:
---
Filesystem  512-blocks    Used   Avail Capacity  Mounted on
/dev/sd0a        63550   28280   30186    48%    /
/dev/sd0s1f    1630646  685004  815192    46%    /usr
/dev/sd0s1e      59454    2042   52656     4%    /var
procfs               8       8       0   100%    /proc

iostat:
---
spec_getpages: I/O read error
vm_fault: pager input (probably hardware) error, PID 2028 failure

Fatal trap 12: page fault while in kernel mode
fault virtual address = 0x53b80
fault code = supervisor read, page not present
instruction pointer = 0x8:0xf0100767
stack pointer = 0x10:0xefbffde4
frame pointer = 0x10:0xefbffe10
code segment = base 0x0, limit 0xfffff,
type 0x1b = DPL 0, pres 1, def32 1, gran 1
processor eflags = interrupt enabled, resume, IOPL = 0
current process = 2028 (bash)
interrupt mask =
panic: page fault

syncing disks... 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 giving up

dumping to dev 401, offset 151664
dump 64 63 62 61

(at this point I rebooted, as it appeared to have frozen)

uname -a
---
FreeBSD list.sirius.com 2.2-BETA_A FreeBSD 2.2-BETA_A #0: Thu Jan 23 18:53:39 PST 1997 root@news3.sirius.com:/usr/src/sys/compile/FRUMPY i386

FRUMPY (kernel config file):
---
# FRUMPY -- config file for sirius.com FreeBSD-2.2-BETA machines.
machine "i386"
cpu "I586_CPU"
ident FRUMPY
config kernel root on sd0 dumps on sd0
maxusers 64

options "MAXMEM=(64*1024)"
options CHILD_MAX=256
options OPEN_MAX=1024
options "NMBCLUSTERS=2048"
options GPL_MATH_EMULATE
options "COMPAT_43"
options SYSVSHM
options "SHMMAXPGS=4096" # increase the size of the shared pages
options SYSVSEM
options SYSVMSG
options UCONSOLE
options MAXCONS=14
options USERCONFIG
options VISUAL_USERCONFIG
options INET
options "TCP_COMPAT_42"
options FFS
options NFS
options PROCFS
options QUOTA
options BOUNCE_BUFFERS

pseudo-device ether #Generic Ethernet
pseudo-device loop #Network loopback device
pseudo-device bpfilter 4 #Berkeley packet filter
pseudo-device pty 32
pseudo-device log
pseudo-device gzip

controller scbus0 #base SCSI code
device ch0 #SCSI media changers
device sd0 #SCSI disks
device st0 #SCSI tapes
device cd0 #SCSI CD-ROMs
device od0 #SCSI optical disk

controller isa0
device lpt0 at isa? port? tty irq 7 vector lptintr
device sio0 at isa? port "IO_COM1" tty irq 4 vector siointr
device sc0 at isa? port "IO_KBD" tty irq 1 vector scintr
device npx0 at isa? port "IO_NPX" iosiz 0x0 flags 0x0 irq 13 vector npxintr
controller fdc0 at isa? port "IO_FD1" bio irq 6 drq 2 vector fdintr
disk fd0 at fdc0 drive 0

controller pci0
controller ahc1
device de0
---
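
For anyone wanting to cross-check the df figures quoted above against the "file system full" messages, here is a minimal sketch. It assumes df derives its Capacity column as used / (used + avail), rounded to a whole percent; the names DF_ROWS, capacity_percent and summarize are made up for illustration, and the numbers are copied from the df output in this report.

```python
# Rows copied from the df output in the report: (mount point, used, avail),
# all counted in 512-byte blocks.
DF_ROWS = [
    ("/", 28280, 30186),
    ("/usr", 685004, 815192),
    ("/var", 2042, 52656),
]


def capacity_percent(used, avail):
    """Return used / (used + avail) as a whole percentage.

    This is an assumed formula for df's Capacity column, not taken
    from the df source.
    """
    return round(100.0 * used / (used + avail))


def summarize(rows):
    """Map each mount point to its recomputed capacity percentage."""
    return {mount: capacity_percent(used, avail) for mount, used, avail in rows}


if __name__ == "__main__":
    for mount, pct in summarize(DF_ROWS).items():
        print(f"{mount}: {pct}% used")
```

Under that assumption the recomputed values agree with the Capacity column (48%, 46% and 4%), so the block counts df reported after the reset were at least self-consistent and well short of full. Note that this says nothing about inode usage, which this df output does not show.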