From owner-freebsd-questions Fri Aug 25 00:21:43 1995 Return-Path: questions-owner Received: (from majordom@localhost) by freefall.FreeBSD.org (8.6.11/8.6.6) id AAA05467 for questions-outgoing; Fri, 25 Aug 1995 00:21:43 -0700 Received: from rflab1.gtri.gatech.edu (rflab1.gtri.gatech.edu [130.207.199.83]) by freefall.FreeBSD.org (8.6.11/8.6.6) with ESMTP id AAA05458 for ; Fri, 25 Aug 1995 00:21:41 -0700 Received: (from ken@localhost) by rflab1.gtri.gatech.edu (8.6.12/8.6.12) id DAA23125 for questions@freebsd.org; Fri, 25 Aug 1995 03:21:21 -0400 From: "Kenneth D. Merry" Message-Id: <199508250721.DAA23125@rflab1.gtri.gatech.edu> Subject: Adaptec 2842, Quantum XP34300 problem To: questions@freebsd.org Date: Fri, 25 Aug 1995 03:21:20 -0400 (EDT) X-Mailer: ELM [version 2.4 PL23] Content-Type: text Content-Length: 5523 Sender: questions-owner@freebsd.org Precedence: bulk I've been having a few problems with my Adaptec 2842 and Quantum XP34300...perhaps someone out there might have some insight into this. First, a little history on the problem. This morning, the power blipped on and off about 6 or 7 times. (I wasn't home at the time) My surge protector is designed to just shut off the power if it drops too low, and apparantly that's what happened. Anyway, the point is, the machine crashed without syncing the disks. BTW, I was running 2.0.5R with the ahc1 driver from early July. (around July 5th or so, I upgraded it because of the bug fixes, performance gains, etc.) Anyway, when I cranked the machine back up after the crash, fsck couldn't get all the way through /usr without crapping out during Phase 2, the 'Check Pathnames' stage. I would just hear the drive spin down all of a sudden, and then spin back up, meanwhile, the ahc1 driver spits out sd0: timed out errors... /usr is a 3.6 gig partition (f), disklabel is included below. The output from the ahc1 driver initialization is included below as well. I tried rebooting several times, and running through the fsck again and again, and got the same result (drive spinning down during Phase 2 of the fsck, sd0 timeouts) every time. The only way I was able to get /usr to work at all was to mount -f it, unmount it, (so it would look like a clean dismount) and then mount it again. So, I decided to upgrade my scsi code and the Adaptec driver to -current. I grabbed the stuff from sys/i386/scsi, sys/dev/aic7xxx, sys/pci/aic7870.c, and all of sys/scsi. sys/i386/isa/aic7770.c had not changed since the last time I grabbed the driver. The current driver seemed to work fine, so I decided to have a go at fscking /usr, to see if the upgraded driver would make any difference as far as behavior. It made a difference, but not exactly what I wanted. When I fscked the partition, it failed in Phase 2, as before, and the drive spun down, but instead of the plain timeout messages, I got the following error messages (I had to write them down.../var is on the drive in question, so there was no logging or anything that could be done): ahc1: target 0, lun 0 (sd0) timed out sd0(ahc1:0:0): BUS DEVICE RESET message queued. ahc1: target 0, lun 0 (sd0) timed out ahc1: Issued Channel A Bus Reset #1. 1 SCBs aborted panic: ahc1: brkadrint, Illegal Host Access at seqaddr = 0x0 syncing disks... 1 1 1 1 1 1 1 1 1 1 1..and so on So, I'm not sure whether this is a hardware problem, or a software problem, or what. Does anyone have any ideas about this? The machine seems to be working fine now, but it kinda makes me nervous not to be able to get through a fsck of the system. I was able to do a full backup this afternoon (using tar), but not without several timeouts: Aug 24 14:01:03 wc206 /kernel: ahc1: target 0, lun 0 (sd0) timed out Aug 24 14:01:13 wc206 /kernel: Aug 24 14:01:13 wc206 /kernel: sd0(ahc1:0:0): ABORTED COMMAND asc:4e,0 Overlapped commands attempted Aug 24 14:01:13 wc206 /kernel: , retries:4 Aug 24 14:01:14 wc206 /kernel: ahc1: target 0, lun 0 (sd0) timed out Aug 24 14:01:14 wc206 /kernel: Aug 24 14:39:52 wc206 /kernel: ahc1: target 0, lun 0 (sd0) timed out Aug 24 14:40:00 wc206 /kernel: Aug 24 14:40:00 wc206 /kernel: ahc1: target 0, lun 0 (sd0) timed out Aug 24 14:40:00 wc206 /kernel: Aug 24 14:40:00 wc206 /kernel: sd0(ahc1:0:0): ABORTED COMMAND asc:4e,0 Overlapped commands attempted Aug 24 14:40:00 wc206 /kernel: , retries:4 I wasn't running the -current ahc1 driver when I did the backup, so the error messages aren't as verbose as they might be. So, if anyone has any idea how I might at the very least diagnose what kind of problem my drive has, and hopefully be able to fsck the entire thing, I'd be very grateful. Thanks, Ken -- Kenneth Merry ken@rflab1.gtri.gatech.edu Disclaimer: I don't speak for GTRI, GT, or Elvis. ****** ahc1: reading board settings ahc1: 284x Single Channel, SCSI Id=7, aic7770 >= Rev E, 4 SCBs ahc1: Using Edge Triggered Interrupts ahc1: Downloading Sequencer Program...Done ahc1 at 0x1000-0x10ff irq 11 on eisa slot 1 ahc1: Probing channel A ahc1 waiting for scsi devices to settle ahc1: target 0 synchronous at 10.0MB/s, offset = 0xf ahc1: target 0 Tagged Queuing Device (ahc1:0:0): "Quantum XP34300 55B1" type 0 fixed SCSI 2 sd0(ahc1:0:0): Direct-Access 4101MB (8399520 512 byte sectors) ahc1: target 1 synchronous at 5.0MB/s, offset = 0xf (ahc1:1:0): "NEC CD-ROM DRIVE:500 1.0" type 5 removable SCSI 2 cd0(ahc1:1:0): CD-ROM cd0(ahc1:1:0): NOT READY asc:3a,0 Medium not present can't get the size ahc1: target 4 synchronous at 5.0MB/s, offset = 0xf (ahc1:4:0): "SONY SDT-5000 3.02" type 1 removable SCSI 2 st0(ahc1:4:0): Sequential-Access density code 0x13, drive empty ****** ****** # /dev/rsd0c: type: SCSI disk: sd0s1 label: flags: bytes/sector: 512 sectors/track: 32 tracks/cylinder: 64 sectors/cylinder: 2048 cylinders: 4100 sectors/unit: 8398816 rpm: 0 interleave: 0 trackskew: 0 cylinderskew: 0 headswitch: 0 # milliseconds track-to-track seek: 0 # milliseconds drivedata: 0 8 partitions: # size offset fstype [fsize bsize bps/cpg] b: 409600 0 swap # (Cyl. 0 - 199) c: 8398816 0 unused 0 0 # (Cyl. 0 - 4100*) e: 409600 409600 4.2BSD 0 0 0 # (Cyl. 200 - 399) f: 7579616 819200 4.2BSD 0 0 0 # (Cyl. 400 - 4100*) Warning, revolutions/minute 0 super block size 0 ******