From owner-freebsd-scsi Sun Nov 24 11:30:45 2002
Delivered-To: freebsd-scsi@freebsd.org
Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125])
    by hub.freebsd.org (Postfix) with ESMTP
    id 2F4F037B401; Sun, 24 Nov 2002 11:30:41 -0800 (PST)
Received: from sabre-wulf.nvg.ntnu.no (sabre-wulf.nvg.ntnu.no [129.241.210.67])
    by mx1.FreeBSD.org (Postfix) with ESMTP
    id 2DF0643E3B; Sun, 24 Nov 2002 11:30:40 -0800 (PST)
    (envelope-from taliesin@nvg.ntnu.no)
Received: from tyrell.nvg.ntnu.no ([IPv6:::ffff:129.241.210.70]:39325
    "EHLO tyrell.nvg.ntnu.no" ident: "[2OU0UI96r+5tiZjPzTLesBLZcFtn6hQV]"
    whoson: "-unregistered-")
    by sabre-wulf.nvg.ntnu.no with ESMTP id ; Sun, 24 Nov 2002 20:30:32 +0100
Received: (from taliesin@localhost)
    by tyrell.nvg.ntnu.no (8.11.6/8.8.4) id gAOJUV719881;
    Sun, 24 Nov 2002 20:30:31 +0100
Date: Sun, 24 Nov 2002 20:30:31 +0100
From: taliesin@nvg.ntnu.no
To: freebsd-scsi@freebsd.org, freebsd-stable@freebsd.org
Subject: SCSI parity error; software- or hardware-problem?
Message-ID: <20021124193031.GA18148@nvg.ntnu.no>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.4i
Sender: owner-freebsd-scsi@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

Last Thursday I started getting error messages like these whenever I
cvsupped, compiled something or ran du. (I had made world from 4.6.2 to
4.7-STABLE as of the 6th of November; see the dmesg at the bottom of
this mail.)

(da0:ahc0:0:0:0): WRITE(10). CDB: 2a 0 0 8a de b2 0 0 10 0
(da0:ahc0:0:0:0): ABORTED COMMAND asc:47,0
(da0:ahc0:0:0:0): SCSI parity error
(da0:ahc0:0:0:0): WRITE(10). CDB: 2a 0 0 74 de a2 0 0 10 0
(da0:ahc0:0:0:0): ABORTED COMMAND asc:47,0
(da0:ahc0:0:0:0): SCSI parity error
(da1:ahc0:0:1:0): WRITE(10). CDB: 2a 0 0 24 0 40 0 0 10 0
(da1:ahc0:0:1:0): ABORTED COMMAND asc:47,0
(da1:ahc0:0:1:0): SCSI parity error
(da1:ahc0:0:1:0): WRITE(10). CDB: 2a 0 0 dc 0 40 0 0 10 0
(da1:ahc0:0:1:0): ABORTED COMMAND asc:47,0
(da1:ahc0:0:1:0): SCSI parity error

and so on; there are about 20 times as many for da0 as for da1.

# camcontrol devlist
at scbus0 target 0 lun 0 (pass0,da0)
at scbus0 target 1 lun 0 (pass1,da1)
at scbus0 target 4 lun 0 (pass2,cd0)

(The CD burner is on its own cable, of course, and spends its time
making backups.)

This machine runs my desktop and compiles kernels, worlds etc. for
itself and for my other FreeBSD machines, which have less memory and
slower disks.

According to camcontrol and the verifier in the BIOS, there are no new
defects since the disks left the factory. No hardware has been changed
since sometime in early September, so the cabling hasn't been touched.

Prior to the makeworld the tags were at their default of 253; lowering
them didn't improve matters.

Then these turned up today:

(da0:ahc0:0:0:0): parity error detected in Data-in phase. SEQADDR(0x56) SCSIRATE(0x93)
(da0:ahc0:0:0:0): parity error detected in Data-in phase. SEQADDR(0x8a) SCSIRATE(0x93)
(da0:ahc0:0:0:0): parity error detected in Data-in phase. SEQADDR(0x8a) SCSIRATE(0x93)
(da0:ahc0:0:0:0): READ(10). CDB: 28 0 0 8f df 22 0 0 70 0
(da0:ahc0:0:0:0): ABORTED COMMAND asc:48,0
(da0:ahc0:0:0:0): Initiator detected error message received
(da1:ahc0:0:1:0): parity error detected in Data-in phase. SEQADDR(0x8b) SCSIRATE(0x93)

etc.

The system is currently not particularly usable, as things are
coredumping east and west; it can no longer get through a make world.

If it is *only* a hardware problem, it is easy though expensive to fix;
but which part of the system is the problem? Cable, disks or controller
card? In that case, what controller card and disks would you recommend?

If it's something that was changed for 4.7, does it also exist in 5.x,
and what other information do you need to have a look at it?

The motherboard is an Asus A7V and it's all been rock stable until now.

Here's the dmesg:

Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
    The Regents of the University of California. All rights reserved.
FreeBSD 4.7-STABLE #0: Wed Nov 6 23:43:00 CET 2002
    toor@shrdlu:/mnt/obj/usr/src/sys/SHRDLU
Timecounter "i8254" frequency 1193182 Hz
CPU: AMD Athlon(tm) Processor (807.19-MHz 686-class CPU)
  Origin = "AuthenticAMD" Id = 0x642 Stepping = 2
  Features=0x183f9ff
  AMD Features=0xc0440000<,AMIE,DSP,3DNow!>
real memory = 268353536 (262064K bytes)
config> q
avail memory = 256901120 (250880K bytes)
Preloaded elf kernel "kernel" at 0xc0422000.
Preloaded userconfig_script "/boot/kernel.conf" at 0xc042209c.
Preloaded elf module "vesa.ko" at 0xc04220ec.
VESA: v3.0, 32768k memory, flags:0x1, mode table:0xc041f282 (1000022)
VESA: NVidia
Pentium Pro MTRR support enabled
md0: Malloc disk
Using $PIR table, 9 entries at 0xc00f1720
npx0: on motherboard
npx0: INT 16 interface
pcib0: on motherboard
pci0: on pcib0
pcib1: at device 1.0 on pci0
pci1: on pcib1
pci1: at 0.0 irq 11
isab0: at device 4.0 on pci0
isa0: on isab0
atapci0: at device 4.1 on pci0
atapci0: ATA channel disabled by BIOS
uhci0: port 0xd400-0xd41f irq 14 at device 4.2 on pci0
usb0: on uhci0
usb0: USB revision 1.0
uhub0: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1: port 0xd000-0xd01f irq 14 at device 4.3 on pci0
usb1: on uhci1
usb1: USB revision 1.0
uhub1: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhub2: ALCOR Generic USB Hub, class 9/0, rev 1.10/1.00, addr 2
uhub2: 4 ports with 4 removable, self powered
viapropm0: SMBus I/O base at 0xe800
viapropm0: port 0xe800-0xe80f at device 4.4 on pci0
viapropm0: SMBus revision code 0x0
smb0: on smbus0
pcm0: port 0xa400-0xa43f irq 15 at device 10.0 on pci0
dc0: <82c169 PNIC 10/100BaseTX> port 0xa000-0xa0ff mem 0xd5800000-0xd58000ff irq 10 at device 11.0 on pci0
dc0: Ethernet address: 00:c0:f0:4e:07:87
miibus0: on dc0
ukphy0: on miibus0
ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
ahc0: port 0x9800-0x98ff mem 0xd5000000-0xd5000fff irq 14 at device 13.0 on pci0
aic7890/91: Ultra2 Wide Channel A, SCSI Id=7, 32/253 SCBs
atapci1: port 0x8000-0x803f,0x8400-0x8403,0x8800-0x8807,0x9000-0x9003,0x9400-0x9407 mem 0xd4800000-0xd481ffff irq 10 at device 17.0 on pci0
ata2: at 0x9400 on atapci1
ata3: at 0x8800 on atapci1
orm0:
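
As an aside, the CDB bytes in the kernel messages quoted earlier in this mail can be decoded to see which blocks the failing commands touched. What follows is only a rough sketch, assuming the standard 10-byte READ(10)/WRITE(10) layout (opcode, flags, 4-byte big-endian LBA, group byte, 2-byte big-endian transfer length, control byte). The function name decode_rw10 and the returned keys are made up for illustration and do not come from any FreeBSD tool.

```python
# Sketch: decode a READ(10)/WRITE(10) CDB as printed by the kernel,
# e.g. "2a 0 0 8a de b2 0 0 10 0". decode_rw10 is a hypothetical helper.

OPCODES = {0x28: "READ(10)", 0x2A: "WRITE(10)"}


def decode_rw10(cdb_hex):
    """Return the opcode name, starting LBA and block count of a 10-byte CDB."""
    cdb = [int(byte, 16) for byte in cdb_hex.split()]
    if len(cdb) != 10:
        raise ValueError("expected 10 CDB bytes, got %d" % len(cdb))
    if cdb[0] not in OPCODES:
        raise ValueError("not a READ(10)/WRITE(10) opcode: 0x%02x" % cdb[0])
    # Bytes 2..5 hold the logical block address, most significant byte first.
    lba = int.from_bytes(bytes(cdb[2:6]), "big")
    # Bytes 7..8 hold the transfer length in blocks, most significant byte first.
    blocks = int.from_bytes(bytes(cdb[7:9]), "big")
    return {"op": OPCODES[cdb[0]], "lba": lba, "blocks": blocks}


if __name__ == "__main__":
    for line in ("2a 0 0 8a de b2 0 0 10 0", "28 0 0 8f df 22 0 0 70 0"):
        print(line, "->", decode_rw10(line))
```

Read this way, the first failing write to da0 is a 16-block transfer starting at LBA 9100978, and the failing read is a 112-block transfer starting at LBA 9428770. The failures are therefore scattered across both disks rather than tied to one block.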