From owner-freebsd-current@FreeBSD.ORG Fri Oct 17 08:06:44 2003 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 59D3316A4BF for ; Fri, 17 Oct 2003 08:06:44 -0700 (PDT) Received: from web9601.mail.yahoo.com (web9601.mail.yahoo.com [216.136.129.180]) by mx1.FreeBSD.org (Postfix) with SMTP id 99AF043FBF for ; Fri, 17 Oct 2003 08:06:42 -0700 (PDT) (envelope-from chancedj@yahoo.com) Message-ID: <20031017150642.20140.qmail@web9601.mail.yahoo.com> Received: from [65.107.244.186] by web9601.mail.yahoo.com via HTTP; Fri, 17 Oct 2003 08:06:42 PDT Date: Fri, 17 Oct 2003 08:06:42 -0700 (PDT) From: Daryl Chance To: current@freebsd.org MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="0-94019671-1066403202=:20122" Subject: vinum, ,GEOM, ATANG, or bad disk? X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list Reply-To: chancedj@yahoo.com List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 17 Oct 2003 15:06:44 -0000 --0-94019671-1066403202=:20122 Content-Type: text/plain; charset=us-ascii Content-Id: Content-Disposition: inline Hi, I'm trying to setup vinum on 2 identical 30G HD's. The MB I'm using has a built in Raid 0 controller, so i have 6 ATA Slots. My Main HD is hooked up to the primary non-raid. and the other 2 are hooked up to primary 1 and primary 2 on the raid. I can get vinum up and running, but when i force a disk failure to see if it can recover, I start to get errors. Here are the steps i'm going through: copy /usr/src to the vinum vol stop mp3s.p1 (2 plex's - p0, p1) delete the contents start mp3s.p1 It starts to recover, then I start getting the following errors: Oct 16 23:00:34 mp3 kernel: ad6: TIMEOUT - WRITE_DMA retrying (2 retries left) Oct 16 23:00:34 mp3 kernel: ata3: resetting devices .. Oct 16 23:00:34 mp3 kernel: done Oct 16 23:00:44 mp3 kernel: ad6: TIMEOUT - WRITE_DMA retrying (1 retry left) Oct 16 23:00:44 mp3 kernel: ata3: resetting devices .. Oct 16 23:00:44 mp3 kernel: done Oct 16 23:00:54 mp3 kernel: ata3: resetting devices .. Oct 16 23:00:54 mp3 kernel: done Oct 16 23:00:54 mp3 kernel: ad6: FAILURE - WRITE_DMA timed out Oct 16 23:00:54 mp3 kernel: ad6: FAILURE - WRITE_DMA status=1 error=0 Oct 16 23:00:54 mp3 vinum[517]: can't revive mp3s.p1.s0: Input/output error Oct 16 23:04:18 mp3 kernel: vinum: mp3s.p1.s0 is down by force Oct 16 23:04:18 mp3 kernel: vinum: mp3s.p1 is faulty Oct 16 23:04:28 mp3 kernel: ad6: TIMEOUT - WRITE_DMA retrying (2 retries left) Oct 16 23:04:28 mp3 kernel: ata3: resetting devices .. Oct 16 23:04:28 mp3 kernel: done Oct 16 23:04:38 mp3 kernel: ad6: TIMEOUT - WRITE_DMA retrying (1 retry left) Oct 16 23:04:38 mp3 kernel: ata3: resetting devices .. Oct 16 23:04:38 mp3 kernel: done Oct 16 23:04:48 mp3 kernel: ata3: resetting devices .. Oct 16 23:04:48 mp3 kernel: done Oct 16 23:04:48 mp3 kernel: ad6: FAILURE - WRITE_DMA timed out Oct 16 23:04:48 mp3 kernel: ad6: FAILURE - WRITE_DMA status=1 error=0 Oct 16 23:04:48 mp3 kernel: vinum: Can't write config to /dev/ad6s1d, error 5 Oct 16 23:04:48 mp3 kernel: vinum: drive drive2 is down ad6 is the volume trying to get restored. I'm going to re-setup the volume once I get done zeroing the drives out (incase that could help) and see if I can still reproduce the problem. I believe (one of the things I'll document when i get it set back up), when i type [vinum] info, it identifies the ad4 slice correctly, but comes back as unknown for the ad6 slice (which should be /dev/ad6s1d). FreeBSD 5.1-CURRENT #0: Tue Oct 14 00:33:48 EDT 2003 root@mp3.earthlink.net:/usr/obj/usr/src/sys/MP3 I've attached my dmesg. vinum.cfg is below (where i run create -f ) drive drive1 device /dev/ad4s1d drive drive2 device /dev/ad6s1d volume mp3s setupstate plex org concat sd length 0 drive drive1 plex org concat sd length 0 drive drive2 __________________________________ Do you Yahoo!? The New Yahoo! Shopping - with improved product search http://shopping.yahoo.com --0-94019671-1066403202=:20122 Content-Type: text/plain; name="dmesg.txt" Content-Description: dmesg.txt Content-Disposition: inline; filename="dmesg.txt" Copyright (c) 1992-2003 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 5.1-CURRENT #0: Tue Oct 14 00:33:48 EDT 2003 root@mp3.earthlink.net:/usr/obj/usr/src/sys/MP3 Preloaded elf kernel "/boot/kernel/kernel" at 0xc0a0b000. Preloaded elf module "/boot/kernel/acpi.ko" at 0xc0a0b21c. Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: AMD Athlon(tm) Processor (1311.69-MHz 686-class CPU) Origin = "AuthenticAMD" Id = 0x644 Stepping = 4 Features=0x183f9ff AMD Features=0xc0440000 real memory = 268353536 (255 MB) avail memory = 251019264 (239 MB) Pentium Pro MTRR support enabled npx0: [FAST] npx0: on motherboard npx0: INT 16 interface acpi0: on motherboard pcibios: BIOS version 2.10 Using $PIR table, 9 entries at 0xc00f1750 acpi0: Power Button (fixed) Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0xe408-0xe40b on acpi0 acpi_cpu0: on acpi0 acpi_button0: on acpi0 pcib0: port 0xcf8-0xcff on acpi0 pci0: on pcib0 pcib0: slot 4 INTD is routed to irq 12 pcib0: slot 4 INTD is routed to irq 12 pcib0: slot 9 INTA is routed to irq 12 pcib0: slot 12 INTA is routed to irq 11 pcib0: slot 17 INTA is routed to irq 10 agp0: mem 0xe4000000-0xe7ffffff at device 0.0 on pci0 pcib1: at device 1.0 on pci0 pci1: on pcib1 isab0: at device 4.0 on pci0 isa0: on isab0 atapci0: port 0xd800-0xd80f at device 4.1 on pci0 ata0: at 0x1f0 irq 14 on atapci0 ata0: [MPSAFE] ata1: at 0x170 irq 15 on atapci0 ata1: [MPSAFE] uhci0: port 0xd400-0xd41f irq 12 at device 4.2 on pci0 usb0: on uhci0 usb0: USB revision 1.0 uhub0: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 2 ports with 2 removable, self powered uhci1: port 0xd000-0xd01f irq 12 at device 4.3 on pci0 usb1: on uhci1 usb1: USB revision 1.0 uhub1: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub1: 2 ports with 2 removable, self powered pci0: at device 4.4 (no driver attached) dc0: <82c169 PNIC 10/100BaseTX> port 0xa400-0xa4ff mem 0xe3000000-0xe30000ff irq 12 at device 9.0 on pci0 dc0: Ethernet address: 00:a0:cc:53:6e:57 miibus0: on dc0 bmtphy0: on miibus0 bmtphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto pci0: at device 12.0 (no driver attached) atapci1: port 0x8800-0x883f,0x9000-0x9003,0x9400-0x9407,0x9800-0x9803,0xa000-0xa007 mem 0xdb800000-0xdb81ffff irq 10 at device 17.0 on pci0 atapci1: [MPSAFE] ata2: at 0xa000 on atapci1 ata2: [MPSAFE] ata3: at 0x9400 on atapci1 ata3: [MPSAFE] fdc0: port 0x3f7,0x3f2-0x3f5 irq 6 drq 2 on acpi0 fdc0: FIFO enabled, 8 bytes threshold fd0: <1440-KB 3.5" drive> on fdc0 drive 0 atkbdc0: port 0x64,0x60 irq 1 on acpi0 atkbd0: flags 0x1 irq 1 on atkbdc0 kbd0 at atkbd0 orm0: