From owner-freebsd-questions Sat Oct 6 12:32:29 2001
Delivered-To: freebsd-questions@freebsd.org
Received: from mailout6.nyroc.rr.com (mailout6-1.nyroc.rr.com [24.92.226.177])
    by hub.freebsd.org (Postfix) with ESMTP id E76D737B401
    for ; Sat, 6 Oct 2001 12:32:18 -0700 (PDT)
Received: from sonic.rit.edu (roc-24-93-27-75.rochester.rr.com [24.93.27.75])
    by mailout6.nyroc.rr.com (8.11.6/Road Runner 1.12) with ESMTP id f96JV4H15038
    for ; Sat, 6 Oct 2001 15:31:07 -0400 (EDT)
Message-Id: <5.1.0.14.0.20011006144741.03ab0d70@vmspop.rit.edu>
X-Sender: mdp1261@osfpop.rit.edu
X-Mailer: QUALCOMM Windows Eudora Version 5.1
Date: Sat, 06 Oct 2001 15:30:13 -0400
To: questions@freebsd.org
From: Matt Penna
Subject: Dump problem: MEDIUM ERROR
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
Sender: owner-freebsd-questions@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.ORG

Dear folks,

After running FreeBSD for a couple of years in what was basically a sandbox, I recently put several machines into production and am now enjoying all of the privileges and problems that go along with such a move! I've checked the archives but haven't found anything that sounds quite like this.

I am attempting to get backups working on a fileserver running Samba on 4.3-Release with a custom kernel. This is a dual-PPro machine using vinum to stripe data over 7 physical disks. Full dmesg is at the end of this message.

When I try to do a dump, it seems to run fine for a short period and then fails. I initially thought it may have been dump choking on the striped vinum volume for some reason, but it also fails on /, which is not part of the concatenated plex and resides entirely on a single disk. I have tried various command line arguments in an attempt to find something that works, but haven't had any luck.

I have tried:

    dump -0af /dev/sa0 /mnt/mainvol
    dump -0a /dev/da0s1a
    dump -0a -f - /dev/da0s1a | gzip -9v > /dev/sa0

and probably a few other variations over the past few days, all with similar results. This is what I see on the terminal when trying 'dump -0a /dev/vinum/mainvol'

--
  DUMP: Date of this level 0 dump: Sat Oct 6 12:36:33 2001
  DUMP: Date of last level 0 dump: the epoch
  DUMP: Dumping /dev/vinum/mainvol (/mnt/mainvol) to /dev/rsa0
  DUMP: mapping (Pass I) [regular files]
  DUMP: mapping (Pass II) [directories]
  DUMP: estimated 27032919 tape blocks.
  DUMP: dumping (Pass III) [directories]
  DUMP: dumping (Pass IV) [regular files]
  DUMP: write error 90750 blocks into volume 1
  DUMP: Do you want to restart?: ("yes" or "no")
Oct 6 12:42:16 KingMax /kernel: (sa0:ahc0:0:2:0): WRITE(06). CDB: a 0 0 28 0 0
Oct 6 12:42:16 KingMax /kernel: (sa0:ahc0:0:2:0): WRITE(06). CDB: a 0 0 28 0 0
Oct 6 12:42:16 KingMax /kernel: (sa0:ahc0:0:2:0): MEDIUM ERROR info:17bb96 asc:c,0
Oct 6 12:42:16 KingMax /kernel: (sa0:ahc0:0:2:0): MEDIUM ERROR info:17bb96 asc:c,0
Oct 6 12:42:16 KingMax /kernel: (sa0:ahc0:0:2:0): Write error
Oct 6 12:42:16 KingMax /kernel: (sa0:ahc0:0:2:0): Write error
no
  DUMP: The ENTIRE dump is aborted.
Oct 6 12:54:17 KingMax /kernel: (sa0:ahc0:0:2:0): WRITE FILEMARKS. CDB: 10 0 0 0 2 0
Oct 6 12:54:17 KingMax /kernel: (sa0:ahc0:0:2:0): WRITE FILEMARKS. CDB: 10 0 0 0 2 0
Oct 6 12:54:17 KingMax /kernel: (sa0:ahc0:0:2:0): MEDIUM ERROR info:8e asc:c,0
Oct 6 12:54:17 KingMax /kernel: (sa0:ahc0:0:2:0): MEDIUM ERROR info:8e asc:c,0
Oct 6 12:54:17 KingMax /kernel: (sa0:ahc0:0:2:0): Write error
Oct 6 12:54:17 KingMax /kernel: (sa0:ahc0:0:2:0): Write error
Oct 6 12:54:17 KingMax /kernel: (sa0:ahc0:0:2:0): failed to write terminating filemark(s)
Oct 6 12:54:17 KingMax /kernel: (sa0:ahc0:0:2:0): failed to write terminating filemark(s)
--

When I attempted to pipe it through gzip, it ran for a time and then terminated with a broken pipe, followed by more errors like the above.

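As a reading aid for the kernel messages above, here is a minimal sketch that decodes the two CDBs by hand. The function name decode_cdb is hypothetical, and the field layout is an assumption taken from the SCSI-2 definition of 6-byte sequential-access commands (byte 0 opcode, bit 0 of byte 1 a flag, bytes 2-4 a 24-bit big-endian count). It only does arithmetic on the printed bytes and does not touch the tape drive.

```shell
#!/bin/sh
# Decode a 6-byte sequential-access CDB as printed by the kernel,
# e.g. "a 0 0 28 0 0". Arguments are the six bytes as bare hex digits.
# Assumed layout (SCSI-2): byte 0 = opcode, byte 1 bit 0 = flag
# (FIXED for WRITE(6), IMMED for WRITE FILEMARKS),
# bytes 2-4 = 24-bit big-endian count, byte 5 = control.
decode_cdb() {
    if [ "$#" -ne 6 ]; then
        echo "usage: decode_cdb b0 b1 b2 b3 b4 b5" >&2
        return 1
    fi
    opcode=$((0x$1))
    bit0=$((0x$2 & 1))
    count=$(( (0x$3 << 16) | (0x$4 << 8) | 0x$5 ))
    printf 'opcode=0x%02x bit0=%d count=%d\n' "$opcode" "$bit0" "$count"
}

decode_cdb a 0 0 28 0 0    # the failing WRITE(06)
decode_cdb 10 0 0 0 2 0    # the failing WRITE FILEMARKS
```

Under those assumptions the WRITE(06) has the FIXED bit clear, so the count is a byte length: 0x002800 is 10240 bytes per record, which would be consistent with dump writing 10 KB records. The WRITE FILEMARKS count of 2 is the pair of terminating filemarks the driver then reports it could not write. The asc:c,0 pair in the sense data is the standard SCSI additional sense code for a write error.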
I have tried 3 different tapes with similar results. Am I looking at some kind of problem with the tape drive? Have I failed to do something to prepare the tapes for use by dump? Is it an Idiot Operator error? (The most likely cause!)

Any advice would be welcome. Thank you!

Matt

--
Full dmesg:

FreeBSD 4.3-RELEASE-p12 #1: Sun Aug 19 15:05:09 EDT 2001
Timecounter "i8254" frequency 1193182 Hz
CPU: Pentium Pro (199.43-MHz 686-class CPU)
  Origin = "GenuineIntel" Id = 0x617 Stepping = 7
  Features=0xfbff
real memory = 67108864 (65536K bytes)
avail memory = 62373888 (60912K bytes)
Programming 24 pins in IOAPIC #0
IOAPIC #0 intpin 2 -> irq 0
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id: 0, version: 0x00040011, at 0xfee00000
 cpu1 (AP): apic id: 1, version: 0x00040011, at 0xfee00000
 io0 (APIC): apic id: 2, version: 0x00170011, at 0xfec00000
Preloaded elf kernel "kernel" at 0xc02ef000.
Preloaded userconfig_script "/boot/kernel.conf" at 0xc02ef09c.
Pentium Pro MTRR support enabled
md0: Malloc disk
npx0: on motherboard
npx0: INT 16 interface
pcib0: on motherboard
IOAPIC #0 intpin 16 -> irq 2
IOAPIC #0 intpin 17 -> irq 16
IOAPIC #0 intpin 19 -> irq 17
pci0: on pcib0
isab0: at device 7.0 on pci0
isa0: on isab0
pci0: at 7.1
pci0: at 7.2 irq 15
xl0: <3Com 3c905-TX Fast Etherlink XL> port 0x9400-0x943f irq 2 at device 17.0 on pci0
xl0: Ethernet address: 00:60:08:a6:e8:a1
miibus0: on xl0
nsphy0: on miibus0
nsphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
pci0: at 18.0 irq 16
ahc0: port 0x9800-0x98ff mem 0xe4000000-0xe4000fff irq 17 at device 20.0 on pci0
aic7890/91: Wide Channel A, SCSI Id=7, 32/255 SCBs
fdc0: at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
atkbdc0: at port 0x60,0x64 on isa0
atkbd0: flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
psm0: irq 12 on atkbdc0
psm0: model Generic PS/2 mouse, device ID 0
vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1 at port 0x2f8-0x2ff irq 3 on isa0
sio1: type 16550A
ppc0: at port 0x378-0x37f irq 7 on isa0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/16 bytes threshold
lpt0: on ppbus0
lpt0: Interrupt-driven port
ppi0: on ppbus0
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0 intpin 2
APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0
Waiting 5 seconds for SCSI devices to settle
SMP: AP CPU #1 Launched!
sa0 at ahc0 bus 0 target 2 lun 0
sa0: Removable Sequential Access SCSI-2 device
sa0: 10.000MB/s transfers (10.000MHz, offset 7)
da0 at ahc0 bus 0 target 0 lun 0
da0: Fixed Direct Access SCSI-2 device
da0: 10.000MB/s transfers (10.000MHz, offset 15), Tagged Queueing Enabled
da0: 4101MB (8399520 512 byte sectors: 255H 63S/T 522C)
Mounting root from ufs:/dev/da0s1a
cd0 at ahc0 bus 0 target 1 lun 0
cd0: Removable CD-ROM SCSI-2 device
cd0: 10.000MB/s transfers (10.000MHz, offset 15)
cd0: Attempt to query device size failed: NOT READY, Medium not present
da7 at ahc0 bus 0 target 15 lun 0
da7: Fixed Direct Access SCSI-2 device
da7: 40.000MB/s transfers (20.000MHz, offset 15, 16bit), Tagged Queueing Enabled
da7: 8192MB (16777216 512 byte sectors: 255H 63S/T 1044C)
da5 at ahc0 bus 0 target 13 lun 0
da5: Fixed Direct Access SCSI-2 device
da5: 40.000MB/s transfers (20.000MHz, offset 15, 16bit), Tagged Queueing Enabled
da5: 8192MB (16777216 512 byte sectors: 255H 63S/T 1044C)
da6 at ahc0 bus 0 target 14 lun 0
da6: Fixed Direct Access SCSI-2 device
da6: 40.000MB/s transfers (20.000MHz, offset 15, 16bit), Tagged Queueing Enabled
da6: 8192MB (16777216 512 byte sectors: 255H 63S/T 1044C)
da3 at ahc0 bus 0 target 11 lun 0
da3: Fixed Direct Access SCSI-2 device
da3: 40.000MB/s transfers (20.000MHz, offset 15, 16bit), Tagged Queueing Enabled
da3: 4340MB (8888543 512 byte sectors: 255H 63S/T 553C)
da1 at ahc0 bus 0 target 9 lun 0
da1: Fixed Direct Access SCSI-2 device
da1: 40.000MB/s transfers (20.000MHz, offset 15, 16bit), Tagged Queueing Enabled
da1: 4340MB (8888543 512 byte sectors: 255H 63S/T 553C)
da4 at ahc0 bus 0 target 12 lun 0
da4: Fixed Direct Access SCSI-2 device
da4: 40.000MB/s transfers (20.000MHz, offset 15, 16bit), Tagged Queueing Enabled
da4: 4340MB (8888543 512 byte sectors: 255H 63S/T 553C)
da2 at ahc0 bus 0 target 10 lun 0
da2: Fixed Direct Access SCSI-2 device
da2: 40.000MB/s transfers (20.000MHz, offset 15, 16bit), Tagged Queueing Enabled
da2: 4340MB (8888543 512 byte sectors: 255H 63S/T 553C)

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-questions" in the body of the message