Date: Sun, 19 Mar 2000 21:00:35 -0500 (EST) From: "John W. DeBoskey" <jwd@unx.sas.com> To: freebsd-current@freebsd.org Subject: AMI MegaRAID lockup? not accepting commands. Message-ID: <200003200200.VAA11571@bb01f39.unx.sas.com>
next in thread | raw e-mail | index | archive | help
Hi, We have a system with a new AMI card in it controlling a pair of shelves from Dell (fbsd dated: 4.0-20000313-SNAP). The relevant dmesg output is below: (complete dmesg at end) amr0: <AMI MegaRAID> mem 0xf6c00000-0xf6ffffff irq 14 at device 10.1 on pci2 amr0: firmware 1.01 bios 1p00 128MB memory amrd0: <MegaRAID logical drive> on amr0 amrd0: 172780MB (353853440 sectors) RAID 5 (optimal) The adapter does not lockup while testing with bonnie and such. However, we have a 50Gig CVS repository sitting on the raid volume. When we do a 'cvs co' of -HEAD, it causes it to lockup. The following messages are repeating continuously: Mar 19 16:02:59 cvs /kernel: amr0: controller wedged (not taking commands) Mar 19 16:03:00 cvs /kernel: amr0: I/O error - dead Mar 19 16:03:00 cvs /kernel: amr0: cmd 2 ident 178 drive 0 Mar 19 16:03:00 cvs /kernel: amr0: blkcount 12 lba 59506736 Mar 19 16:03:00 cvs /kernel: amr0: virtaddr 0xd3089000 length 6144 Mar 19 16:03:00 cvs /kernel: amr0: physaddr 0000c880 nsg 2 Mar 19 16:03:00 cvs /kernel: amr0: 1abea000/4096 Mar 19 16:03:00 cvs /kernel: amr0: 25d2b000/2048 Mar 19 16:03:00 cvs /kernel: amr0: controller wedged (not taking commands) Mar 19 16:03:00 cvs /kernel: amr0: I/O error - dead Mar 19 16:03:00 cvs /kernel: amr0: cmd 2 ident 178 drive 0 Mar 19 16:03:00 cvs /kernel: amr0: blkcount 16 lba 59506768 Mar 19 16:03:00 cvs /kernel: amr0: virtaddr 0xd330f000 length 8192 Mar 19 16:03:00 cvs /kernel: amr0: physaddr 0000c880 nsg 2 Mar 19 16:03:00 cvs /kernel: amr0: 2396e000/4096 Mar 19 16:03:00 cvs /kernel: amr0: 28fef000/4096 Mar 19 16:03:00 cvs /kernel: amr0: controller wedged (not taking commands) Mar 19 16:03:00 cvs /kernel: amr0: I/O error - dead Mar 19 16:03:00 cvs /kernel: amr0: cmd 2 ident 178 drive 0 Mar 19 16:03:00 cvs /kernel: amr0: blkcount 16 lba 59506784 Mar 19 16:03:00 cvs /kernel: amr0: virtaddr 0xcebf3000 length 8192 Mar 19 16:03:01 cvs /kernel: amr0: physaddr 0000c880 nsg 2 Mar 19 16:03:01 cvs /kernel: amr0: 25470000/4096 Mar 19 16:03:01 cvs /kernel: amr0: 28ab1000/4096 We have locked the card up twice in a row. In looking through the cvs logs, I find the following concerning amr.c 1.7: http://www.FreeBSD.org/cgi/cvsweb.cgi/src/sys/dev/amr/amr.c --------------------- Increase the time we spend waiting for the controller to become ready to accept a new command; in high load cases it may be too busy for the old value. This loop needs something to tie it to real time, rather than just the CPU's ability to fetch from the L1 data cache, but this hack works for now. --------------------- Also, looking through freebsd-current archives, this topic seems to have been discussed in mid February, and then disappeared. If anyone has any ideas, or if there is anything we can try to help debug this problem, please let me know. Thanks, John ps: Complete dmesg output below. Copyright (c) 1992-2000 The FreeBSD Project. Copyright (c) 1982, 1986, 1989, 1991, 1993 The Regents of the University of California. All rights reserved. FreeBSD 4.0-20000313-SNAP #0: Sun Mar 19 15:07:38 EST 2000 root@cvs.unx.sas.com:/usr/src/sys/compile/CVSKERN Timecounter "i8254" frequency 1193182 Hz CPU: Pentium III/Pentium III Xeon (548.63-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x673 Stepping = 3 Features=0x383f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,XMM> real memory = 805306368 (786432K bytes) avail memory = 777945088 (759712K bytes) Preloaded elf kernel "kernel" at 0xc02fe000. Pentium Pro MTRR support enabled md0: Malloc disk npx0: <math processor> on motherboard npx0: INT 16 interface pcib0: <Intel 82443BX (440 BX) host to PCI bridge> on motherboard pci0: <PCI bus> on pcib0 pcib1: <Intel 82443BX (440 BX) PCI-PCI (AGP) bridge> at device 1.0 on pci0 pci1: <PCI bus> on pcib1 pci1: <ATI Mach64-GB graphics accelerator> at 0.0 irq 9 isab0: <Intel 82371AB PCI to ISA bridge> at device 7.0 on pci0 isa0: <ISA bus> on isab0 atapci0: <Intel PIIX4 ATA33 controller> port 0xffa0-0xffaf at device 7.1 on pci0 ata1: at 0x170 irq 15 on atapci0 pci0: <Intel 82371AB/EB (PIIX4) USB controller> at 7.2 irq 14 chip1: <Intel 82371AB Power management controller> port 0x850-0x85f at device 7.3 on pci0 fxp0: <Intel EtherExpress Pro 10/100B Ethernet> port 0xcc80-0xccbf mem 0xff100000-0xff1fffff,0xff201000-0xff201fff irq 11 at device 13.0 on pci0 fxp0: Ethernet address 00:90:27:89:eb:9b fxp1: <Intel EtherExpress Pro 10/100B Ethernet> port 0xcc40-0xcc7f mem 0xff000000-0xff0fffff,0xff200000-0xff200fff irq 10 at device 14.0 on pci0 fxp1: Ethernet address 00:90:27:89:ef:10 pcib2: <DEC 21152 PCI-PCI bridge> at device 15.0 on pci0 pci2: <PCI bus> on pcib2 ahc0: <Adaptec 2940 Ultra SCSI adapter> port 0xdc00-0xdcff mem 0xfafff000-0xfaffffff irq 9 at device 9.0 on pci2 ahc0: aic7880 Wide Channel A, SCSI Id=7, 16/255 SCBs pcib3: <PCI to PCI bridge (vendor=8086 device=0964)> at device 10.0 on pci2 pci3: <PCI bus> on pcib3 amr0: <AMI MegaRAID> mem 0xf6c00000-0xf6ffffff irq 14 at device 10.1 on pci2 amr0: firmware 1.01 bios 1p00 128MB memory amrd0: <MegaRAID logical drive> on amr0 amrd0: 172780MB (353853440 sectors) RAID 5 (optimal) xl0: <3Com 3c905B-TX Fast Etherlink XL> port 0xc880-0xc8ff mem 0xff202000-0xff20207f irq 14 at device 17.0 on pci0 xl0: Ethernet address: 00:b0:d0:19:18:81 miibus0: <MII bus> on xl0 xlphy0: <3Com internal media interface> on miibus0 xlphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0 fdc0: FIFO enabled, 8 bytes threshold fd0: <1440-KB 3.5" drive> on fdc0 drive 0 atkbdc0: <keyboard controller (i8042)> at port 0x60-0x6f on isa0 atkbd0: <AT Keyboard> irq 1 on atkbdc0 psm0: <PS/2 Mouse> irq 12 on atkbdc0 psm0: model Generic PS/2 mouse, device ID 0 vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 sc0: <System console> on isa0 sc0: VGA <16 virtual consoles, flags=0x200> sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0 sio0: type 16550A sio1 at port 0x2f8-0x2ff irq 3 on isa0 sio1: type 16550A ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0 ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode ppc0: FIFO with 16/16/8 bytes threshold lpt0: <Printer> on ppbus0 lpt0: Interrupt-driven port unknown0: <WSS/SB> at port 0x534-0x537,0x388-0x38b,0x220-0x22f irq 5 drq 1,0 on isa0 unknown1: <Game> at port 0x3a0-0x3a7 on isa0 unknown2: <Ctrl> at port 0xf00-0xf07 on isa0 unknown3: <MPU> at port 0x330-0x331 on isa0 acd0: CDROM <SAMSUNG CD-ROM SC-140F> at ata1-master using PIO4 Waiting 2 seconds for SCSI devices to settle Mounting root from ufs:/dev/da0s1a da0 at ahc0 bus 0 target 0 lun 0 da0: <QUANTUM ATLAS IV 9 WLS 0808> Fixed Direct Access SCSI-3 device da0: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing Enabled da0: 8683MB (17783249 512 byte sectors: 255H 63S/T 1106C) da1 at ahc0 bus 0 target 1 lun 0 da1: <QUANTUM ATLAS IV 9 WLS 0808> Fixed Direct Access SCSI-3 device da1: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing Enabled da1: 8683MB (17783249 512 byte sectors: 255H 63S/T 1106C) To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-current" in the body of the message
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200003200200.VAA11571>