From owner-freebsd-stable@FreeBSD.ORG Tue Jul 7 00:42:06 2009 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 7BDDE1065678 for ; Tue, 7 Jul 2009 00:42:06 +0000 (UTC) (envelope-from dan@langille.org) Received: from nyi.unixathome.org (nyi.unixathome.org [64.147.113.42]) by mx1.freebsd.org (Postfix) with ESMTP id 4450A8FC15 for ; Tue, 7 Jul 2009 00:42:05 +0000 (UTC) (envelope-from dan@langille.org) Received: from localhost (localhost [127.0.0.1]) by nyi.unixathome.org (Postfix) with ESMTP id 185C2509E1; Tue, 7 Jul 2009 01:22:19 +0100 (BST) X-Virus-Scanned: amavisd-new at unixathome.org Received: from nyi.unixathome.org ([127.0.0.1]) by localhost (nyi.unixathome.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id L0hFnzFfRPdI; Tue, 7 Jul 2009 01:22:17 +0100 (BST) Received: from bast.unixathome.org (bast.unixathome.org [10.8.1.1]) by nyi.unixathome.org (Postfix) with ESMTP id 334D250994; Tue, 7 Jul 2009 01:22:17 +0100 (BST) Received: from polo.unixathome.org (polo.unixathome.org [10.55.0.23]) by bast.unixathome.org (Postfix) with ESMTP id CD5A9B857; Tue, 7 Jul 2009 01:22:15 +0100 (BST) Message-ID: <4A529537.8070206@langille.org> Date: Mon, 06 Jul 2009 20:22:15 -0400 From: Dan Langille User-Agent: Thunderbird 2.0.0.22 (X11/20090627) MIME-Version: 1.0 To: Dan Langille References: <49774BAE.3000809@ksu.ru> <20090122071845.GF4881@alf.bsdes.net> <4978A10A.9060006@langille.org> <7697CDAB-B4E7-480A-B31A-1F54275B8D54@langille.org> In-Reply-To: <7697CDAB-B4E7-480A-B31A-1F54275B8D54@langille.org> Content-Type: text/plain; charset=US-ASCII; format=flowed Content-Transfer-Encoding: 7bit Cc: Victor Balada Diaz , freebsd-stable@freebsd.org, Pete French , "Marat N.Afanasyev" Subject: Re: interrupt storm on MSI IXP600 based motherboards X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 07 Jul 2009 00:42:06 -0000 Dan Langille wrote: > > On Jan 22, 2009, at 11:38 AM, Dan Langille wrote: > >> Victor Balada Diaz wrote: >>> On Wed, Jan 21, 2009 at 07:22:06PM +0300, Marat N.Afanasyev wrote: >>>>>> trouble with onboard re(4) was resolved in -CURRENT and -STABLE, >>>>>> but storms are not bound to ethernet only. storm may appear on any >>>>>> device. if any device generates enough interrupts rate, storm will >>>>>> arrive. >>>>> Yes, I just got another storm, on my ATA controller this time. Ah >>>>> well, so much for the idea of disabling unneeded devices! >>>>> >>>>> -pete. >>>>> >>>> it's a kind of magic, really. I built a new kernel with KDB and DDB >>>> and after 1 day, 13:15 I'm still waiting for storm to arrive. And I >>>> added >>>> hw.acpi.osname="Linux" to /boot/loader.conf. >>> Try doing lots of IO and you will get the problem soon. You might >>> want to try: >>> while true; do dd if=/dev/zero of=BAH bs=1M count=1024; sync; done >> >> FWIW, last night I changed the address of the comm port IO in my BIOS. >> Then I ran the Bacula regression test suite (lots of IO). For my >> machine, once the interrupt storm starts, it continues. I do not know >> if that happens to everyone. >> >> Since changing the address, I have had no interrupt storms. I have >> been running the above IO loop for about ten minutes. >> >> No storm yet (knock on wood). > > > And it's back: > > Jan 22 17:21:46 polo kernel: interrupt storm detected on "irq22:"; > throttling interrupt source > Jan 22 17:23:19 polo kernel: interrupt storm detected on "irq22:"; > throttling interrupt source > Jan 22 17:28:20 polo kernel: interrupt storm detected on "irq22:"; > throttling interrupt source > Jan 22 17:33:20 polo kernel: interrupt storm detected on "irq22:"; > throttling interrupt source > Jan 22 17:38:20 polo kernel: interrupt storm detected on "irq22:"; > throttling interrupt source > > I shall try the hw.acpi.osname="Linux" option now. > > From dmsg: Jan 22 18:10:07 polo kernel: ACPI: Overriding _OS definition > with "Linux" The problem returns: Jul 6 20:12:10 polo kernel: interrupt storm detected on "irq22:"; throttling interrupt source Jul 6 20:12:41 polo last message repeated 31 times Jul 6 20:14:42 polo last message repeated 121 times Jul 6 20:17:09 polo last message repeated 147 times FreeBSD polo.example.org 7.2-STABLE FreeBSD 7.2-STABLE #10: Mon Jun 1 19:19:13 EDT 2009 dan@example.org:/usr/obj/usr/src/sys/PHENOM amd64 Copyright (c) 1992-2009 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD is a registered trademark of The FreeBSD Foundation. FreeBSD 7.2-STABLE #10: Mon Jun 1 19:19:13 EDT 2009 dan@example.org:/usr/obj/usr/src/sys/PHENOM Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: AMD Phenom(tm) 9600 Quad-Core Processor (2300.17-MHz K8-class CPU) Origin = "AuthenticAMD" Id = 0x100f22 Stepping = 2 Features=0x178bfbff Features2=0x802009> AMD Features=0xee500800 AMD Features2=0x7ff,,,Prefetch,,> TSC: P-state invariant Cores per package: 4 usable memory = 4281012224 (4082 MB) avail memory = 4108423168 (3918 MB) ACPI APIC Table: <122107 APIC0947> FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs cpu0 (BSP): APIC ID: 0 cpu1 (AP): APIC ID: 1 cpu2 (AP): APIC ID: 2 cpu3 (AP): APIC ID: 3 ioapic0 irqs 0-23 on motherboard kbd1 at kbdmux0 acpi0: <122107 RSDT0947> on motherboard ACPI: Overriding _OS definition with "Linux" acpi0: [ITHREAD] acpi0: Power Button (fixed) acpi0: reservation of fee00000, 1000 (3) failed acpi0: reservation of ffb80000, 80000 (3) failed acpi0: reservation of 0, a0000 (3) failed acpi0: reservation of 100000, cff00000 (3) failed ACPI HPET table warning: Sequence is non-zero (2) Timecounter "ACPI-safe" frequency 3579545 Hz quality 850 acpi_timer0: <32-bit timer at 3.579545MHz> port 0x808-0x80b on acpi0 acpi_hpet0: iomem 0xfed00000-0xfed003ff on acpi0 Timecounter "HPET" frequency 14318180 Hz quality 900 pcib0: port 0xcf8-0xcff on acpi0 pci0: on pcib0 pcib1: at device 11.0 on pci0 pci1: on pcib1 vgapci0: port 0xd800-0xd87f mem 0xfd000000-0xfdffffff,0xd0000000-0xdfffffff,0xfa000000-0xfbffffff irq 19 at device 0.0 on pci1 atapci0: port 0xc000-0xc007,0xb000-0xb003,0xa000-0xa007,0x9000-0x9003,0x8000-0x800f mem 0xf9fff800-0xf9fffbff irq 22 at device 18.0 on pci0 atapci0: [ITHREAD] atapci0: AHCI Version 01.10 controller with 4 ports detected ata2: on atapci0 ata2: [ITHREAD] ata3: on atapci0 ata3: [ITHREAD] ata4: on atapci0 ata4: [ITHREAD] ata5: on atapci0 ata5: [ITHREAD] ohci0: mem 0xf9ffe000-0xf9ffefff irq 16 at device 19.0 on pci0 ohci0: [GIANT-LOCKED] ohci0: [ITHREAD] usb0: OHCI version 1.0, legacy support usb0: SMM does not respond, resetting usb0: on ohci0 usb0: USB revision 1.0 uhub0: on usb0 uhub0: 2 ports with 2 removable, self powered ohci1: mem 0xf9ffd000-0xf9ffdfff irq 17 at device 19.1 on pci0 ohci1: [GIANT-LOCKED] ohci1: [ITHREAD] usb1: OHCI version 1.0, legacy support usb1: SMM does not respond, resetting usb1: on ohci1 usb1: USB revision 1.0 uhub1: on usb1 uhub1: 2 ports with 2 removable, self powered ohci2: mem 0xf9ffc000-0xf9ffcfff irq 18 at device 19.2 on pci0 ohci2: [GIANT-LOCKED] ohci2: [ITHREAD] usb2: OHCI version 1.0, legacy support usb2: SMM does not respond, resetting usb2: on ohci2 usb2: USB revision 1.0 uhub2: on usb2 uhub2: 2 ports with 2 removable, self powered ohci3: mem 0xf9ffb000-0xf9ffbfff irq 17 at device 19.3 on pci0 ohci3: [GIANT-LOCKED] ohci3: [ITHREAD] usb3: OHCI version 1.0, legacy support usb3: SMM does not respond, resetting usb3: on ohci3 usb3: USB revision 1.0 uhub3: on usb3 uhub3: 2 ports with 2 removable, self powered ohci4: mem 0xf9ffa000-0xf9ffafff irq 18 at device 19.4 on pci0 ohci4: [GIANT-LOCKED] ohci4: [ITHREAD] usb4: OHCI version 1.0, legacy support usb4: SMM does not respond, resetting usb4: on ohci4 usb4: USB revision 1.0 uhub4: on usb4 uhub4: 2 ports with 2 removable, self powered ehci0: mem 0xf9fff000-0xf9fff0ff irq 19 at device 19.5 on pci0 ehci0: [GIANT-LOCKED] ehci0: [ITHREAD] usb5: EHCI version 1.0 usb5: companion controllers, 2 ports each: usb0 usb1 usb2 usb3 usb4 usb5: on ehci0 usb5: USB revision 2.0 uhub5: on usb5 uhub5: 10 ports with 10 removable, self powered pci0: at device 20.0 (no driver attached) atapci1: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xff00-0xff0f at device 20.1 on pci0 ata0: on atapci1 ata0: [ITHREAD] hdac0: mem 0xf9ff4000-0xf9ff7fff irq 16 at device 20.2 on pci0 hdac0: HDA Driver Revision: 20090329_0131 hdac0: [ITHREAD] isab0: at device 20.3 on pci0 isa0: on isab0 pcib2: at device 20.4 on pci0 pci2: on pcib2 fwohci0: port 0xe800-0xe87f mem 0xfebff800-0xfebfffff irq 23 at device 0.0 on pci2 fwohci0: [FILTER] fwohci0: OHCI version 1.10 (ROM=1) fwohci0: No. of Isochronous channels is 4. fwohci0: EUI64 00:dc:10:00:01:53:a0:bc fwohci0: Phy 1394a available S400, 2 ports. fwohci0: Link S400, max_rec 2048 bytes. firewire0: on fwohci0 dcons_crom0: on firewire0 dcons_crom0: bus_addr 0x1464000 fwe0: on firewire0 if_fwe0: Fake Ethernet address: 02:dc:10:53:a0:bc fwe0: Ethernet address: 02:dc:10:53:a0:bc fwip0: on firewire0 fwip0: Firewire address: 00:dc:10:00:01:53:a0:bc @ 0xfffe00000000, S400, maxrec 2048 sbp0: on firewire0 fwohci0: Initiate bus reset fwohci0: BUS reset fwohci0: node_id=0xc800ffc0, gen=1, CYCLEMASTER mode fxp0: port 0xe400-0xe43f mem 0xfebfe000-0xfebfefff,0xfea00000-0xfeafffff irq 22 at device 2.0 on pci2 miibus0: on fxp0 inphy0: PHY 1 on miibus0 inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto fxp0: Ethernet address: 00:04:ac:d3:78:23 fxp0: [ITHREAD] acpi_button0: on acpi0 sio0: configured irq 3 not in bitmap of probed irqs 0 sio0: port may not be enabled sio0: configured irq 3 not in bitmap of probed irqs 0 sio0: port may not be enabled sio0: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 flags 0x10 on acpi0 sio0: type 16550A sio0: [FILTER] fdc0: port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0 fdc0: [FILTER] fd0: <1440-KB 3.5" drive> on fdc0 drive 0 atkbdc0: port 0x60,0x64 irq 1 on acpi0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] atkbd0: [ITHREAD] psm0: irq 12 on atkbdc0 psm0: [GIANT-LOCKED] psm0: [ITHREAD] psm0: model IntelliMouse, device ID 3 cpu0: on acpi0 acpi_throttle0: on cpu0 acpi_throttle0: CLK_VAL field overlaps THT_EN bit device_attach: acpi_throttle0 attach returned 6 cpu1: on acpi0 cpu2: on acpi0 cpu3: on acpi0 ppc0: cannot reserve I/O port range sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 ugen0: on uhub4 Timecounters tick every 1.000 msec firewire0: 1 nodes, maxhop <= 0, cable IRM = 0 (me) firewire0: bus manager 0 (me) acd0: CDRW at ata0-slave UDMA33 ad4: 476940MB at ata2-master SATA300 ad6: 476940MB at ata3-master SATA300 hdac0: HDA Codec #0: Realtek ALC888 pcm0: at cad 0 nid 1 on hdac0 pcm1: at cad 0 nid 1 on hdac0 pcm2: at cad 0 nid 1 on hdac0 GEOM_MIRROR: Device mirror/gm0 launched (2/2). GEOM_LABEL: Label for provider mirror/gm0s1a is ufsid/47c95ad36ed40e80. GEOM_LABEL: Label for provider mirror/gm0s1d is ufsid/47c95ae2ac3d5ead. GEOM_LABEL: Label for provider mirror/gm0s1e is ufsid/47c95ad3fa3660aa. GEOM_LABEL: Label for provider mirror/gm0s1f is ufsid/47c95ad37ffedafc. acd0: FAILURE - INQUIRY ILLEGAL REQUEST asc=0x24 ascq=0x00 (probe0:ata0:0:1:0): TEST UNIT READY. CDB: 0 0 0 0 0 0 (probe0:ata0:0:1:0): CAM Status: SCSI Status Error (probe0:ata0:0:1:0): SCSI Status: Check Condition (probe0:ata0:0:1:0): NOT READY asc:3a,1 (probe0:ata0:0:1:0): Medium not present - tray closed (probe0:ata0:0:1:0): Unretryable error acd0: FAILURE - INQUIRY ILLEGAL REQUEST asc=0x24 ascq=0x00 cd0 at ata0 bus 0 target 1 lun 0 cd0: Removable CD-ROM SCSI-0 device cd0: 33.000MB/s transfers cd0: Attempt to query device size failed: NOT READY, Medium not present - tray closed SMP: AP CPU #1 Launched! SMP: AP CPU #2 Launched! SMP: AP CPU #3 Launched! Trying to mount root from ufs:/dev/mirror/gm0s1a GEOM_LABEL: Label ufsid/47c95ad36ed40e80 removed. GEOM_LABEL: Label for provider mirror/gm0s1a is ufsid/47c95ad36ed40e80. GEOM_LABEL: Label ufsid/47c95ad3fa3660aa removed. GEOM_LABEL: Label for provider mirror/gm0s1e is ufsid/47c95ad3fa3660aa. GEOM_LABEL: Label ufsid/47c95ad37ffedafc removed. GEOM_LABEL: Label for provider mirror/gm0s1f is ufsid/47c95ad37ffedafc. GEOM_LABEL: Label ufsid/47c95ae2ac3d5ead removed. GEOM_LABEL: Label for provider mirror/gm0s1d is ufsid/47c95ae2ac3d5ead. GEOM_LABEL: Label ufsid/47c95ad36ed40e80 removed. GEOM_LABEL: Label ufsid/47c95ad3fa3660aa removed. GEOM_LABEL: Label ufsid/47c95ad37ffedafc removed. GEOM_LABEL: Label ufsid/47c95ae2ac3d5ead removed. [dan@polo:/usr/home/dan] $