From owner-freebsd-scsi@FreeBSD.ORG Fri Mar 24 21:08:45 2006 Return-Path: X-Original-To: freebsd-scsi@freebsd.org Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id CBB9416A41F for ; Fri, 24 Mar 2006 21:08:45 +0000 (UTC) (envelope-from jhb@freebsd.org) Received: from server.baldwin.cx (66-23-211-162.clients.speedfactory.net [66.23.211.162]) by mx1.FreeBSD.org (Postfix) with ESMTP id 6885043D55 for ; Fri, 24 Mar 2006 21:08:42 +0000 (GMT) (envelope-from jhb@freebsd.org) Received: from localhost (john@localhost [127.0.0.1]) by server.baldwin.cx (8.13.4/8.13.4) with ESMTP id k2OL8XNo040778; Fri, 24 Mar 2006 16:08:37 -0500 (EST) (envelope-from jhb@freebsd.org) From: John Baldwin To: Scott Long Date: Fri, 24 Mar 2006 16:07:51 -0500 User-Agent: KMail/1.9.1 References: <20060215102749.D58480@brain.cc.rsu.ru> <200603241440.30487.jhb@freebsd.org> <4424532E.30004@samsco.org> In-Reply-To: <4424532E.30004@samsco.org> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200603241607.55075.jhb@freebsd.org> X-Virus-Scanned: ClamAV 0.87.1/1356/Fri Mar 24 13:41:06 2006 on server.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-3.8 required=4.2 tests=ALL_TRUSTED,AWL,BAYES_00 autolearn=ham version=3.1.0 X-Spam-Checker-Version: SpamAssassin 3.1.0 (2005-09-13) on server.baldwin.cx Cc: freebsd-scsi@freebsd.org, Andrey Beresovsky Subject: Re: Boot hangs on ips0: resetting adapter, this may take up to 5 minutes X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 24 Mar 2006 21:08:45 -0000 On Friday 24 March 2006 15:14, Scott Long wrote: > John Baldwin wrote: > > On Friday 24 March 2006 13:12, Oleg Sharoiko wrote: > > > >>On Fri, 24 Mar 2006, John Baldwin wrote: > >> > >>JB>Hmm, well that interrupt storm is not due to a misrouted interrupt, it > >>JB>might be a bug in the bge(4) driver. Can you try compiling a kernel w/o > >>JB>USB and seeing if you can still reproduce the interrupt storm? > >> > >>Yes. > >> > >>db> show intrcnt > >>irq1: atkbd0 2 > >>irq4: sio0 3672 > >>irq6: fdc0 6 > >>irq9: acpi0 1 > >>irq14: ata0 36 > >>irq16: bge0 2592958 > >>irq28: ips0 728 > >>cpu0: timer 143147 > > > > > > Hmm, you might need to look at bge(4) and figure out what condition it > > is interrupting on, and why the driver isn't handling that condition. > > > > > >>John, can you tell anything about another case, for which interrupt > >>counters are: > >> > >>db> show intrcnt > >>irq1: atkbd0 1 > >>irq4: sio0 3 > >>irq6: fdc0 2 > >>irq9: acpi0 345147 > >>irq14: ata0 1 > >>cpu0: timer 57995 > >> > >>Does this also show an interrupt storm? This one has happened without bge > >>in kernel. I'm asking because I've seen several different scenarios of > >>hangs and this is from one of them. I'll try to reproduce all cases and > >>gather interrupt statistics from all of them. I'm wondering can't all this > >>cases be caused by one common reason behind all of them. > > > > > > This just looks like you aren't getting interrups from devices at all. > > > > 345,000 interrupts from scpi0 seems high. Yeah, I guess it might be. It's smaller than the bge interrupt storm number. :) It could be that the BIOS has bungled the trigger or polarity of the SCI (this is common, but we have a workaround for the more common breakage). There are tunables that let you set the trigger and polarity of the SCI (acpi interrupt). MADT: Interrupt override: source 9, irq 9 ioapic0: intpin 9 trigger: level ioapic0: intpin 9 polarity: high Yeah, level/high seems weird. Try setting either 'hw.acpi.sci.trigger=edge' or 'hw.acpi.sci.polarity=low' from the loader to see if that makes your machine happier. -- John Baldwin <>< http://www.FreeBSD.org/~jhb/ "Power Users Use the Power to Serve" = http://www.FreeBSD.org