From owner-freebsd-stable@FreeBSD.ORG Wed Oct 24 14:11:01 2007 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1BAF016A469 for ; Wed, 24 Oct 2007 14:11:01 +0000 (UTC) (envelope-from ghelmer@palisadesys.com) Received: from magellan.palisadesys.com (magellan.palisadesys.com [192.188.162.211]) by mx1.freebsd.org (Postfix) with ESMTP id BFFB613C4A8 for ; Wed, 24 Oct 2007 14:11:00 +0000 (UTC) (envelope-from ghelmer@palisadesys.com) Received: from [172.16.2.242] (cetus.palisadesys.com [192.188.162.7]) (authenticated bits=0) by magellan.palisadesys.com (8.14.1/8.14.1) with ESMTP id l9ODmsue034224 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Wed, 24 Oct 2007 08:48:54 -0500 (CDT) (envelope-from ghelmer@palisadesys.com) Message-ID: <471F4D52.3000706@palisadesys.com> Date: Wed, 24 Oct 2007 08:49:06 -0500 From: Guy Helmer User-Agent: Thunderbird 2.0.0.6 (Windows/20070728) MIME-Version: 1.0 To: John Baldwin References: <45B64469.9020002@palisadesys.com> <2a41acea0701230937p7c0f6400ida76a956fabd9b94@mail.gmail.com> <45B65155.5080304@palisadesys.com> <200710190958.24912.jhb@freebsd.org> In-Reply-To: <200710190958.24912.jhb@freebsd.org> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Greylist: Sender succeeded SMTP AUTH authentication, not delayed by milter-greylist-3.0 (magellan.palisadesys.com [192.188.162.5]); Wed, 24 Oct 2007 08:48:55 -0500 (CDT) X-Palisade-MailScanner-Information: Please contact the ISP for more information X-Palisade-MailScanner: Found to be clean X-Palisade-MailScanner-SpamCheck: not spam (whitelisted), SpamAssassin (not cached, score=-4.399, required 6, autolearn=not spam, ALL_TRUSTED -1.80, BAYES_00 -2.60) X-Palisade-MailScanner-From: ghelmer@palisadesys.com Cc: freebsd-stable@freebsd.org Subject: Re: Supermicro X7DBR-8+ hang at boot X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 24 Oct 2007 14:11:01 -0000 John Baldwin wrote: > On Tuesday 23 January 2007 01:17:57 pm Guy Helmer wrote: > >> Jack Vogel wrote: >> >>> On 1/23/07, Guy Helmer wrote: >>> >>>> Using FreeBSD 6.2, I'm having trouble with the Supermicro X7DBR-8+ >>>> motherboard (dual Xeon 5130 CPUs on the Blackford chipset - >>>> http://www.supermicro.com/products/motherboard/Xeon1333/5000P/X7DBR-8+.cfm) >>>> >>>> hanging after printing the "Waiting 5 seconds for SCSI devices to >>>> settle" message. The hang doesn't always happen - sometimes we have to >>>> go through several reboot cycles for it to happen - but sometimes it >>>> happens with every reboot. For those who would suggest that this >>>> happens because I'm using Seagate drives, it happens even if we totally >>>> remove the SCSI drive (but leave the aic7902 SCSI interfaces enabled) >>>> and boot from a SATA disk. Using FreeBSD 6.1, the Intel gigabit >>>> ethernet NICs aren't found but the hang doesn't occur. >>>> >>> ... >>> If that isnt it, I would suggest installing using ACPI disabled or >>> SAFE if >>> needed, and then tweak the kernel after. >>> >> hint.apic.0.disabled=1 helped - it hasn't hung yet in several boot >> cycles. New dmesg is attached below in case it helps anyone see a >> better fix than disabling the APICs. >> > > So you got an interrupt storm on IRQ 18 when ahd0 tried to probe and ahd0 got > interrupt timeouts. This indicates that ahd0 really lives on IRQ 18, not IRQ > 30. Your BIOS is likely busted since ACPI hardcodes these sort of IRQs. > > You can override the BIOS by doing: > > set hw.pci5.2.INTA.irq=18 > > in the loader (or adding a line to loader.conf) and seeing if that fixes the > boot with APIC enabled. > > I'm trying to resolve what looks like a similar problem with an IBM Blade Server unit. I'm reviewing my previous emails on this subject with the verbose boot messages to try to learn what lead you to determine the correct interrupt would be 18, but I can't seem to figure out what data leads to this conclusion. Any hints? Thanks, Guy