From owner-freebsd-stable@FreeBSD.ORG Tue Jan 2 17:10:43 2007 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 0AADD16A504; Tue, 2 Jan 2007 17:10:43 +0000 (UTC) (envelope-from gavin.atkinson@ury.york.ac.uk) Received: from mail-gw4.york.ac.uk (mail-gw4.york.ac.uk [144.32.128.249]) by mx1.freebsd.org (Postfix) with ESMTP id 98F5213C45D; Tue, 2 Jan 2007 17:10:42 +0000 (UTC) (envelope-from gavin.atkinson@ury.york.ac.uk) Received: from buffy.york.ac.uk (buffy-128.york.ac.uk [144.32.128.160]) by mail-gw4.york.ac.uk (8.13.6/8.13.6) with ESMTP id l02Gdq4p009466; Tue, 2 Jan 2007 16:39:52 GMT Received: from buffy.york.ac.uk (localhost [127.0.0.1]) by buffy.york.ac.uk (8.13.8/8.13.6) with ESMTP id l02GdqRN086561; Tue, 2 Jan 2007 16:39:52 GMT (envelope-from gavin.atkinson@ury.york.ac.uk) Received: (from ga9@localhost) by buffy.york.ac.uk (8.13.8/8.13.6/Submit) id l02Gdqjw086560; Tue, 2 Jan 2007 16:39:52 GMT (envelope-from gavin.atkinson@ury.york.ac.uk) X-Authentication-Warning: buffy.york.ac.uk: ga9 set sender to gavin.atkinson@ury.york.ac.uk using -f From: Gavin Atkinson To: Jeremy Chadwick In-Reply-To: <20070102153608.GA78405@icarus.home.lan> References: <20070102153608.GA78405@icarus.home.lan> Content-Type: text/plain Content-Transfer-Encoding: 7bit Date: Tue, 02 Jan 2007 16:39:51 +0000 Message-Id: <1167755991.84652.6.camel@buffy.york.ac.uk> Mime-Version: 1.0 X-Mailer: Evolution 2.6.1 FreeBSD GNOME Team Port X-York-MailScanner: Found to be clean X-York-MailScanner-From: gavin.atkinson@ury.york.ac.uk Cc: freebsd-stable@freebsd.org Subject: Re: Interrupt (SCSI?) hang on 4.x X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 02 Jan 2007 17:10:43 -0000 On Tue, 2007-01-02 at 07:36 -0800, Jeremy Chadwick wrote: > Yes, I know 4.11 is EOL'd at the end of this month, but hopefully > someone can shed some light on this problem anyways. I simply don't > have the knowledge of what's going on on a low-level to determine > the cause. > > I do have serial console on this box, and after enabling some > debugging for the ahc(4) driver a few months back, was able to > get something intelligent out of the system regarding SCBs this > morning. This may not be useful (or the cause), though. I also > cannot enable drop-to-DDB-on-serial-break because our Portmaster 2 > has been known to send a serial break on rare occasion. :-( > > Every so often (sometimes hours, sometimes months -- usually months), > the 4.11 box we have "locks up" in the sense that both NICs on the > box stop working, and the SCSI controller also appears hung. This > problem has existed for a couple years; it's not specific to 4.11 > (versus 4.10 or 4.9). > > # vmstat -i > ata0 irq14 6 0 > fxp0 irq10 14874 28 > mux irq11 65028 125 > fdc0 irq6 1 0 > sio0 irq4 948 1 > clk irq0 516187 998 > rtc irq8 66071 127 > Total 663115 1282 Do any of these numbers continue to increase after the hang? You may find that if you are already logged in over the serial port before the hang and have run vmstat recently, it'll still be runnable due to it being cached. If the serial port is dead, you will probably still find you can get output from the serial port, so start "date; vmstat -i" in a loop over the serial port before it hangs, and watch the output once it wedges. Gavin