From owner-freebsd-stable@FreeBSD.ORG Fri May 25 15:09:27 2007 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id A7D7916A421 for ; Fri, 25 May 2007 15:09:27 +0000 (UTC) (envelope-from volker@vwsoft.com) Received: from frontmail.ipactive.de (frontmail.maindns.de [85.214.95.103]) by mx1.freebsd.org (Postfix) with ESMTP id 3ED6913C484 for ; Fri, 25 May 2007 15:09:27 +0000 (UTC) (envelope-from volker@vwsoft.com) Received: from mail.vtec.ipme.de (Q7c56.q.ppp-pool.de [89.53.124.86]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by frontmail.ipactive.de (Postfix) with ESMTP id 844C212881C; Fri, 25 May 2007 17:09:20 +0200 (CEST) Received: from epeios.sz.vwsoft.com (epeios.sz.vwsoft.com [192.168.16.5]) by mail.vtec.ipme.de (Postfix) with ESMTP id 3F57F3FA01; Fri, 25 May 2007 17:08:49 +0200 (CEST) Message-ID: <4656FC01.20309@vwsoft.com> Date: Fri, 25 May 2007 17:08:49 +0200 From: Volker User-Agent: Thunderbird 2.0.0.0 (X11/20070521) MIME-Version: 1.0 To: "Roger Miranda - Digital Relay Inc." References: <200705230717.l4N7HuPW010071@lurza.secnetix.de> <20070523215818.GB64723@xor.obsecurity.org> <4656A73E.9040109@vwsoft.com> <200705250801.25112.rmiranda@digitalrelay.ca> In-Reply-To: <200705250801.25112.rmiranda@digitalrelay.ca> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-VWSoft-MailScanner: Found to be clean X-MailScanner-From: volker@vwsoft.com X-ipactive-MailScanner-Information: Please contact the ISP for more information X-ipactive-MailScanner: Found to be clean X-ipactive-MailScanner-From: volker@vwsoft.com Cc: freebsd-stable@freebsd.org Subject: Re: ghosthunting: machine freeze 6.2R X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 25 May 2007 15:09:27 -0000 On 05/25/07 15:01, Roger Miranda - Digital Relay Inc. wrote: ... > Volker & Kris, > > Sorry for kind of hi-jacking your thread. Just hopefully we all can work > together to fix this issue (if there is one). For me, that's ok as it may (or may not) be a similar problem. > Out machine did goto in to a "hard lock". Nothing showed up on the screen or > logs. > > I did see something from WITNESS in dmesg. See below. But this does not show > up at the time of the crash. If I remember correctly this show up at boot > time. > > We have added the following kernel options: > options DDB > options KDB > options BREAK_TO_DEBUGGER > options INVARIANTS > options INVARIANTS_SUPPORT > options WITNESS > options DEBUG_VFS_LOCKS > options DEBUG_MEMGUARD > > > -dmesg------- > softdep_waitidle: Failed to flush worklist for 0xc44ffcf8 > em1: link state changed to DOWN > bridge0: Ethernet address: 66:9e:c9:a6:f5:27 > em0: promiscuous mode enabled > em1: promiscuous mode enabled > em1: link state changed to UP > lock order reversal: > 1st 0xc06e0820 polling (polling) @ /usr/src/sys/kern/kern_poll.c:422 > 2nd 0xc473070c if_bridge (if_bridge) @ /usr/src/sys/net/if_bridge.c:1976 > KDB: stack backtrace: > witness_checkorder(c473070c,9,c068bfbc,7b8) at witness_checkorder+0x55c > _mtx_lock_flags(c473070c,0,c068bfbc,7b8,c473070c,...) at _mtx_lock_flags+0x41 > bridge_input(c43d9000,c4740000,c43a828c,c43a8000,c4740000,...) at > bridge_input+0x80 > ether_input(c43d9000,c4740000,c43a828c,0,c0675f3c,...) at ether_input+0x122 > e1000_rxeof(d43f2cbc,c0524e5d,d43f2ca8,1,5,...) at e1000_rxeof+0x19b > e1000_poll(c43d9000,0,5) at e1000_poll+0x46 > netisr_poll(0) at netisr_poll+0x70 > swi_net(0,c4315438,c4323a80,c050a484,c4322648,...) at swi_net+0xb0 > ithread_loop(c430f700,d43f2d38,c430f700,c050a484,0,...) at ithread_loop+0x1de > fork_exit(c050a484,c430f700,d43f2d38) at fork_exit+0x7d > fork_trampoline() at fork_trampoline+0x8 > --- trap 0x1, eip = 0, esp = 0xd43f2d6c, ebp = 0 --- > softdep_waitidle: Failed to flush worklist for 0xc44ffcf8 > em0: link state changed to UP > ---------------- From this output, I would assume, we're fighting with two completely different problems. I've catched 3 LORs and a strange (?) ACPI message within the last few hours. I'll post everything I've got in another post in a few minutes. Volker