From owner-freebsd-current Sun Aug 29 23:19:37 1999 Delivered-To: freebsd-current@freebsd.org Received: from allegro.lemis.com (allegro.lemis.com [192.109.197.134]) by hub.freebsd.org (Postfix) with ESMTP id C802C15276 for ; Sun, 29 Aug 1999 23:19:32 -0700 (PDT) (envelope-from grog@freebie.lemis.com) Received: from freebie.lemis.com (freebie.lemis.com [192.109.197.137]) by allegro.lemis.com (8.9.1/8.9.0) with ESMTP id PAA27192; Mon, 30 Aug 1999 15:49:00 +0930 (CST) Received: (from grog@localhost) by freebie.lemis.com (8.9.3/8.9.0) id PAA76732; Mon, 30 Aug 1999 15:48:59 +0930 (CST) Date: Mon, 30 Aug 1999 15:48:59 +0930 From: Greg Lehey To: Parag Patel Cc: Matthew Dillon , Bernd Walter , Mike Smith , freebsd-current@FreeBSD.ORG Subject: Re: 4.0-CURRENT SMP crash with vinum raid-5 and softupdates Message-ID: <19990830154858.B13904@freebie.lemis.com> References: <80967.935993388@pinhead.parag.codegen.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Mailer: Mutt 0.95.4i In-Reply-To: <80967.935993388@pinhead.parag.codegen.com>; from Parag Patel on Sun, Aug 29, 1999 at 11:09:48PM -0700 WWW-Home-Page: http://www.lemis.com/~grog X-PGP-Fingerprint: 6B 7B C3 8C 61 CD 54 AF 13 24 52 F8 6D A4 95 EF Organization: LEMIS, PO Box 460, Echunga SA 5153, Australia Phone: +61-8-8388-8286 Fax: +61-8-8388-8725 Mobile: +61-41-739-7062 Sender: owner-freebsd-current@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG On Sunday, 29 August 1999 at 23:09:48 -0700, Parag Patel wrote: > On Sun, 29 Aug 1999 22:59:22 PDT, Matthew Dillon wrote: >> >> This is definitely a pbuf. Did you apply the patches Greg emailed? >> They will panic the machine earlier while it is still in the correct >> stack frame, allowing Greg to track down where the I/O initiation came >> from. > > Greg's debugging this on the machine here at the moment, which is > crashing the same way in what appears to be the same place. > > If anyone else wants to take a crack it, the magic vinum debug number > needed is 328 and not 324. > > So far Greg's discovered that the field is correctly set a little while > before the crash, and then becomes NULL for no apparent reason. Looks > like a really nasty bug. To expand on what Parag says, the check in launch_requests shows no problems with the buffer headers, but the check in biodone catches a bp with B_CALL set and b_iodone set to NULL. I'd guess that something in the iodone chain code is incorrect. Unfortunately, there were so many active requests that I couldn't find the one that caused the crash in the log buffer; theoretically it might even have nothing to do with it. Does this ring a bell with anybody? Greg -- See complete headers for address, home page and phone numbers finger grog@lemis.com for PGP public key To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-current" in the body of the message