From owner-freebsd-stable Sun Aug 25 6:27: 9 2002 Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.FreeBSD.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id B6C9D37B400 for ; Sun, 25 Aug 2002 06:27:05 -0700 (PDT) Received: from perimeter.co.za (obelix.perimeter.co.za [209.212.102.154]) by mx1.FreeBSD.org (Postfix) with ESMTP id 804C543E6E for ; Sun, 25 Aug 2002 06:27:02 -0700 (PDT) (envelope-from bsd@perimeter.co.za) Received: from asterix (ndf-dial-196-30-124-176.mweb.co.za [196.30.124.176]) (AUTH: LOGIN bsd@perimeter.co.za) by perimeter.co.za with esmtp; Sun, 25 Aug 2002 15:26:58 +0200 Message-ID: <01fe01c24c3b$2159a620$0200000a@perimeter.co.za> From: "Patrick O'Reilly" To: "Arnvid Karstad" , freebsd-stable@freebsd.org References: <20020825090730.3341.ARNVID@karstad.org> Subject: Re: Problems with 4.6.2 and aic7899 - fatal trap 12. Date: Sun, 25 Aug 2002 15:27:12 +0200 MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 5.50.4133.2400 X-MIMEOLE: Produced By Microsoft MimeOLE V5.50.4133.2400 Sender: owner-freebsd-stable@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.ORG From: "Arnvid Karstad" > Hiya. > > We were testing FreeBSD 4.6.2 on one of our sandbox machines that was > previously running FBSD4.5. After the cvs and additional make's the > machine now seems to be at tad unstable.. > > When disk i/o start to run of.. for instance during find / -name *~ it > just dumps out one nice error msg like this: > > Fatal trap 12: page fault while in kernel mode > fault virtual address = 0x0 > fault code = supervisor write, page not present > instruction pointer code = 0x8:0xc0215293 > stack pointer = 0x10:0xfef98c40 > frame pointer = 0x10:0xfef98c74 > code segment = base 0x0, limit 0xfffff, type 0x1b > = DPL 0, pres 1, def32 1, gran 1 > processor enabled = interupt enabled, resume, IOPL = 0 > current proccess = 142 (cvs) > interupt mask = bio > trap number = 12 > panic: Page fault > > syncing disk....... lotsa numbers.. then rebooots > > I've tested the ram and disks extensively with IBM diagnostics tool, and > didn't find anythig wrong there. (It's an IBM xSeries 342 server btw) > > What now? anyone know where to start looking?? Hoo Boy! I have just been through a week of trouble with similar symptoms, also since I upgraded to 4.6.2. 1) To be sure that the problem is the same, you should build your kernel for debugging, and then after a crash you can check, with help from smarter people than me on this list, where the fault lies. If that turns out to be the same is my problem, then there is a patch available which has made my box stable, for the last 28 hours anyway :) 2) If you are impatient, you could just apply the patch and maybe you are lucky and it works. However, I have a feeling that the more experienced hackers on this list will advice against this option. Here's a link to an easy to follow article explaining how to build your kernel with debugging enabled. Oh, and in the mean time, there is a new port (/usr/ports/devel/gdbmods) which you will find useful when the time comes to run your stack trace. Why not load it up so long... --- Regards, Patrick O'Reilly. ___ _ __ / _ )__ __ (_)_ __ ___ _/ /____ __ / __/ -_) _) / ~ ) -_), ,-/ -_) _) /_/ \__/_//_/_/~/_/\__/ \__/\__/_/ http://www.perimeter.co.za To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-stable" in the body of the message