From owner-freebsd-current Sat Jan 23 01:25:30 1999 Return-Path: Received: (from majordom@localhost) by hub.freebsd.org (8.8.8/8.8.8) id BAA05533 for freebsd-current-outgoing; Sat, 23 Jan 1999 01:25:30 -0800 (PST) (envelope-from owner-freebsd-current@FreeBSD.ORG) Received: from spinner.netplex.com.au (spinner.netplex.com.au [202.12.86.3]) by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id BAA05523 for ; Sat, 23 Jan 1999 01:25:26 -0800 (PST) (envelope-from peter@netplex.com.au) Received: from spinner.netplex.com.au (localhost [127.0.0.1]) by spinner.netplex.com.au (8.9.2/8.9.2/Netplex) with ESMTP id RAA00489; Sat, 23 Jan 1999 17:25:03 +0800 (WST) (envelope-from peter@spinner.netplex.com.au) Message-Id: <199901230925.RAA00489@spinner.netplex.com.au> X-Mailer: exmh version 2.0.2 2/24/98 To: Matthew Dillon cc: current@FreeBSD.ORG Subject: Re: panic: found dirty cache page 0xf046f1c0 In-reply-to: Your message of "Sat, 23 Jan 1999 00:47:03 PST." <199901230847.AAA44590@apollo.backplane.com> Date: Sat, 23 Jan 1999 17:25:03 +0800 From: Peter Wemm Sender: owner-freebsd-current@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG Matthew Dillon wrote: > :It's definately happening still, sorry. :-( I recompiled a 100% static > :kernel and have had three more explosions, usually after starting exmh. > :(exmh takes 10 to 15MB of ram on this system due to my mailbox folder > :sizes). > : > :However, a clue.. The SMP box that is doing fine is a P6, an NFS client > :and server (loading nfs.ko, it fsck's fast, so I use that box for making > :sure the modules work). The one that is crashing, is a P5, an NFS client > :and server (static kernel), and with a MFS /tmp. Both run softupdates (up > :to date src/contrib/sys). > : > :I suspect MFS is the key. There's the new VOP_FREEBLKS() stuff you added, > :and the corresponding calls to madvise to free the pages. > : > :Given madvise()'s murky history, I can't help but feel suspicious about it. > : > :I've unmounted /tmp and am about to thrash the machine. At the > :moment, it's sitting on: Swap: 120M Total, 376K Used, 120M Free > : > :Cheers, > :-Peter > > Hmmm. It's possible. A quick look at the exmh source indicates that > it uses /tmp a lot. I've been doing make buildworld's with a 300MB > MFS /usr/obj, but those are typically nothing more then simple file > creates, reads, and writes. Presumably exmh is doing something more > sophisticated. I've since disabled MFS, compiled out a couple of other things I'm not using very often and generally cleaned things up. I've had three more panics since turning off MFS, so that wasn't it. :-( Anyway, I've just recompiled without SMP. There were some very strange things happening on the serial console again that I really do not like the look of. Console output has been disappearing and getting duplicated. > Try changing the panic in vm/vm_page.c to a printf() ( I'll do that. FWIW, this has happened while the system has been nearly quiescent all the way through to being thrashed with parallel cvs updates etc running. Most times it waits till exmh is running. Last time (when recompiling without SMP) it crashed when it came to linking the kernel (and no exmh running). I'll see if it still crashes in uniprocessor mode, if so, I'll put some debugging in and see if I can find anything out. The kernel was last built on Jan 16, and that one works fine still, so I'm pretty sure it isn't hardware. Cheers, -Peter To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-current" in the body of the message