From owner-freebsd-hackers@FreeBSD.ORG Wed Jul 16 07:15:41 2003
Date: Wed, 16 Jul 2003 18:15:38 +0400
From: "Andrew L. Neporada"
To: Avleen Vig
cc: hackers@freebsd.org
Subject: Re: 4.8 panic "ffs_clusteralloc: map mismatch"
Message-ID: <20030716141538.GA81724@nas.dgap.mipt.ru>
References: <20030716125744.GI68950@silverwraith.com>
In-Reply-To: <20030716125744.GI68950@silverwraith.com>
List-Id: Technical Discussions relating to FreeBSD

On Wed, Jul 16, 2003 at 05:57:44AM -0700, Avleen Vig wrote:
>
> Andrew,
>
> I spent about two to three years fighting with a system, trying to figure
> out what was wrong and why these errors were caused.
> I got the very same crashes you're seeing now.
> I'm sure others are too, and I think this reply would be useful for the
> archives.
>
> My Solution:
> I eventually realised that my problem was with one of three things:
> 1) bit flips in main memory
> 2) bit flips in cache
> 3) bad hard drive
>
> I replaced all of the memory after a few months. The problems stopped
> for a few weeks but quickly returned. So I don't think it was main
> memory, unless the new set or the sockets were damaged.
> I couldn't replace the cache because I couldn't find any more. The
> system was an old P1 (originally 75MHz). This could have been the
> problem.

My system has a brand new MB (Supermicro dual-processor mainboard with
integrated U160 SCSI & fxp NIC), P3 processor & memory (btw, ECC memory).
The other components are not so new, but they worked flawlessly for about
a year with another MB.

> I did once try turning off L2 cache in the BIOS, and I think the crashes
> *might* have continued. So it's possible the problems were here.
>
> Finally I didn't replace the hard drive, but I did find that moving load
> off the original drive to a second drive helped reduce the number of
> crashes, although they still continued to happen.
>
> HTH.
>
> --
> Avleen Vig               "Say no to cheese-eating surrender-monkeys"
> Systems Admin            "Fast, Good, Cheap. Pick any two."
> www.silverwraith.com     "Move BSD. For great justice!"

					Andrew.