From owner-freebsd-current@FreeBSD.ORG Tue May 26 14:24:44 2009 Return-Path: Delivered-To: freebsd-current@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id DDBAE1065670 for ; Tue, 26 May 2009 14:24:43 +0000 (UTC) (envelope-from Johan@double-l.nl) Received: from smtp-vbr13.xs4all.nl (smtp-vbr13.xs4all.nl [194.109.24.33]) by mx1.freebsd.org (Postfix) with ESMTP id 8AD928FC08 for ; Tue, 26 May 2009 14:24:43 +0000 (UTC) (envelope-from Johan@double-l.nl) Received: from w2003s01.double-l.local (double-l.xs4all.nl [80.126.205.144]) by smtp-vbr13.xs4all.nl (8.13.8/8.13.8) with ESMTP id n4QEOgj9053915; Tue, 26 May 2009 16:24:42 +0200 (CEST) (envelope-from Johan@double-l.nl) MIME-Version: 1.0 Content-Type: text/plain; charset="windows-1250" Content-Transfer-Encoding: quoted-printable Content-class: urn:content-classes:message X-MimeOLE: Produced By Microsoft Exchange V6.5 Date: Tue, 26 May 2009 16:24:41 +0200 Message-ID: <57200BF94E69E54880C9BB1AF714BBCB5DE8B0@w2003s01.double-l.local> X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: ZFS Crash Thread-Index: AcneAR6FFyy870WLQT2Jpq7wy8JUFgACqu5w References: <3c1674c90905242253n544c3f0cqb10952f349391ce7@mail.gmail.com> <454b8cc37c60ab7af2663ba70ddbfd59.squirrel@webmail.lerctr.org> <5a9a181a12e9e4ef864d23ae063f7277.squirrel@webmail.lerctr.org> <4A1BE241.5030706@kasimir.com> From: "Johan Hendriks" To: "Florian Smeets" X-Virus-Scanned: by XS4ALL Virus Scanner Cc: freebsd-current@FreeBSD.org Subject: RE: ZFS Crash X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 26 May 2009 14:24:44 -0000 >On 26.05.09 14:04, Larry Rosenman wrote: >> On Mon, 25 May 2009, Larry Rosenman wrote: >> >>> On Mon, 25 May 2009, Larry Rosenman wrote: >>> >>>> after looking at the code, never mind the "don't call doadump", so = we'll >>>> get the textdump. >>>> >>>> Thanks rwatson for the textdump stuff! >>>> >>> Here is current stats before we crash. Does any of this look totally >>> out of line? >>> >> It crashed again, but did *NOT* make it into ddb enough to do the = textdump. >> >> It was hung with the backtrace (looks like the same, but I couldn't >> scroll the screen back). >> >> Ideas? >> >> I'm really concerned that there is a problem. >> >Hi i guess I'm having a similar/the same problem. The panic looks the=20 >same, and the trace is a little different, I've had a lot of these=20 >panics on this machine, and the stack trace looked different quite = often. >The panic and show uma and show malloc can be found here:=20 >http://webmail.solomo.de/~flo/panic.txt >I have a coredump written to swap, but this panic is very easy to=20 >trigger and it panics every time savecore runs... It is also easy to=20 >trigger this with a buildworld -j5. >This is a quadcore Xeon with a a slow SATA disk and only 1GB of mem.=20 >(It's only a test machine, so i can test any suggestion.) This box used = >to run stable with kmem max set to 768M. >Anything i can do? >Cheers, >Florian I have several panics also i am using cvsup-mirror on a zfs raidz2. It did run good for quite some time, but last week after a buildworld it = started to panic as soon as /usr/local/etc/cvsup/update.sh was running. I had vm.kmem_size=3D"512M" and vm.kmem_size_max=3D"512M". I had a custom kernel with some options added and some taken off. The box is a HP Proliant 110 with 1GB memory and is running the amd64 = version of FreeBSD. =20 My custom kernel had the following. # Debugging for use in -current nooptions KDB # Enable kernel debugger support. nooptions DDB # Support DDB. nooptions GDB # Support remote GDB. nooptions INVARIANTS # Enable calls of extra sanity checking nooptions INVARIANT_SUPPORT # Extra sanity checks of internal = structures, required by INVARIANTS nooptions WITNESS # Enable checks to detect deadlocks and = cycles nooptions WITNESS_SKIPSPIN # Don't run witness on spinlocks for = speed #polling options DEVICE_POLLING # pf options ALTQ options ALTQ_CBQ options ALTQ_RED options ALTQ_RIO options ALTQ_HFSC options ALTQ_CDNR options ALTQ_PRIQ device pf device pflog device pfsync I did a new buildworld with a GENERIC kernel with pf part still there. = But now i can not panic it again. So maybe it is the options DEVICE_POLLING option. i will try to add that again to the kernel and try to panic it again. Do you have the DEVICE_POLLING option in your kernel? BTW i am no FreeBSD or debug expert. This is just an observation. Regards, Johan Hendriks No virus found in this outgoing message. Checked by AVG - www.avg.com=20 Version: 8.5.339 / Virus Database: 270.12.39/2134 - Release Date: = 05/25/09 18:14:00