From owner-freebsd-questions@FreeBSD.ORG Thu Dec 18 08:00:12 2008 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 547741065670 for ; Thu, 18 Dec 2008 08:00:12 +0000 (UTC) (envelope-from fbsd.questions@rachie.is-a-geek.net) Received: from mail.rachie.is-a-geek.net (rachie.is-a-geek.net [66.230.99.27]) by mx1.freebsd.org (Postfix) with ESMTP id 062348FC13 for ; Thu, 18 Dec 2008 08:00:11 +0000 (UTC) (envelope-from fbsd.questions@rachie.is-a-geek.net) Received: from localhost (mail.rachie.is-a-geek.net [192.168.2.101]) by mail.rachie.is-a-geek.net (Postfix) with ESMTP id 0D13DAFBC02; Wed, 17 Dec 2008 23:00:10 -0900 (AKST) From: Mel To: freebsd-questions@freebsd.org Date: Thu, 18 Dec 2008 09:00:07 +0100 User-Agent: KMail/1.9.7 References: <4d7dd86f0812150956i3a8d130ak8a4ca462896cd3ff@mail.gmail.com> <200812171015.07390.fbsd.questions@rachie.is-a-geek.net> <4d7dd86f0812170240n28ab5db9qf2816e2d4beefc3d@mail.gmail.com> In-Reply-To: <4d7dd86f0812170240n28ab5db9qf2816e2d4beefc3d@mail.gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200812180900.08295.fbsd.questions@rachie.is-a-geek.net> Cc: David N Subject: Re: Running rsnapshot via cron reboots the machine X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 18 Dec 2008 08:00:12 -0000 On Wednesday 17 December 2008 11:40:00 David N wrote: > 2008/12/17 Mel : > > On Monday 15 December 2008 18:56:46 David N wrote: > >> Hi, > >> > >> I have a machine > >> AMD Sepron LE-1150 > >> ASUS M2A-VM > >> 1GB RAM ECC > >> 2x SATA 300GB > >> > >> in a RAID 1 (gmirror). > >> 7.0-RELEASE-p2 AMD64 generic kernel > >> > >> it was doing backups via bacula to an external disk > >> USB 2.0 SATA disk, and it was working well. (GLabel) /dev/ufs/BackupDisk > >> > >> I changed to rsnapshot recently, with the External HDD in glabel + > >> gjournal (/dev/da0s1.journal -> /dev/ufs/BackupDisk) and it will > >> reboot the machine roughly 30 minutes after the rsnapshot starts via > >> CRON. > > > > Able to get any crash dumps? [1] I doubt it calls reboot system call > > after 30 minutes and if it's a heating issue, then it would power down > > not reboot. So, kernel is probably panicing. > > > > [1] > > http://www.freebsd.org/doc/en_US.ISO8859-1/books/developers-handbook/kern > >eldebug.html > > > > -- > > Mel > > > > Problem with today's modular software: they start with the modules > > and never get to the software part. > > I found something in the vmcore.0 > > panic: Journal overflow (joffset=499758276096 active=498475869184 > inactive=499755984896) > cpuid = 0 > Uptime: 16h7m11s > > I tried kgdb on on the vmcore but it didn't work, I had -p2 installed, > but compiled p6 so it might of overwrittin things in /usr/obj > > [GDB will not be able to debug user-mode threads: > /usr/lib/libthread_db.so: Undefined symbol "ps_pglobal_lookup"] > GNU gdb 6.1.1 [FreeBSD] > Copyright 2004 Free Software Foundation, Inc. > GDB is free software, covered by the GNU General Public License, and you > are welcome to change it and/or distribute copies of it under certain > conditions. Type "show copying" to see the conditions. > There is absolutely no warranty for GDB. Type "show warranty" for details. > This GDB was configured as "amd64-marcel-freebsd". > Cannot access memory at address 0x0 > > > The journal was set to 2GB on the 400GB USB attached disk. (/dev/da0) > > I just formatted the disk without gjournal and see how that goes. I > guess i can't use gjournal over USB? I have gjournal running on > another server (gmirror + gjournal) and i thrash it pretty hard > without any problems. It should not panic, but a journal overflow is more likely with USB, cause of the lower write speed (the journal fills faster then it's being emptied). Your best bet is to reproduce the panic using the sources that match the kernel and file a PR and/or post to freebsd-fs list to find out if there are people with similar problems/usage cases. It could be a tunable that you missed or that it's a known issue. -- Mel Problem with today's modular software: they start with the modules and never get to the software part.