Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 21 Aug 2008 13:36:02 -0700
From:      Jeremy Chadwick <koitsu@FreeBSD.org>
To:        Weldon S Godfrey 3 <weldon@excelsus.com>
Cc:        freebsd-fs@freebsd.org
Subject:   Re: ZFS-NFS kernel panic under load
Message-ID:  <20080821203602.GA22354@eos.sc1.parodius.com>
In-Reply-To: <20080821155118.C76650@emmett.excelsus.com>
References:  <20080806101621.H24586@emmett.excelsus.com> <20080814091337.Y94482@emmett.excelsus.com> <20080821153107.W76650@emmett.excelsus.com> <20080821194742.GA19362@eos.sc1.parodius.com> <20080821155118.C76650@emmett.excelsus.com>

next in thread | previous in thread | raw e-mail | index | archive | help
On Thu, Aug 21, 2008 at 03:55:09PM -0400, Weldon S Godfrey 3 wrote:
> To be hostest, I told it to reboot.  Sorry, I am not familiar with the  
> debugger and I didn't see (but I often overlook) anything to would  
> initiate a savecore when I typed help.  (although I could have tried go,  
> next, or whatever command would force it to step ahead, I didn't try 
> that) If you know the command, I can repeat the test tomorrow and type 
> the right thing.  I do have cores from the crashes with 7.0.

First, please do not remove the mailing list from the CC list; I've
re-added it.  People need to know what you've said.  :-)

If my memory serves me correctly, the problem is that you typed "reboot"
and not "panic".  I think this causes the machine to simply reboot
without dumping memory contents to swap, thus savecore won't find any
panic image in swap when the machine restarts.

Others should be able to help you through using the kernel debugger.

> If memory serves me right, sometime around 12:47pm, Jeremy Chadwick told me:
>
>> On Thu, Aug 21, 2008 at 03:35:04PM -0400, Weldon S Godfrey 3 wrote:
>>> Looks like the bug with NFS and ZFS still exists.
>>>
>>> Well, I got the lastest 8-HEAD on with the most recent ZFS patch and ran
>>> the benchmarks again this morning and after about an hour, it paniced
>>> with the same message about page fault with nfsd.  It dropped to debugger
>>> on shutdown, it didn't do a savecore, dumpdev is set to AUTO.
>>
>> Specifically regarding the debugger/didn't run savecore/dumpdev
>> statement:
>>
>> What exactly did you type once at the debugger prompt?  It matters.
>>
>> There's also this, which I reported nearly a year ago:
>> http://www.freebsd.org/cgi/query-pr.cgi?pr=conf/118255
>>
>> I haven't been able to reproduce my above PR on RELENG_7, but I'm
>> unaware of anything that might have changed in RELENG_7 that fixes this
>> problem.

-- 
| Jeremy Chadwick                                jdc at parodius.com |
| Parodius Networking                       http://www.parodius.com/ |
| UNIX Systems Administrator                  Mountain View, CA, USA |
| Making life hard for others since 1977.              PGP: 4BD6C0CB |




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20080821203602.GA22354>