Date:      Fri, 22 Aug 2008 10:44:11 -0700
From:      Jeremy Chadwick <koitsu@FreeBSD.org>
To:        Weldon S Godfrey 3 <weldon@excelsus.com>
Cc:        freebsd-fs@freebsd.org, pjd@FreeBSD.org
Subject:   Re: ZFS-NFS kernel panic under load
Message-ID:  <20080822174411.GA89610@eos.sc1.parodius.com>
In-Reply-To: <20080822115932.M76650@emmett.excelsus.com>
References:  <20080806101621.H24586@emmett.excelsus.com> <20080814091337.Y94482@emmett.excelsus.com> <20080821153107.W76650@emmett.excelsus.com> <20080821194742.GA19362@eos.sc1.parodius.com> <20080822115932.M76650@emmett.excelsus.com>

On Fri, Aug 22, 2008 at 12:02:47PM -0400, Weldon S Godfrey 3 wrote:
> Ok, I tried panic; it gave the typical page of panic output that this
> crash generates under 7.0.  I rebooted and there was no core, so I am
> missing a step.  Sorry for being clueless here.

Then you're probably being bitten by what's described in the PR quoted
below.  Supposedly you can type "panic" at the debugger prompt, which
should dump memory contents to swap; then, upon rebooting, go into
single-user mode, run "mount -a", and then run savecore.  A real PITA,
I know, but supposedly it works.
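
A rough sketch of that sequence, in case it helps.  Assumptions on my
part: the dump should land in /var/crash, and savecore can find the
dump device through /etc/fstab when no device is given.  The crash_dir
and RUN_CMD names are only for illustration.  By default the script
just prints the commands rather than running them.

```shell
#!/bin/sh
# Sketch only.  The first step happens at the kernel debugger prompt,
# not in a shell:
#   db> panic
# After that, reboot into single-user mode and run the steps below.

# Assumption: /var/crash is where the dump should be written.
crash_dir="${CRASH_DIR:-/var/crash}"

# Commands are only printed by default; set RUN_CMD="" to execute them.
run_cmd="${RUN_CMD-echo}"

recover_dump() {
    $run_cmd mount -a                # mount the filesystems from /etc/fstab
    $run_cmd savecore "$crash_dir"   # copy the dump from swap into crash_dir
}

recover_dump
```

Run as-is, it prints the two commands; with RUN_CMD="" set in the
environment, it executes them instead.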

I can't help with the cause of the actual panic, however; it's outside
of my skillset.

> Since the panic didn't reboot, I did a bt; it said it was at
> process 1001, access.nfsrv and access.nfs3srv (sorry, I know that isn't
> quite right; I meant to write it down.  It was definitely something with
> access and nfsrv)
>
> Thanks,
>
> Weldon
>
>
> If memory serves me right, sometime around Yesterday, Jeremy Chadwick told me:
>
>> On Thu, Aug 21, 2008 at 03:35:04PM -0400, Weldon S Godfrey 3 wrote:
>>> Looks like the bug with NFS and ZFS still exists.
>>>
>>> Well, I got the latest 8-HEAD on with the most recent ZFS patch and ran
>>> the benchmarks again this morning, and after about an hour it panicked
>>> with the same message about a page fault with nfsd.  It dropped to the
>>> debugger on shutdown and didn't do a savecore; dumpdev is set to AUTO.
>>
>> Specifically regarding the debugger/didn't run savecore/dumpdev
>> statement:
>>
>> What exactly did you type once at the debugger prompt?  It matters.
>>
>> There's also this, which I reported nearly a year ago:
>> http://www.freebsd.org/cgi/query-pr.cgi?pr=conf/118255
>>
>> I haven't been able to reproduce my above PR on RELENG_7, but I'm
>> unaware of anything that might have changed in RELENG_7 that fixes this
>> problem.
>>
>> -- 
>> | Jeremy Chadwick                                jdc at parodius.com |
>> | Parodius Networking                       http://www.parodius.com/ |
>> | UNIX Systems Administrator                  Mountain View, CA, USA |
>> | Making life hard for others since 1977.              PGP: 4BD6C0CB |

-- 
| Jeremy Chadwick                                jdc at parodius.com |
| Parodius Networking                       http://www.parodius.com/ |
| UNIX Systems Administrator                  Mountain View, CA, USA |
| Making life hard for others since 1977.              PGP: 4BD6C0CB |
