Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 26 Jul 2011 21:44:05 -1000
From:      Tim Newsham <tim.newsham+fbsd@gmail.com>
To:        freebsd-amd64@freebsd.org
Subject:   Re: page fault in kernel
Message-ID:  <CAGSRWbh_O1iNavv3e=PKo3s4tP9gPz4k3V80-OncUE2WoFCtEw@mail.gmail.com>
In-Reply-To: <CAGSRWbibOcLKUOrLFx-K5ftnfO1pkTyNew5kGMdgrJ8suyU8DQ@mail.gmail.com>
References:  <CAGSRWbibOcLKUOrLFx-K5ftnfO1pkTyNew5kGMdgrJ8suyU8DQ@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help
finally got to the cpuburn.  Normal cpu burn for K7 didnt turn
anything up, but the burnMMX test which does a heavy memory
test actually did..  It wouldnt report errors for memory sizes
<=3D 256kb, but >=3D 512kb it would return code 254 after
a few seconds (when I ran it twice, once for each core).

So.. looks like memory after all (fits the pattern).  Using the
test I took the same ram and seated it in a different slot and
it looks like the problem went away.  When I tried adding in
the second ram stick I have (each 2G, for a total of 4G) the problem
came back, but I didnt diagnose it further to see if it was the
ram stick or the memory slot that was to blame (save that for
another day).

So anyway, going to try a big overnight build to see if that has
completely resolved the issue or not.

Here's my repro for my original crash:
  - boot single user
  - mount -o ro -a
  - (burnMMX P || echo $?) &
  - (burnMMX P || echo $?) &
  - ... wait ...

(useful program that burncpu package!).
will update tomorow if it fixed the original issue or not...

Tim

On Tue, Jul 26, 2011 at 8:39 PM, Tim Newsham <tim.newsham+fbsd@gmail.com> w=
rote:
> I have an amd64 box that is occasionally panic'ing with a kernel
> page fault while the system is under heavy load. =A0I tried swapping
> out ram and running ram tests and that hasn't seem to turn up
> anything. =A0I originally saw this with 8.1 but I recently updated to
> 8.2 to be sure and I'm still seeing the problem. =A0I'm up to date
> with "freebsd-update" and I verified that the core files all match
> the md5 of a clean 8.2 release (after it is also updated). =A0I also
> tried to disable various frills I had installed (like ip6 tunnel).
>
> The normal way I repro this issue is to start a "portupgrade -caP"
> going and I usually get a crash within a few hours. =A0Sometimes
> I see some unusual behavior instead of or right before a crash
> (ie. program segfaults that shouldn't normally).
>
> Are there any known kernel issues that sound similar to this
> problem? =A0I've been working under the assumption that this is
> probably hardware related, but I haven't yet ruled out software.
>
> I'll probably be trying some tests with cpuburn. =A0Any other advice
> for tracking down this issue would be appreciated.
>
> Tim
>



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CAGSRWbh_O1iNavv3e=PKo3s4tP9gPz4k3V80-OncUE2WoFCtEw>