Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 22 Jan 2020 10:12:05 -0500
From:      Jason Van Patten <jvp@lateapex.net>
To:        freebsd-questions@freebsd.org
Subject:   12.1 RELEASE General Protection Fault (Trap 9)
Message-ID:  <22046a36-12d3-032a-6325-24e18b1a855b@lateapex.net>

next in thread | raw e-mail | index | archive | help
Hey gang -

Since sometime before Christmas (as far as I know), my NAS has started randomly crashing, reloading, and saving cores in /var/crash.  It was doing this with 12.0 and now with 12.1.  My gut tells me it's hardware related, but I'm not quite sure.  The various bits and pieces are:

 From dmesg:
Fatal trap 9: general protection fault while in kernel mode
cpuid = 1; apic id = 01
instruction pointer     = 0x20:0xffffffff80f09ff9
stack pointer           = 0x28:0xfffffe0087f0b500
frame pointer           = 0x28:0xfffffe0087f0b550
code segment            = base 0x0, limit 0xfffff, type 0x1b
                         = DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 27 (dom0)
trap number             = 9
panic: general protection fault
cpuid = 1
time = 1579594170
KDB: stack backtrace:
#0 0xffffffff80c1d297 at kdb_backtrace+0x67
#1 0xffffffff80bd05cd at vpanic+0x19d
#2 0xffffffff80bd0423 at panic+0x43
#3 0xffffffff810a7dcc at trap_fatal+0x39c
#4 0xffffffff810a71dc at trap+0x6c
#5 0xffffffff81081a0c at calltrap+0x8
#6 0xffffffff80f09704 at bucket_cache_drain+0x144
#7 0xffffffff80f03413 at zone_drain_wait+0xa3
#8 0xffffffff80f07a5d at uma_reclaim_locked+0x7d
#9 0xffffffff80f07986 at uma_reclaim+0x36
#10 0xffffffff80f2ac06 at vm_pageout_worker+0x436
#11 0xffffffff80f2a796 at vm_pageout+0x176
#12 0xffffffff80b90c23 at fork_exit+0x83
#13 0xffffffff81082a4e at fork_trampoline+0xe
Uptime: 12d21h37m44s

And from the files in /var/crash:

# cat core.txt.7
/dev/stdin:1: Error in sourced command file:
Cannot access memory at address 0x65657246
/dev/stdin:1: Error in sourced command file:
Cannot access memory at address 0x65657246
/dev/stdin:1: Error in sourced command file:
Cannot access memory at address 0x65657246
Unable to find matching kernel for /var/crash/vmcore.7

# cat info.7
Dump header from device: /dev/ada2p2
   Architecture: amd64
   Architecture Version: 2
   Dump Length: 4114960384
   Blocksize: 512
   Compression: none
   Dumptime: Tue Jan 21 03:09:30 2020
   Hostname: bane
   Magic: FreeBSD Kernel Dump
   Version String: FreeBSD 12.1-RELEASE-p1 GENERIC
   Panic String: general protection fault
   Dump Parity: 2696097602
   Bounds: 7
   Dump Status: good

I have a 4GB vmcore that I might be able to upload to S3 if anyone wants to crawl through it and look.  Let me know if so and I'll work on that one.

For record, the hardware in question:

- Supermicro MBD-X10SAT-O motherboard
- Intel Core i7 4790 CPU
- 16GB DDR3
- 128GB Plextor PCI-E SSD for the OS

The crashes don't seem to have a pattern to them; no certain time of day nor do they relate to any specific activity.  The NAS itself is running:
- OS
- NFS
- SMB
- AFP
- postgresql
- mysql
- apache
- git server

Any guesses?  Do I have some bad RAM or some such?

Thanks!
--
Jason Van Patten



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?22046a36-12d3-032a-6325-24e18b1a855b>