Date:      Wed, 21 Dec 2016 13:58:46 +0200
From:      Konstantin Belousov <kostikbel@gmail.com>
To:        Steve Wills <swills@FreeBSD.org>
Cc:        John Baldwin <jhb@freebsd.org>, freebsd-current@freebsd.org, Adrian Chadd <adrian.chadd@gmail.com>, Slawa Olhovchenkov <slw@zxy.spb.ru>
Subject:   Re: Enabling NUMA in BIOS stop booting FreeBSD
Message-ID:  <20161221115846.GJ94325@kib.kiev.ua>
In-Reply-To: <d3a7a24f-c2bb-914d-e62b-a99c247efd9f@FreeBSD.org>
References:  <20161214102711.GF94325@kib.kiev.ua> <20161215224500.GM98176@zxy.spb.ru> <CAJ-Vmon8NX9c0C6rK896UoSDdT=L0MVLuNkciuwzrgv9dEYjuA@mail.gmail.com> <1995191.rPAPaoHxgb@ralph.baldwin.cx> <d3a7a24f-c2bb-914d-e62b-a99c247efd9f@FreeBSD.org>

On Tue, Dec 20, 2016 at 04:49:29PM -0500, Steve Wills wrote:
> Hi,
> 
> On 12/16/2016 16:20, John Baldwin wrote:
> > On Thursday, December 15, 2016 03:57:58 PM Adrian Chadd wrote:
> >> heh, an updated BIOS that solves the problem will solve the problem. :)
> >>
> >> I think you have enough information to provide to supermicro. Ie,
> >> "SMAP says X, when physical memory pages at addresses X are accessed,
> >> they don't behave like memory, maybe something is wrong".
> >>
> >> All I can think of is some hack to add a blacklist for that region so
> >> you can boot the unit. But it makes me wonder what else is going on.
> > 
> > We have the blacklist: it is the memory test.  That is the way to work around
> > this type of BIOS breakage.  This is just the first time in over a decade that
> > test has been relevant.
> 
> I've got a SuperMicro X10SRA board that I bought back in March, I think.
> It has run CURRENT fine since then, until last month, when it started
> hanging during boot. I was about to update it to a new version of
> CURRENT when it started hanging at boot, but hadn't updated yet. The
> hang is after (verbose boot):
What is the exact version of the kernel you are running that hangs?
Try to bisect.

Do you have the EARLY_AP_STARTUP option in the kernel config?
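
One way to answer both questions from a running system (for example, one booted with SMP disabled) is sketched below. It assumes the installed kernel lives at /boot/kernel/kernel and was built with INCLUDE_CONFIG_FILE, so that config(8) can extract the embedded configuration; the helper name has_option is made up for this example.

```shell
#!/bin/sh
# has_option (hypothetical helper): succeed if the kernel config read from
# stdin contains an "options <NAME>" line for the given option name.
has_option() {
    grep -Eq "^options[[:space:]]+$1([[:space:]]|\$)"
}

# Only query the system when config(8) and the installed kernel are present.
if command -v config >/dev/null 2>&1 && [ -r /boot/kernel/kernel ]; then
    # Exact kernel version and build information.
    uname -a
    # config -x prints the configuration embedded in the kernel file.
    # "absent" is also reported if the kernel carries no embedded config.
    if config -x /boot/kernel/kernel | has_option EARLY_AP_STARTUP; then
        echo "EARLY_AP_STARTUP: present"
    else
        echo "EARLY_AP_STARTUP: absent"
    fi
fi
```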

> 
> ACPI APIC Table: <SUPERM SMCI--MB>
> Package ID shift: 4
> L3 cache ID shift: 4
> L2 cache ID shift: 1
> L1 cache ID shift: 1
> Core ID shift: 1
Send an NMI with 'ipmi power diag' and show the machine state from ddb.
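
As a rough sketch, with ipmitool run from another host the diagnostic interrupt can be sent as shown below. The BMC address and user name are placeholders, and the DRY_RUN switch is an addition for this example only. This assumes the hung kernel includes the KDB and DDB options, so that the NMI drops the console into the debugger.

```shell
#!/bin/sh
# Placeholder BMC address and user; replace with the real values.
BMC_HOST="${BMC_HOST:-bmc.example.org}"
BMC_USER="${BMC_USER:-ADMIN}"

# Pulse the diagnostic interrupt (NMI) on the remote machine.
# -E makes ipmitool read the password from the IPMI_PASSWORD environment
# variable. With DRY_RUN set to a non-empty value the command is printed
# instead of executed.
send_nmi() {
    ${DRY_RUN:+echo} ipmitool -I lanplus -H "$BMC_HOST" -U "$BMC_USER" -E chassis power diag
}

# Once the console is at the ddb prompt, commands such as these show the
# machine state:
#   show allpcpu
#   ps
#   alltrace
```

Call send_nmi while the machine is hung, then capture the ddb output from the serial or IPMI console.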

> 
> Recently I've tried booting 9.3 and 10.3 on it without success. Other
> operating systems boot fine. Thinking the hang was similar to the one in
> this thread (or at least the board is), I tried many different BIOS
> changes and also tried enabling the memory test, but none of that
> changes anything. This is a single socket board so there are no NUMA or
> memory interleaving options in the BIOS. The BIOS is up to date (2.0a).
> It will boot if SMP is disabled. That's obviously sub-optimal, but is
> useful for building updated kernels, which I've tried. If anyone has any
> suggestions or ideas, I'd appreciate it.
> 
> Thanks,
> Steve
> 





