Skip site navigation (1)Skip section navigation (2)
Date:      Sun, 21 Oct 2018 17:11:50 -0700
From:      Mark Millard <marklmi@yahoo.com>
To:        Warner Losh <imp@bsdimp.com>
Cc:        FreeBSD Current <freebsd-current@freebsd.org>, FreeBSD-STABLE Mailing List <freebsd-stable@freebsd.org>
Subject:   Re: head -r339076's boot loader fails to boot threadripper 1950X system (BTX halted); an earlier version works [ WITHOUT_ZFS= fixes it ]
Message-ID:  <83E0D72D-04AD-48A6-89B0-4E4D6B79D749@yahoo.com>
In-Reply-To: <30DD2F47-C8CB-4CEC-8563-C7083D0EAEEF@yahoo.com>
References:  <D99F88D8-91CF-4BAD-A7F1-3D93D947BD46@yahoo.com> <2A425DE4-2B5B-474D-8B95-81890DE4D8A1@yahoo.com> <9D2A6528-F888-4833-A52B-8F9B4D66592C@yahoo.com> <CANCZdfq3376cb8df_D_feJZzCFzkX=3p_=2mggpOHiaZ95uEKw@mail.gmail.com> <30DD2F47-C8CB-4CEC-8563-C7083D0EAEEF@yahoo.com>

next in thread | previous in thread | raw e-mail | index | archive | help
[Building and installing based on WITHOUT_ZFS=3D allows the
resulting loader to work correctly on the 1950X.]

On 2018-Oct-21, at 12:05 AM, Mark Millard <marklmi@yahoo.com> wrote:

> On 2018-Oct-20, at 10:32 PM, Warner Losh <imp at bsdimp.com> wrote:
>=20
>> On Sat, Oct 20, 2018 at 11:04 PM Mark Millard <marklmi at yahoo.com> =
wrote:
>> [I found what change lead to the 1950X boot crashing
>> with BTX halted.]
>>=20
>>> On 2018-Oct-20, at 12:44 PM, Mark Millard <marklmi at yahoo.com> =
wrote:
>>>=20
>>>> [Adding some vintage information for a loader
>>>> that allowed a native boot.]
>>>>=20
>>>> On 2018-Oct-20, at 4:00 AM, Mark Millard <marklmi at yahoo.com> =
wrote:
>>>>=20
>>>>> I attempted to jump from head -r334014 to -r339076
>>>>> on a threadripper 1950X board and the native
>>>>> FreeBSD boot failed very early. (Hyper-V use of
>>>>> the same media did not have this issue.)
>>>>>=20
>>>>> But copying over an older /boot/loader from another
>>>>> storage device with a FreeBSD head version that has
>>>>> not been updated yet got past the problem being
>>>>> reported here. (For other reasons, the kernel has
>>>>> been moved back to -r338804 --and with that,
>>>>> and the older /boot/loader, the 1950X native-boots
>>>>> FreeBSD all the way just fine.)
>>>>=20
>>>> I found one /boot/loader.old that was dated
>>>> in the update'd file system as 2018-May 20,
>>>> instead of 2018-Apr-03 from the older file
>>>> system. May 20 would apparently mean a little
>>>> below -r334014 . It native-booted okay, as did
>>>> the April one.
>>>>=20
>>>> [I do not know how to inspect a /boot/loader*
>>>> to find out what -r?????? it is from.]
>>>>=20
>>>> Unfortunately, I had done more than one -r339076
>>>> install from -r334014 before rebooting and
>>>> no -r334014 loaders were still present:
>>>> the other *.old files from a few minutes before
>>>> the ones I had the boot problem with.
>>>>=20
>>>> I might be able to extract loaders from various:
>>>>=20
>>>> =
https://artifact.ci.freebsd.org/snapshot/head/r*/amd64/amd64/base.txz
>>>>=20
>>>> materials and try substituting them in order to
>>>> narrow the range for works -> fails. If I can,
>>>> this likely would take a fair amount of time in
>>>> my context.
>>>>=20
>>>> Other notes:
>>>>=20
>>>> It turns out that only Hyper-V based use needed
>>>> a -r334804 kernel: Native booting with the older
>>>> loaders and newer kernels works fine.
>>>>=20
>>>> Windows 10 Pro 64bit also has no problems
>>>> booting and operating the machine.
>>>>=20
>>>> The native-boot problem does seem to be freeBSD
>>>> loader-vintage specific.
>>>>=20
>>>>> For the BTX failure the display ends up with
>>>>> (hand transcribed, ". . ." for an omission):
>>>>>=20
>>>>> BTX loader 1.00 BTX version is 1.02
>>>>> Console: internal video/keyboard
>>>>> BIOS drive C: is disk0
>>>>> . . .
>>>>> BIOS drive P: is disk13
>>>>> -
>>>>> int=3D00000000  err=3D00000000  efl=3D00010246  eip=3D000096fd
>>>>> eax=3D74d48000  ebx=3D74d4e5e0  ecx=3D00000011  edx=3D00000000
>>>>> esi=3D74d4e380  edi=3D74d4e5b0  ebp=3D00091da0  esp=3D00091d60
>>>>> cs=3D002b  ds=3D0033  es=3D0033    fs=3D0033  gs=3D0033  ss=3D0033
>>>>> cs:eip=3D66 f7 77 04 0f b7 c0 89-44 24 0c 89 5c 24 04 8b
>>>>>     45 08 89 04 24 83 64 24-10 00 c7 44 24 08 01 00
>>>>> ss:esp=3D00 00 00 00 00 00 00 00-00 00 00 00 00 00 00 00
>>>>>     00 00 00 00 00 00 00 00-f0 1d 89 00 00 00 00 00
>>>>> BTX halted
>>>>=20
>>>> I've no clue what of that output might be loader vintage
>>>> specific. It might not be of use without knowing the
>>>> exact build of the loader.
>>>>=20
>>>>> The board is a GIGABYTE X399 AORUS Gaming 7 (rev 1.0).
>>>>> It has 96 GiBytes of ECC RAM, just 6 DIMMs installed.
>>>>=20
>>>> For reference for the board's BIOS:
>>>>=20
>>>> Version: F11e
>>>> Dated: 2018-Sep-17
>>>> Description: Update AGESA 1.1.0.1a
>>>=20
>>> Using:
>>>=20
>>> =
https://artifact.ci.freebsd.org/snapshot/head/r*/amd64/amd64/base.txz
>>>=20
>>> materials I found that:
>>>=20
>>> -r336492: worked (loader vs. zfsloader: not linked)
>>> (no more amd64 builds until . . .)
>>> -r336538: failed (loader vs. zfsloader: linked)
>>>=20
>>> (Later ones that I tried also failed.)
>>>=20
>>> Looks like this broke for booting the 1950X=20
>>> system in question when the following was
>>> checked in:
>>>=20
>>> Author: imp
>>> Date: Fri Jul 20 05:17:37 2018
>>> New Revision: 336532
>>> URL:=20
>>> https://svnweb.freebsd.org/changeset/base/336532
>>>=20
>>>=20
>>> Log:
>>>  Collapse zfsloader functionality back down into loader.
>>>=20
>> Yea, this shouldn't matter. It worked on all the systems I tried it =
on.
>>=20
>> So my first question: is this a ZFS system? Second, does it also have =
UFS? If yes to both, which one do you want it to boot off of?
>=20
> No zfs in use at all. It has been years since
> I experimented with ZFS and reverted back to
> UFS.
>=20
> # gpart show -l
> =3D>       40  937703008  da0  GPT  (447G)
>         40       1024    1  FBSDFSSDboot  (512K)
>       1064  746586112    2  FBSDFSSDroot  (356G)
>  746587176   31457280    3  FBSDFSSDswap  (15G)
>  778044456  159383552    4  FBSDFSSDswap2  (76G)
>  937428008     275040       - free -  (134M)
> . . .
>=20
> Doing:
>=20
> gpart bootcode -p /boot/gptboot -i 1 da0
>=20
> and the trying a modern /boot/loader
> did not change anything: still "BTX halted"
> for a native boot. (No problem under Hyper-V.)

I added WITHOUT_ZFS=3D to my equivalents of src.conf
files for targeting amd64, built, and installed.
The result native-boots just fine.

The crash is somehow specific to loader code
tied to LOADER_ZFS_SUPPORT being defined.

Of course, this leaves me unable to native-boot an
official, modern, unmodified build on the 1950X
machine.

While I do not actively use ZFS these days, I'd
always left it built and installed in case I
decided to do something with it at some point.
I do not normally try to minimize configurations.


=3D=3D=3D
Mark Millard
marklmi at yahoo.com
( dsl-only.net went
away in early 2018-Mar)




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?83E0D72D-04AD-48A6-89B0-4E4D6B79D749>