Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 10 Feb 2017 00:02:40 +0200
From:      Toomas Soome <tsoome@me.com>
To:        Oleksandr Tymoshenko <gonzo@bluezbox.com>
Cc:        Karl Denninger <karl@denninger.net>, freebsd-current@freebsd.org
Subject:   Re: Crochet build for Pi3 fails to boot on r313441 (and later), works on r313109
Message-ID:  <B9B8FCBF-3176-417B-81E3-7F5E054AD6ED@me.com>
In-Reply-To: <20170209213916.GA40599@bluezbox.com>
References:  <517ab0d5-412a-35dd-7d0d-d8297af43b46@denninger.net> <4a6f872b-cee1-57e5-7a72-a1d445f9926f@denninger.net> <F4D44CDD-BEF6-4E21-848A-745B88440617@me.com> <5128b9a9-1186-8c6c-6227-e5e8a087cf89@denninger.net> <0B4B40AA-E654-4A71-92B0-D8E6CD234B63@me.com> <20170209213916.GA40599@bluezbox.com>

next in thread | previous in thread | raw e-mail | index | archive | help

> On 9. veebr 2017, at 23:39, Oleksandr Tymoshenko <gonzo@bluezbox.com> =
wrote:
>=20
> Toomas Soome (tsoome@me.com) wrote:
>>=20
>>> On 9. veebr 2017, at 17:05, Karl Denninger <karl@denninger.net> =
wrote:
>>>=20
>>>=20
>>> On 2/9/2017 08:58, Toomas Soome wrote:
>>>>> On 9. veebr 2017, at 16:36, Karl Denninger <karl@denninger.net> =
<mailto:karl@denninger.net> wrote:
>>>>>=20
>>>>>=20
>>>>> On 2/8/2017 16:18, Karl Denninger wrote:
>>>>>> r313441 blows up on the Pi3 in /boot/loader.efi with:
>>>>>>=20
>>>>>> FreeBSD/arm64 EFI loader, Revision 1.1
>>>>>> (Tue Feb  7 15:15:52 CST 2017 freebsd@NewFS.denninger.net =
<mailto:freebsd@NewFS.denninger.net>)
>>>>>> Failed to start image provided by UFS (14)
>>>>>> "Synchronous Abort" handler, esr 0x96000004
>>>>>> ELR:     3af62cec
>>>>>> LR:      3af61d60
>>>>>> x0 : 0000000000000001 x1 : 0000000000000001
>>>>>> x2 : 000000003afeb000 x3 : 000000000000003f
>>>>>> x4 : 0000000000000020 x5 : 0000000000000010
>>>>>> x6 : 0000000000000000 x7 : 0000000039b260a4
>>>>>> x8 : 000000003af61d48 x9 : 000000000000000d
>>>>>> x10: 0000000000000030 x11: 0000000000000000
>>>>>> x12: 0000000000000000 x13: 0000000000000002
>>>>>> x14: 0000000000000000 x15: 0000000000000000
>>>>>> x16: 0000000000000000 x17: 0000000000000000
>>>>>> x18: 000000003ab30df8 x19: 0000000037a16008
>>>>>> x20: 0000000000000000 x21: 0000000000000000
>>>>>> x22: 0000000039b28000 x23: 0000000039b1d49c
>>>>>> x24: 0000000039b28850 x25: 000000003ab3d740
>>>>>> x26: 000000003af839a0 x27: 0000000039b2e3e8
>>>>>> x28: 0000000000000000 x29: 000000003ab2ef60
>>>>>>=20
>>>>>> Resetting CPU ...
>>>>>>=20
>>>>>> If you copy in a loader.efi from an earlier build (e.g. r313109) =
then the system boots but complains about SMP problems, fails to start =
any of the other CPUs (although it sees them) and panics before it =
reaches a login prompt with what appears to be a problem reading the SD =
card (I also get a couple of lor's in here too..... not sure if those =
are "real" or false positives)
>>>>>>=20
>>>>>> B
>>>>> This has been isolated to r313333 in sys/boot/efi; reverting the =
EFI
>>>>> loader to a previous revision stops the crash.
>>>>>=20
>>>>> Filed here: =
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D216940 =
<https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D216940>; =
<https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D216940>; =
<https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D216940>;
>>>>>=20
>>>> Does it still crash with r313442? I think it does and this is why:
>>>>=20
>>>> =46rom your log above, the hint is "Failed to start image provided =
by UFS (14)=E2=80=9D, so what we can guess here is that for some reason =
the loader.efi main() failed to detect the boot device, and did return =
back to boot1.
>>>>=20
>>>> Boot1 did print out this error message and did call panic(). So, =
the question is, why it is failing to detect the root fs handle. I=E2=80=99=
ll try to check if I can replicate the issue with x86 + ufs.
>>>>=20
>>>> BTW: sorry for trouble:)
>>>>=20
>>>> rgds,
>>>> toomas
>>>>=20
>>> Yes.
>>>=20
>>> It's isolated to that particular revision, which appears to have =
reworked the enumeration of the available devices to boot from.  =
Reverting only sys/boot/efi to anything before 313333 (e.g. "svn update =
-r 313332 ." in src/sys/boot/efi) and rebuilding results in a loader.efi =
that successfully loads and starts the kernel.
>>>=20
>>=20
>> Well, the x86 version does not appear to have problems with finding =
the ufs devices. So this has to be some sort of corner case related to =
arm:( unfortunately I do not have much variants to test arm, except =
qemu=E2=80=A6 so some help would be needed there. Since the only crash =
is from boot1 call to panic, I am pretty sure the disk detection and =
setup was ok, so we should get disk list if we insert something like:
>>=20
>> for (i =3D 0; devsw[i] !=3D NULL; i++) {
>>        if (devsw[i]->dv_print !=3D NULL){
>>            if (devsw[i]->dv_print(verbose))
>>                break;
>>        }
>> }
>>=20
>> after dv_init() loop.
>>=20
>> If thats true, then it should not take too much to find why we fail =
to get the handle for root fs in case of arm=E2=80=A6=20
>=20
>=20
> On RPi3 U-Boot EFI API provided by U-Boot and it's somewhat =
non-standard
> comapring to x86 EFI or specialized EFI firmwares of ARM64. I did some
> debugging and found that U-Boot reports driver with subtype =
MEDIA_FILEPATH_DP
> so it's not recognized by disk handler. Quick hack below fixed boot
> but it's not ideal and the device structure looks like this:
>=20
> disk devices:
>    disk0:    15564801 X 512 blocks (removable)
>      disk0s1: DOS/Windows         49MB
>      disk0s2: FreeBSD             1856MB
>        disk0s2a: FreeBSD UFS      1856MB
>    disk1:    102375 X 512 blocks (removable)
>    disk2:    3802112 X 512 blocks (removable)
>      disk2a: FreeBSD UFS          1856MB
> net devices:
>    net0:
>=20
>=20
> disk1 and disk2 are actuallypartitions from disk0.
> I can work on proper fix but not sure what should be
> considered proper in this case. So some guidelines are welcome


I think Karl got or is getting the same result as I am writing - we did =
reach to the same point:)

Yep, the idea behind the efipart_hdinfo_add() is that the first argument =
is handle to the disk itself, so we can sort the partition handles into =
the partition list in the disk.

I think the logical step from there would be to dig out the path =
structure and see if we can use something to identify the disk itself.

So, the interesting part here also is that in your case it did dig out =
the BSD label too.


Ok, so the MEDIA_FILEPATH_DP has last node as CHAR16 file name, so we =
need to pick the last node and cast to FILEPATH_DEVICE_PATH and then we =
can see what that CHAR16 there has:)

Also we may need to inspect all the nodes on the path, then we have =
idea.

Once we have method to identify the disk itself, then we should have the =
solution.

rgds,
toomas

>=20
> Patch:
> Index: boot/efi/libefi/efipart.c
> =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D
> --- boot/efi/libefi/efipart.c   (revision 313477)
> +++ boot/efi/libefi/efipart.c   (working copy)
> @@ -467,6 +467,15 @@
>                        efipart_hdinfo_add(handle, efipart_handles[i]);
>                        continue;
>                }
> +
> +               /*
> +                * U-Boot case
> +                */
> +               if (DevicePathType(node) =3D=3D MEDIA_DEVICE_PATH &&
> +                   DevicePathSubType(node) =3D=3D MEDIA_FILEPATH_DP) =
{
> +                       efipart_hdinfo_add(efipart_handles[i], =
efipart_handles[i]);
> +                       continue;
> +               }
>        }
> }




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?B9B8FCBF-3176-417B-81E3-7F5E054AD6ED>