Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 3 Aug 2018 09:03:22 -0700
From:      bob prohaska <fbsd@www.zefox.net>
To:        Warner Losh <imp@bsdimp.com>
Cc:        "freebsd-arm@freebsd.org" <freebsd-arm@freebsd.org>, bob prohaska <fbsd@www.zefox.net>
Subject:   Re: RPI3 swap experiments (insufficient swap)
Message-ID:  <20180803160322.GA6825@www.zefox.net>
In-Reply-To: <CANCZdfpishDTo%2BQOKSvKNemFZJz4zLxtTKMdWJu=A6d=KVLXVw@mail.gmail.com>
References:  <23793AAA-A339-4DEC-981F-21C7CC4FE440@yahoo.com> <20180731231912.GF94742@www.zefox.net> <2222ABBD-E689-4C3B-A7D3-50AECCC5E7B2@yahoo.com> <20180801034511.GA96616@www.zefox.net> <201808010405.w7145RS6086730@donotpassgo.dyslexicfish.net> <6BFE7B77-A0E2-4FAF-9C68-81951D2F6627@yahoo.com> <20180802002841.GB99523@www.zefox.net> <20180802015135.GC99523@www.zefox.net> <20180803010932.GA4321@www.zefox.net> <CANCZdfpishDTo%2BQOKSvKNemFZJz4zLxtTKMdWJu=A6d=KVLXVw@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help
On Fri, Aug 03, 2018 at 02:49:57AM -0600, Warner Losh wrote:
> On Thu, Aug 2, 2018 at 7:09 PM, bob prohaska <fbsd@www.zefox.net> wrote:
> 
> > In the end the console carried a stream of what look like hardware errors
> > referencing
> > da0, but all was forgiven after reboot and a couple cycles of fsck -fy. An
> > excerpt
> > is in the readme file.
> >
> 
> Unless the OOM is somehow killing a kernel thread, these errors are the
> root cause of all the crazy. Something happens along the way and we find
> that our error recovery to that something is bad and we never complete more
> I/O to the device (if I read the earlier thread right). No good can come
> from this. 

Is it not odd that when OOMA acts prematurely no errors associated
with da0 make it to the console? 

> The OOM is then doing weird things, but when the device goes
> away we're pretty bad at coping right now. The question, though, is how can
> we improve the USB / umass parts of the stack to do enough error recovery
> so we can survive whatever glitch that's causing it to stop talking to the
> device (and/or avoid that glitch in the first place). The disk/ssd isn't
> really bad, but we have some bug either in the USB host adapter, the USB
> stack, or in the umass SIM (or some combination).
> 

Can you suggest any experiments a naive user might perform to help identify
the culprit? There's also a Pi2 running 11-stable  available. Right now it
has storage set up like so:

bob@www:~ % gpart show da0
=>        0  122544516  da0  BSD  (58G)
          0    4194304    1  freebsd-ufs  (2.0G)
    4194304    4194304    2  freebsd-swap  (2.0G)
    8388608    6291456    4  freebsd-ufs  (3.0G)
   14680064  107864452    5  freebsd-ufs  (51G)

bob@www:~ % gpart show mmcsd0
=>      63  15523777  mmcsd0  MBR  (7.4G)
        63    102375       1  !12  [active]  (50M)
    102438   1994714       2  freebsd  (974M)
   2097152  13426688          - free -  (6.4G)


It could be given extra swap partitions on mmcsd0 so as to mimic the 
insufficient swap problem on a 32 bit system. The Pi2 is known to suffer 
the premature OOMA kill problem, it's never been subjected to a deliberate
run-out-of-swap condition.

Might there be something in the stress2 test suite that would be more revealing
more promptly than -j4 buildworld? 

Thanks for reading!

bob prohaska
 





Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20180803160322.GA6825>