Skip site navigation (1)Skip section navigation (2)
Date:      Sun, 20 Aug 2017 08:55:08 +0000
From:      bugzilla-noreply@freebsd.org
To:        freebsd-bugs@FreeBSD.org
Subject:   [Bug 221029] AMD Ryzen: strange compilation failures using poudriere or plain buildkernel/buildworld
Message-ID:  <bug-221029-8-wcWlL8CVjv@https.bugs.freebsd.org/bugzilla/>
In-Reply-To: <bug-221029-8@https.bugs.freebsd.org/bugzilla/>
References:  <bug-221029-8@https.bugs.freebsd.org/bugzilla/>

next in thread | previous in thread | raw e-mail | index | archive | help
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D221029

--- Comment #81 from Mark Millard <markmi@dsl-only.net> ---
(In reply to Don Lewis from comment #80)

FYI for my tests:

So far with kern.sched.steal_idle=3D0 I've never had
lang/ghc fail to build.

So far with kern.sched.steal_idle=3D1 I've always had
lang/ghc fail to build --before it even started using
ghc-stage1. (I monitored via top.) These failures do
seem random for at what point of the lang/ghc build
hat the failure happens. (Builds that work take
vastly longer to complete as well.)

I've done a half dozen build attempts or so each way
over those two alternatives (so far).

lang/ghc builds failed for kern.sched.steal_idle=3D1
every time that I've tried it, even when poudriere
was configured to avoid parallel builds (including
and avoiding ALLOW_MAKE_JOBS) and lang/ghc was the
only thing being built, prerequisites already in
place.

System-wide memory bandwidth usage being high does
not seem to be important to having the lang/ghc builds
fail.

Most of the failure examples are for CPU temperatures
under 40 DegC. Only for hot days in contexts without
air conditioning have I seen as high as 44 DegC for
the Ryzen PC. (Liquid cooled, two 360 mm radiators,
ten 120 mm fans, low end graphics card, and so on:
all biased to keeping temperatures down.)

Die temperature does not seem to be important to having
the lang/ghc builds fail.

(It is too preliminary to depend on yet but so far the
kern.sched.steal_idle=3D0 "poudriere bulk -a" has had no
SIGSEGV or SIGBUS like failures for any ports, unlike
when I'd tried with kern.sched.steal_idle=3D1 .)

I do not expect the kern.sched.steal_idle=3D0 results to be
luck at this point (up to any kern.sched.balance=3D1
contribution if involved).

--=20
You are receiving this mail because:
You are the assignee for the bug.=



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?bug-221029-8-wcWlL8CVjv>