From: Mark Millard
To: freebsd-arm@freebsd.org
Subject: Re: Comparing the OverDrive 1000 (A57) vs. MACCHIATObin Double Shot (A72) for buildworld and via a CPU/cache/RAM tradeoff-exploring benchmark (links corrected)
Date: Mon, 2 Dec 2019 14:56:25 -0800

[Just correcting the links to be to .png files and correcting
some PowerMac11,2 related wording.]

On 2019-Dec-2, at 14:15, Mark Millard wrote:

> It looks like the OverDrive 1000 vs. MACCHIATObin Double
> Shot comparison ends up being an example of memory
> access making the difference for the specific workload:
> -j4 buildworld for head -r355027 (building itself
> from scratch).
>
> buildworld times (not needing an llvm bootstrap build):
>
> OverDrive 1000: 13895 sec (about 3.86 hrs)
> MACCHIATObin Double Shot: 16561 sec (about 4.60 hrs)
>
> So a little under 45 min difference when the mean
> and geometric mean are both a little over 4.2 hrs.
>
> SSD ufs file systems: One with Samsung 860 Pro, the
> other with Samsung 850 Pro. I do not expect that I/O
> made much of a difference, but I did nothing to measure
> such for the buildworld activity.
>
> OverDrive RAM: 8GiByte, half in each of the 2 slots
> MACCHIATObin RAM: 16GiByte, all in its 1 slot.
>
> MACCHIATObin: jumpers set for the fastest CPU/RAM
> speed for the Double Shot.
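The arithmetic behind the quoted buildworld figures (hours, the
difference in minutes, and the two means) can be re-derived with a
short script. This is only a sketch for checking the numbers: the two
times in seconds come from the message above, while the variable
names are illustrative assumptions and not part of the original
measurements.

```python
import math

# buildworld wall-clock times in seconds, as quoted in the message.
# The dictionary and variable names below are illustrative only.
times_sec = {
    "OverDrive 1000": 13895,
    "MACCHIATObin Double Shot": 16561,
}

# Convert each time to hours.
hours = {name: sec / 3600.0 for name, sec in times_sec.items()}

fast = times_sec["OverDrive 1000"]
slow = times_sec["MACCHIATObin Double Shot"]

# Difference between the two builds, in minutes.
diff_min = (slow - fast) / 60.0

# Arithmetic and geometric means of the two times, in hours.
mean_hr = (fast + slow) / 2.0 / 3600.0
geo_mean_hr = math.sqrt(fast * slow) / 3600.0

# How much longer the slower build took, as a ratio.
ratio = slow / fast

for name, hrs in hours.items():
    print(f"{name}: {hrs:.2f} hrs")
print(f"difference: {diff_min:.1f} min")
print(f"mean: {mean_hr:.2f} hrs, geometric mean: {geo_mean_hr:.2f} hrs")
print(f"slow/fast ratio: {ratio:.3f}")
```

The ratio line is an addition for convenience; the original message
reports only the times, the difference, and the approximate means.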
>
> A comparison graph from exploring single threaded
> and multi-threaded CPU/cache and RAM limited
> performance (a variation on the old HINT serial
> and pthread benchmarks) is shown at:

Corrected link:

https://github.com/markmi/acpphint/blob/master/acpphint_example_data/acpphint-OverDrive_1000_MacchDblShot-threads_4-LP64-g%2B%2B_9_8.3_O3-libc%2B%2B_libstdc%2B%2B-DSIZE_large_fast_types-RAM.png

> There are curves for various involved types:
> double (d), unsigned long long (ull), unsigned
> long (ul), unsigned int (ui). The match for
> ull and ul for the context provides some
> evidence of the variability observed.
>
> (The OverDrive and MACCHIATObin were not benchmarked
> for the graph at the same version of head: -r352341
> based vs. -r355027 based.)
>
> (I did not set things such that the benchmark run
> would explore paging getting involved. Thus there
> is basically no I/O considered in the comparison
> graph.)
>
> The MACCHIATObin clearly wins single threaded and
> its memory subsystem was well matched to the single
> threaded use when the same involved types are
> compared. (Single threaded are the blueish curves,
> MACCHIATObin having the lighter colors.)
>
> For multi-threaded in the range where RAM access
> limits things, the two systems are a close match.
> (Greenish colors, right side of plot, upper
> curves.)
>
> The range where the OverDrive 1000 is clearly faster
> is part of the middle of the multi-threaded curves.
> (This might be tied to whatever is done with the
> dual RAM slot structure or to the amount of caching,
> or some such; I do not know the details.)
>
> I would expect "-j1 buildworld" would take less time
> on the MACCHIATObin than on the OverDrive, but I'm
> not planning on measuring that.
>
>
>
> A more historical comparison, old PowerMac11,2
> (2 sockets, 2 cores each) vs.
> the MACCHIATObin,
> both having 16 GiBytes of RAM:
>
> For analogous benchmark graphs (matching types),
> the MACCHIATObin single threaded is faster than
> the old PowerMac11,2 single threaded and also is
> usually faster than that 11,2's multi-threaded
> benchmark data as well.

I should have pointed out that the MACCHIATObin single
threaded and PowerMac11,2 multi-threaded results are
similar where memory access limits things, with use of
double (d) being a little slower on the MACCHIATObin in
this region.

> Multi-threaded, the
> MACCHIATObin is faster for the exploration by
> the benchmark.

Corrected link:

https://github.com/markmi/acpphint/blob/master/acpphint_example_data/acpphint-MacchDblShot_PowerMac11%2C2-threads_4-LP64-g%2B%2B_9_O3-libc%2B%2B-DSIZE_large_fast_types-RAM.png

> I expect that this is interesting for the likely
> difference in power usage during the benchmarking.
> (Not that I've measured the power usage.)
>
> (The FreeBSD head vintages are not the same in
> the graph: -r355027 based vs. -r352341 based.)
>

===
Mark Millard
marklmi at yahoo.com
( dsl-only.net went away in early 2018-Mar)