From owner-freebsd-arm@freebsd.org Mon Dec 2 22:15:32 2019 Return-Path: Delivered-To: freebsd-arm@mailman.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mailman.nyi.freebsd.org (Postfix) with ESMTP id EDD381BA431 for ; Mon, 2 Dec 2019 22:15:32 +0000 (UTC) (envelope-from marklmi@yahoo.com) Received: from sonic301-22.consmr.mail.gq1.yahoo.com (sonic301-22.consmr.mail.gq1.yahoo.com [98.137.64.148]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 47RfZD18ppz4TK5 for ; Mon, 2 Dec 2019 22:15:31 +0000 (UTC) (envelope-from marklmi@yahoo.com) X-YMail-OSG: hyXnFU0VM1kClqYGWfPzlQR9oi8RWfm45Qmyi5Qruxzk3KzbenCKbKicoyj9qW5 mPXu0JyBJcZcjWgA8hOoG6.gTDgewYucP3_IdNfB17fDp4teIO5xs8L_nzX.PcoKOlYKOTGHGJPz mAjNEs1IDE.IPiUVp1i1x_J6acfXswW_ZOJDGA7FaHch9Apam9rGRl3IGwWCBf_w4_6W_wG2VYmh sUwwi6ZXcZaQTDEOD98No8iu6knqf.djNm6jWVUosXWKbsAVnu4a2ZubFxRGubwpaUZaFrMaPG72 FWoIAMj42cVJVOUgxJuGDuAqJUphQiwIFfXvF4pClXW70CiOszo9jQyvRe_rwfyHKzxSAqhWuDkp kj3440YwPg1p43AkhvgelluxwqTWYdoKFg8b1alJRsFSNwlxUhY0aQB0bB4ObqqEy5UFxAgT6CGT WtiKs5PzmfIY1eQKzxfbiMST2tZ2zMx6CFvYwBMpfOI6O4xz81W0jkHPSNTUaHwsj6YxyEYTapp4 1nj3I_uKCxBYv17mHZYqQ7pbndPPqSQy7VeZPEz57XtCMpVi5iXJ1SiwjKtaSFtciJwzYqnxtz8b U8efzknxvFwESQq3s5bvqqzX6aohkWct7CeV2lj7wQ5dCzwbMZKFOcAMddSSZso_fXn39lPwJ7I4 YTkUbmpPUgBpLTi52Ohb7EDslpdzgWw.j5cJWgYLeEQfQonE5i7Nz4DpvZCIcOuHZxO_LwNogxU9 uP6Tutde9rn5.wAFwQCtvdEEwdX05V1u5u.X644sBBIRGhKsxHZRxwNR18RRxQUX8cLt8Nja_OGU a.ZFi1lmCUQVbDAZ3sS5l2ZY4mtOoGX4kyphWFD2qB52jVhX.9KtYij0P_.CXWuSxf55BFkSCFuK GQertpMhEd2ChUDvn51lhhr7UPNfDQS5ItfjD4DTj82QPwIe_FvDDmTvWPlKQ3OEGNUgZ78azFz6 ROyaCTEfDSOtAy3y4sPReIeAbKYW9BBa.Cp5AQ5e_ZqK5_n.aiU1EJnurJPXPuGin4FQ9e9xylph lv1UCxlFGimYIcu9U1B9Ro0WqGlkW7RjfhYzIxMNlwe2kcFop3eb.aI2zYuaevGoRSRQ9qb5ECIC UhiVsgqJi..l.LijbOfkNl3C6jJ6VD0UpvzxBQS6c2yepIl.XILxUFYeoi8tudRtvhV6icLI3hiU sNzdqhChk_5G9SAlhcHT899n1CT3MPUuuXD2mkbynp0mr.0.wqFzZfGTDYdwJ_NwSqo5pOLu5Bhg T4vq1RamOjNdaEnfqm6b87HU9GhzzFwaJZ8zwvtdmcfpCcINDi0pbtKlAYFY5ZAGb9.UJegxKuzH ffqsiIxxUJQ.Uouh2l8hgGo7zgHYjkPcxfQVS8nDcVv.0eOB9nUMG70pmxL9oYljxkKStniQVu_W UegRrKTLlLczRAz4XjCg1 Received: from sonic.gate.mail.ne1.yahoo.com by sonic301.consmr.mail.gq1.yahoo.com with HTTP; Mon, 2 Dec 2019 22:15:30 +0000 Received: by smtp414.mail.ne1.yahoo.com (Oath Hermes SMTP Server) with ESMTPA ID a95c4a4951f45b3a79db0318a57106be; Mon, 02 Dec 2019 22:15:27 +0000 (UTC) From: Mark Millard Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Mime-Version: 1.0 (Mac OS X Mail 13.0 \(3601.0.10\)) Subject: Comparing the OverDrive 1000 (A57) vs. MACCHIATObin Double Shot (A72) for buildworld and via a CPU/cache/RAM tradeoff-exploring benchmark Message-Id: <92E7B63A-E790-4815-9D91-2161A4F66B71@yahoo.com> Date: Mon, 2 Dec 2019 14:15:26 -0800 To: freebsd-arm@freebsd.org X-Mailer: Apple Mail (2.3601.0.10) References: <92E7B63A-E790-4815-9D91-2161A4F66B71.ref@yahoo.com> X-Rspamd-Queue-Id: 47RfZD18ppz4TK5 X-Spamd-Bar: / X-Spamd-Result: default: False [0.05 / 15.00]; ARC_NA(0.00)[]; R_DKIM_ALLOW(-0.20)[yahoo.com:s=s2048]; FROM_HAS_DN(0.00)[]; R_SPF_ALLOW(-0.20)[+ptr:yahoo.com]; FREEMAIL_FROM(0.00)[yahoo.com]; MIME_GOOD(-0.10)[text/plain]; TO_DN_NONE(0.00)[]; RCVD_COUNT_TWO(0.00)[2]; RCPT_COUNT_ONE(0.00)[1]; NEURAL_SPAM_MEDIUM(0.23)[0.226,0]; RCVD_TLS_LAST(0.00)[]; TO_MATCH_ENVRCPT_ALL(0.00)[]; DKIM_TRACE(0.00)[yahoo.com:+]; DMARC_POLICY_ALLOW(-0.50)[yahoo.com,reject]; RCVD_IN_DNSWL_NONE(0.00)[148.64.137.98.list.dnswl.org : 127.0.5.0]; MV_CASE(0.50)[]; IP_SCORE(0.00)[ip: (4.70), ipnet: 98.137.64.0/21(0.88), asn: 36647(0.70), country: US(-0.05)]; IP_SCORE_FREEMAIL(0.00)[]; NEURAL_SPAM_LONG(0.33)[0.328,0]; FROM_EQ_ENVFROM(0.00)[]; MIME_TRACE(0.00)[0:+]; FREEMAIL_ENVFROM(0.00)[yahoo.com]; ASN(0.00)[asn:36647, ipnet:98.137.64.0/21, country:US]; MID_RHS_MATCH_FROM(0.00)[]; DWL_DNSWL_NONE(0.00)[yahoo.com.dwl.dnswl.org : 127.0.5.0] X-BeenThere: freebsd-arm@freebsd.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "Porting FreeBSD to ARM processors." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 02 Dec 2019 22:15:33 -0000 It looks like the OverDrive 1000 vs. MACCHIATObin Double Shot comparison ends up being an example of memory access making the difference for the specific workload: -j4 buildworld for head -r355027 (building itself from scratch). buildworld times (not needing a llvm bootstrap build): OverDrive 1000: 13895 sec (about 3.86 hrs) MACCHIATObin Double Shot: 16561 sec (about 4.60 hrs) So a little under 45 min difference when the mean and geometric mean are both a little over 4.2 hrs. SSD ufs file systems: One with Samsung 860 Pro, the other with Samsung 850 Pro. I do not expect that I/O made much of a difference, but I did nothing to measure such for the buildworld activity. OverDrive RAM: 8GiByte, half in each of the 2 slots MACCHIATObin RAM: 16GiByte, all in its 1 slot. MACCHIATObin: jumpers set for the fastest CPU/RAM speed for the Double Shot. A comparison graph from exploring single threaded and multi-threaded CPU/cache and RAM limited performance (a variation on the old HINT serial and pthread benchmarks) is shown at: = https://github.com/markmi/acpphint/blob/master/acpphint_example_data/acpph= int-OverDrive_1000_MacchDblShot-threads_4-LP64-g%2B%2B_9_8.3_O3-libc%2B%2B= _libstdc%2B%2B-DSIZE_large_fast_types-RAM.gp There are curves for various involved types: double (d), unsigned long long (ull), unsigned long (ul), unsigned int (ui). The match for ull and ul for the context provides some evidence of the variability observed. (The OverDrive and MACCHIATObin were not benchmarked for the graph at the same version of head: -r352341 based vs. -r355027 based.) (I did not set things such that the benchmark run would explore paging getting involved. Thus there is basically no I/O considered in the comparison graph.) The MACCHIATObin clearly wins single threaded and its memory subsystem was well matched to the single threaded use when the same-invovled-types are compared. (Single threaded are the blueish curves, MACCHIATObin having the lighter colors.) For multi-threaded in the range where RAM access limits things, the two systems are a close match. (Greenish colors, right side of plot, upper curves.) The range were the OverDrive 1000 is clearly faster is part of the middle of the multi-threaded curves. (This might be tied to whatever is done with the dual RAM slot structure or to the amount of caching, or some such, I do not know the details.) I would expect "-j1 buildworld" would take less time on the MACCHIATObin than on the OverDrive, but I'm not planing on measuring that. A more historical comparison, old PowerMac11,2 (2 sockets, 2 cores each) vs. the MACCHIATObin, both having 16 GiBytes of RAM: For analogous benchmark graphs (matching types), the MACCHIATObin single threaded is faster than the old PowerMac11,2 single threaded and also is usually faster than that 11,2's multi-threaded benchmark data as well. Multi-threaded, the MACCHIATObin is faster for the exploration by the benchmark. = https://github.com/markmi/acpphint/blob/master/acpphint_example_data/acpph= int-MacchDblShot_PowerMac11%2C2-threads_4-LP64-g%2B%2B_9_O3-libc%2B%2B-DSI= ZE_large_fast_types-RAM.gp I expect that this is interesting for the likely difference in power usage during the benchmarking. (Not that I've measured the power usage.) (The FreeBSD head vintages are not the same in the graph: -r355027 based vs. -r352341 based.) =3D=3D=3D Mark Millard marklmi at yahoo.com ( dsl-only.net went away in early 2018-Mar)