From owner-freebsd-arm@freebsd.org Mon Dec 2 23:07:17 2019 Return-Path: Delivered-To: freebsd-arm@mailman.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mailman.nyi.freebsd.org (Postfix) with ESMTP id 129001BB9C9 for ; Mon, 2 Dec 2019 23:07:17 +0000 (UTC) (envelope-from marklmi@yahoo.com) Received: from sonic303-25.consmr.mail.gq1.yahoo.com (sonic303-25.consmr.mail.gq1.yahoo.com [98.137.64.206]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 47Rgjw0MxRz4X43 for ; Mon, 2 Dec 2019 23:07:15 +0000 (UTC) (envelope-from marklmi@yahoo.com) X-YMail-OSG: 1ZBRW_4VM1l5HNZSLoMZQgvhCTBIcw3wcry90h7BxQZyKtvz.CObVodQsJoUT1_ ZXD6h761v.Z1dldDCoGo0hTBDgcF.PudFUOvM1pRBi4gU4RuhjyReB9rooxJ3_feD_NyLWHdSXxj _m.YBPUFwNmGYIYJYRuxwNhTegfZTQUzOdhRHvtdCWYLcR1VzUM0yZv6ggpuv3Mnw6afWblPN6Tm ZZyQlw84XBcHn.0rirUHtQ0OV6kbrJz4gzsQ3vMhf2wfXJ4DgSMliKwqxCWawnK5CrXm2vgqCfnk Leht5gfk4s8zZ5Nbwch8YAXKTMShNB.OLyFF14L1FBKUuJv8A1J1npfFqdphlnTMvYSmnHlgtwcw Gl1n4ekcPbSh_A5fo7Y3rtwFCrlAKAlgGUdURqrfgbx2j1fmpFoRZ_6YBB5l1YLhZ6WNoWFXnJF3 fMipIyDzoI7rXTY_kz9WQCZFEH6.8FNr8iSd2ngSq1y.JAct4K.ALVmj5_Tv2VPxMIXZ9IEb9J3o 9xe9xvOujdQC8hkSKU1KWNTvkeEdZKBs2xrS7mzSTMV3I68BOJi_ATWGIMafrZNlRgQlDpuKQ2vs UonenhSwpuUQCdtAuuq7BmAqJwcScaZces.D0xD4abkWi8ZkZmms4l_AXZx6DsQWZGCwtzuArDdT NGmD773aCIp4sZ9Hq.aECqikIZ4w4.dqJHGVg4RQEIQOdapwzp47xewLKpwj8xcCUoO1NNYlzt7B Q72694uFT9bcBVy9_h4OL6kKpvBtcTgFwM_hbDb9pBgY7aS2bT9svspaVK_ZXS1oBlB8vlGNb3nF w7kS50mpovlOE534IlNjsi_pa4x6pDM.AiVCiaWVrLPtOSCYUhEQ9a7bWh9BPjTe78cpnSgKZ3EX TFnE31rJcrqQ5VnGzU1tAZyZ1GPwS.UYLIcEL46iPGRPht8BXXJuGq3zmYf.L0LtB3gxpzzODOnL ms99MsbYmu4gIhZPkDuHsyh6OJR0QkikVf.SMEf6ma2hiBLTP501Y_tVqyJ43oerlGI.Y1SNTEmn N5TCwWc67KRoEf.goQ8ZhPYg8aO32zZOFCJ39qzNAWWCZ_AR7SsJyq5fkVQrUf7EM.UkO9zTL.tU ZHZfcPV2s6rznNZv7v2pWbID.cd5n7H4ZDeQ18jooTooxqkFHpITXl4MgiDGfzgXv5JwJcqfgTTU A87J7UEvyL3zk3nc7BF5b0gj.EQUv04oHPkxrGzThB.tHMAx2XEDwXXv8zsNGYhEglzcfBCKt3kh 2Ykp6nDGo805yw_psLL5ZiUE3vq7jBZmgC2yOYFDZ6m5eRv9vFaJq3kZeM1gYI2zvbFFzwVr1PYg BuiIHCZraktgyem..bH0FH4aBz8KHanjAYvxcUyEi735EuQQq2c7l62Dvh6dcKZVyJAjKf8S5oYy kiCga4ss_RiBAIlVgXMgV Received: from sonic.gate.mail.ne1.yahoo.com by sonic303.consmr.mail.gq1.yahoo.com with HTTP; Mon, 2 Dec 2019 23:07:14 +0000 Received: by smtp416.mail.gq1.yahoo.com (Oath Hermes SMTP Server) with ESMTPA ID ba5e6e1d90896c429b3283c62bd8124d; Mon, 02 Dec 2019 23:07:00 +0000 (UTC) From: Mark Millard Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Mime-Version: 1.0 (Mac OS X Mail 13.0 \(3601.0.10\)) Subject: Re: Comparing the OverDrive 1000 (A57) vs. MACCHIATObin Double Shot (A72) for buildworld and via a CPU/cache/RAM tradeoff-exploring benchmark (links corrected, again) Date: Mon, 2 Dec 2019 15:07:00 -0800 References: <92E7B63A-E790-4815-9D91-2161A4F66B71.ref@yahoo.com> <92E7B63A-E790-4815-9D91-2161A4F66B71@yahoo.com> <5F7E7618-A503-4D16-B83C-0379F4B6327F@yahoo.com> <63787F5A-A3B7-434A-B594-999D95559BEE@yahoo.com> To: freebsd-arm@freebsd.org In-Reply-To: <63787F5A-A3B7-434A-B594-999D95559BEE@yahoo.com> Message-Id: <8E3A0E01-F22D-4635-A8CF-CDB98CFF9794@yahoo.com> X-Mailer: Apple Mail (2.3601.0.10) X-Rspamd-Queue-Id: 47Rgjw0MxRz4X43 X-Spamd-Bar: / X-Spamd-Result: default: False [-0.62 / 15.00]; ARC_NA(0.00)[]; NEURAL_HAM_MEDIUM(-0.05)[-0.050,0]; R_DKIM_ALLOW(-0.20)[yahoo.com:s=s2048]; FROM_HAS_DN(0.00)[]; R_SPF_ALLOW(-0.20)[+ptr:yahoo.com]; FREEMAIL_FROM(0.00)[yahoo.com]; MIME_GOOD(-0.10)[text/plain]; TO_DN_NONE(0.00)[]; NEURAL_HAM_LONG(-0.07)[-0.070,0]; RCVD_COUNT_TWO(0.00)[2]; RCPT_COUNT_ONE(0.00)[1]; IP_SCORE_FREEMAIL(0.00)[]; RCVD_TLS_LAST(0.00)[]; TO_MATCH_ENVRCPT_ALL(0.00)[]; DKIM_TRACE(0.00)[yahoo.com:+]; DMARC_POLICY_ALLOW(-0.50)[yahoo.com,reject]; RCVD_IN_DNSWL_NONE(0.00)[206.64.137.98.list.dnswl.org : 127.0.5.0]; MV_CASE(0.50)[]; IP_SCORE(0.00)[ip: (3.42), ipnet: 98.137.64.0/21(0.88), asn: 36647(0.70), country: US(-0.05)]; FROM_EQ_ENVFROM(0.00)[]; MIME_TRACE(0.00)[0:+]; FREEMAIL_ENVFROM(0.00)[yahoo.com]; ASN(0.00)[asn:36647, ipnet:98.137.64.0/21, country:US]; MID_RHS_MATCH_FROM(0.00)[]; DWL_DNSWL_NONE(0.00)[yahoo.com.dwl.dnswl.org : 127.0.5.0] X-BeenThere: freebsd-arm@freebsd.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: "Porting FreeBSD to ARM processors." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 02 Dec 2019 23:07:17 -0000 [May be this time I'll get working links in place . . .] On 2019-Dec-2, at 14:56, Mark Millard wrote: > [Just correcting the links to be to .png files > and correcting some PowerMac11,2 related wording.] >=20 > On 2019-Dec-2, at 14:15, Mark Millard wrote: >=20 >> It looks like the OverDrive 1000 vs. MACCHIATObin Double >> Shot comparison ends up being an example of memory >> access making the difference for the specific workload: >> -j4 buildworld for head -r355027 (building itself >> from scratch). >>=20 >> buildworld times (not needing a llvm bootstrap build): >>=20 >> OverDrive 1000: 13895 sec (about 3.86 hrs) >> MACCHIATObin Double Shot: 16561 sec (about 4.60 hrs) >>=20 >> So a little under 45 min difference when the mean >> and geometric mean are both a little over 4.2 hrs. >>=20 >> SSD ufs file systems: One with Samsung 860 Pro, the >> other with Samsung 850 Pro. I do not expect that I/O >> made much of a difference, but I did nothing to measure >> such for the buildworld activity. >>=20 >> OverDrive RAM: 8GiByte, half in each of the 2 slots >> MACCHIATObin RAM: 16GiByte, all in its 1 slot. >>=20 >> MACCHIATObin: jumpers set for the fastest CPU/RAM >> speed for the Double Shot. >>=20 >> A comparison graph from exploring single threaded >> and multi-threaded CPU/cache and RAM limited >> performance (a variation on the old HINT serial >> and pthread benchmarks) is shown at: Corrected link (2nd try): = https://github.com/markmi/acpphint/blob/master/acpphint_example_data/acpph= int-OverDrive_1000_MacchDblShot-threads_4-LP64-g%2B%2B_9_O3-libc%2B%2B-DSI= ZE_large_fast_types-RAM.png >> There are curves for various involved types: >> double (d), unsigned long long (ull), unsigned >> long (ul), unsigned int (ui). The match for >> ull and ul for the context provides some >> evidence of the variability observed. >>=20 >> (The OverDrive and MACCHIATObin were not benchmarked >> for the graph at the same version of head: -r352341 >> based vs. -r355027 based.) >>=20 >> (I did not set things such that the benchmark run >> would explore paging getting involved. Thus there >> is basically no I/O considered in the comparison >> graph.) >>=20 >> The MACCHIATObin clearly wins single threaded and >> its memory subsystem was well matched to the single >> threaded use when the same-invovled-types are >> compared. (Single threaded are the blueish curves, >> MACCHIATObin having the lighter colors.) >>=20 >> For multi-threaded in the range where RAM access >> limits things, the two systems are a close match. >> (Greenish colors, right side of plot, upper >> curves.) >>=20 >> The range were the OverDrive 1000 is clearly faster >> is part of the middle of the multi-threaded curves. >> (This might be tied to whatever is done with the >> dual RAM slot structure or to the amount of caching, >> or some such, I do not know the details.) >>=20 >> I would expect "-j1 buildworld" would take less time >> on the MACCHIATObin than on the OverDrive, but I'm >> not planing on measuring that. >>=20 >>=20 >>=20 >> A more historical comparison, old PowerMac11,2 >> (2 sockets, 2 cores each) vs. the MACCHIATObin, >> both having 16 GiBytes of RAM: >>=20 >> For analogous benchmark graphs (matching types), >> the MACCHIATObin single threaded is faster than >> the old PowerMac11,2 single threaded and also is >> usually faster than that 11,2's multi-threaded >> benchmark data as well. >=20 > I should have pointed out that the MACCHIATObin > single threaded and PowerMac11,2 multi-threaded > results are similar where memory access limits > things, with use of double (d) being a little > slower on the MACCHIATObin in this region. >=20 >> Multi-threaded, the >> MACCHIATObin is faster for the exploration by >> the benchmark. >=20 Corrected link (2nd try): = https://github.com/markmi/acpphint/blob/master/acpphint_example_data/acpph= int-MacchDblShot_PowerMac11%2C2-threads_4-LP64-g%2B%2B_9_O3-libc%2B%2B-DSI= ZE_large_fast_types-RAM.png >> I expect that this is interesting for the likely >> difference in power usage during the benchmarking. >> (Not that I've measured the power usage.) >>=20 >> (The FreeBSD head vintages are not the same in >> the graph: -r355027 based vs. -r352341 based.) >>=20 =3D=3D=3D Mark Millard marklmi at yahoo.com ( dsl-only.net went away in early 2018-Mar)