![]() ![]() Over the last few years Arm has put a lot of effort into improving open-source tools and compilers, such as GCC. The figures weren’t run actual silicon but rather estimated on Arm’s server farm in an emulation environment with RTL.Īrm made a big note that among the many efforts to improve performance for the Arm ecosystem isn’t only offering better hardware, but also better software. The figures here are achieved in a quite low whole-server TDP of only 105W. ![]() Overall, Arm’s comparison to the A72 makes sense in the context that this is to its predecessor, however we have to keep in mind that the Cortex A72 was a core that was first introduced back in 2015 with first silicon products being released late that year as well as 2016, while the new Neoverse N1 in all likelihood isn’t something which we’ll be seeing in products for another 12-18 months, resulting in a ~3-4 year time span between the two products.Īrm did also divulge absolute SPEC numbers, and here we can have some more interesting analysis to competing platforms:įor a Neoverse N1 64-core hyperscale reference design running at about 2.6GHz, Arm proclaims single-threaded SPECint2006 Speed score of ~37 while reaching an estimated multi-threaded scores of 1310. Naturally, the N1’s ARMv8.2 ISA implementation also means that it supports 8-bit dot product as well as FP16 half-precision instructions which are particularly well fitted for machine-learning workloads, achieving performance boosts near 5x the predecessor platform. The data-points represent modelled and emulated performance estimates, the actual real-life performance improvements will higher due other SoC-level improvements as well as software improvements that aren’t available in existing actual A72 silicon products.Īrm again iterates the very large compute performance improvement compared to existing solutions, achieving beyond 2x performance boosts in vector workloads. ![]() The floating point benchmarks are even more impressive with gains ranging from 100 to 120%. In integer workloads we see PPC (performance per clock) and absolute performance gains from 60 to 70%. The figures here represent single-threaded performance in SPEC. The comparison to the A72 at the same frequency as well as a similarly configured system with SLC configuration, the new N1 outright smashes its predecessor platform / microarchitecture. In the context of today’s announcement, most performance figures disclosed by Arm were relative improvements compared to the A72 Cosmos platform, which might not be the most relevant data-point in terms of trying to actually place the N1 in the competitive landscape, however we also have some more concrete absolute figures we’ll try to put some more context behind shortly. Naturally all this talk about performance and efficiency needs to substantiated with some concrete numbers. Performance Targets: What Are The Numbers? ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |