TY - JOUR
T1 - Fragmentation Harmonization for the Arm Ecosystem
T2 - A Unified Method to Measure Memory Bandwidth via Network-on-Chip
AU - Liu, Tong Yu
AU - Cheng, Huanlun
AU - Erqi, E.
AU - Li, Ning
AU - Liao, Haoyu
AU - Huang, Bo
AU - Guo, Jianmei
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/12/16
Y1 - 2025/12/16
N2 - Arm-based platforms have significant hardware fragmentation issues due to the nature of Intellectual Property (IP) licensing. It is common for hardware vendors to customize memory controllers. Conventional memory bandwidth measurements based on memory controllers’ performance monitoring units (PMUs) face numerous obstacles, including the lack of kernel driver support and official documentation. To this end, we lift the perspective to the more general and upper level, i.e., the Network-on-Chip (NoC) level, instead of relying on various customized memory controllers. We propose a unified method for measuring memory bandwidth based on NoC traffic monitoring. Through a purely PMU-data-driven detection, our method can automatically reveal the hidden physical locations of memory controllers. By monitoring the upstream and downstream data traffic of the ports where memory controllers are connected on the NoC, our method can achieve accurate read and write memory bandwidth measurements without accessing memory controller PMUs, thus achieving fragmentation harmonization. We summarize all memory bandwidth methods currently applicable to Arm-based platforms and conduct a systematical evaluation. The experimental results show that our proposed method has an average error of 1.05% compared to the oracle answer, which can serve as a generic method to measure memory bandwidth for Arm-based SoCs with vendor-customized memory controllers.
AB - Arm-based platforms have significant hardware fragmentation issues due to the nature of Intellectual Property (IP) licensing. It is common for hardware vendors to customize memory controllers. Conventional memory bandwidth measurements based on memory controllers’ performance monitoring units (PMUs) face numerous obstacles, including the lack of kernel driver support and official documentation. To this end, we lift the perspective to the more general and upper level, i.e., the Network-on-Chip (NoC) level, instead of relying on various customized memory controllers. We propose a unified method for measuring memory bandwidth based on NoC traffic monitoring. Through a purely PMU-data-driven detection, our method can automatically reveal the hidden physical locations of memory controllers. By monitoring the upstream and downstream data traffic of the ports where memory controllers are connected on the NoC, our method can achieve accurate read and write memory bandwidth measurements without accessing memory controller PMUs, thus achieving fragmentation harmonization. We summarize all memory bandwidth methods currently applicable to Arm-based platforms and conduct a systematical evaluation. The experimental results show that our proposed method has an average error of 1.05% compared to the oracle answer, which can serve as a generic method to measure memory bandwidth for Arm-based SoCs with vendor-customized memory controllers.
KW - Fragmentation harmonization
KW - hardware performance counter
KW - memory bandwidth
KW - network-on-chip
KW - performance measurement
UR - https://www.scopus.com/pages/publications/105026006360
U2 - 10.1145/3772287
DO - 10.1145/3772287
M3 - 文章
AN - SCOPUS:105026006360
SN - 1544-3566
VL - 22
JO - ACM Transactions on Architecture and Code Optimization
JF - ACM Transactions on Architecture and Code Optimization
IS - 4
M1 - 144
ER -