TY - GEN
T1 - Understanding the Dynamic Caches on Intel Processors
T2 - 12th IEEE International Conference on Embedded and Ubiquitous Computing, EUC 2014
AU - Zhang, Yi
AU - Guan, Nan
AU - Yi, Wang
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2014/11/18
Y1 - 2014/11/18
N2 - The design and implementation of caches on a given platform has significant impacts to many areas in computer system design. On chip-multiprocessors (CMP), new cache architectures are proposed to meet the rapidly increasing performance requirements. However, the cache architectures are usually not well-documented for commercial processors. This raises difficulties for people to precisely understand the working principle of many components of the processors, not only the cache itself, but also the related components like the whole memory subsystem. This paper aims at disclosing the working principle of the last level cache of Intel Ivy Bridge processors. First, we identify the address translation logic on this cache. Second, we disclose the replacement policy of the cache. This is a dynamic insertion replacement policy, which is very different from the widely used LRU policy and its variants. Although this replacement policy has been proposed in academic literatures, our work is the first one showing it is actually used in commercial processors. To show the significance of our discovery, we design a methodology to generate controllable cache miss sequences under this new cache, and apply it to the design of a benchmark to model the memory concurrency. Evaluations on physical machines are conducted to show the effectiveness of the proposed method.
AB - The design and implementation of caches on a given platform has significant impacts to many areas in computer system design. On chip-multiprocessors (CMP), new cache architectures are proposed to meet the rapidly increasing performance requirements. However, the cache architectures are usually not well-documented for commercial processors. This raises difficulties for people to precisely understand the working principle of many components of the processors, not only the cache itself, but also the related components like the whole memory subsystem. This paper aims at disclosing the working principle of the last level cache of Intel Ivy Bridge processors. First, we identify the address translation logic on this cache. Second, we disclose the replacement policy of the cache. This is a dynamic insertion replacement policy, which is very different from the widely used LRU policy and its variants. Although this replacement policy has been proposed in academic literatures, our work is the first one showing it is actually used in commercial processors. To show the significance of our discovery, we design a methodology to generate controllable cache miss sequences under this new cache, and apply it to the design of a benchmark to model the memory concurrency. Evaluations on physical machines are conducted to show the effectiveness of the proposed method.
KW - Cache Replacement Policy
KW - Dynamic Cache
KW - Profiling
UR - https://www.scopus.com/pages/publications/84917673424
U2 - 10.1109/EUC.2014.18
DO - 10.1109/EUC.2014.18
M3 - 会议稿件
AN - SCOPUS:84917673424
T3 - Proceedings - 2014 International Conference on Embedded and Ubiquitous Computing, EUC 2014
SP - 58
EP - 64
BT - Proceedings - 2014 International Conference on Embedded and Ubiquitous Computing, EUC 2014
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 26 August 2014 through 28 August 2014
ER -