TY - GEN
T1 - Redesigning Data and Metadata Updates in PM File Systems with Persistent CPU Caches
AU - Chen, Congyong
AU - Zheng, Shengan
AU - Zhang, Yuhang
AU - Huang, Linpeng
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
PY - 2024
Y1 - 2024
N2 - The recent advent of persistent memory has revolutionized file system design by enabling efficient, fast, and durable data updates. However, to cope with mismatched access granularities and volatile CPU caches, existing file systems resort to costly data and metadata update approaches, leading to high write-back latency and excessive PM bandwidth consumption. Recent cache persistence techniques, such as Intel’s eADR, address this issue by automatically flushing data from CPU caches to PM during power failures, making CPU caches effectively persistent. Persistent CPU caches provide both cacheline access granularity and data durability, acting as a new storage medium that can greatly reduce the latency and throughput overhead of data persistence. We present FusionFS, a file system that leverages persistent CPU caches to redesign data and metadata update approaches. FusionFS employs an adaptive data update approach that chooses the most effective mechanism based on file access patterns, minimizing PM bandwidth consumption and update latency. FusionFS also adopts an aggregated metadata update approach that consolidates small entries in persistent log buffers before appending them to PM logs, minimizing small random writes to PM. Experimental results show that FusionFS outperforms existing PM file systems in terms of latency and throughput in various scenarios.
AB - The recent advent of persistent memory has revolutionized file system design by enabling efficient, fast, and durable data updates. However, to cope with mismatched access granularities and volatile CPU caches, existing file systems resort to costly data and metadata update approaches, leading to high write-back latency and excessive PM bandwidth consumption. Recent cache persistence techniques, such as Intel’s eADR, address this issue by automatically flushing data from CPU caches to PM during power failures, making CPU caches effectively persistent. Persistent CPU caches provide both cacheline access granularity and data durability, acting as a new storage medium that can greatly reduce the latency and throughput overhead of data persistence. We present FusionFS, a file system that leverages persistent CPU caches to redesign data and metadata update approaches. FusionFS employs an adaptive data update approach that chooses the most effective mechanism based on file access patterns, minimizing PM bandwidth consumption and update latency. FusionFS also adopts an aggregated metadata update approach that consolidates small entries in persistent log buffers before appending them to PM logs, minimizing small random writes to PM. Experimental results show that FusionFS outperforms existing PM file systems in terms of latency and throughput in various scenarios.
KW - CPU cache
KW - File system
KW - Persistent memory
UR - https://www.scopus.com/pages/publications/85203580932
U2 - 10.1007/978-981-97-5572-1_33
DO - 10.1007/978-981-97-5572-1_33
M3 - 会议稿件
AN - SCOPUS:85203580932
SN - 9789819755714
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 453
EP - 462
BT - Database Systems for Advanced Applications - 29th International Conference, DASFAA 2024, Proceedings
A2 - Onizuka, Makoto
A2 - Lee, Jae-Gil
A2 - Tong, Yongxin
A2 - Xiao, Chuan
A2 - Ishikawa, Yoshiharu
A2 - Lu, Kejing
A2 - Amer-Yahia, Sihem
A2 - Jagadish, H.V.
PB - Springer Science and Business Media Deutschland GmbH
T2 - 29th International Conference on Database Systems for Advanced Applications, DASFAA 2024
Y2 - 2 July 2024 through 5 July 2024
ER -