TY - GEN
T1 - Mera
T2 - 42nd IEEE International Conference on Computer Design, ICCD 2024
AU - Song, Yuhong
AU - Sha, Edwin Hsing Mean
AU - Xu, Longshan
AU - Zhuge, Qingfeng
AU - Shao, Zili
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - With the development of quantum computing, quantum processor demonstrates the potential supremacy in specific applications, such as Grover's database search and popular quantum neural networks (QNNs). For better calibrating the quantum algorithms and machines, quantum circuit simulation on classical computers becomes crucial. However, as the number of quantum bits (qubits) increases, the memory requirement grows exponentially. In order to reduce memory usage and accelerate simulation, we propose a multi-level optimization, namely Mera, by exploring memory and computation redundancy. First, for a large number of sparse quantum gates, we propose two compressed structures for low-level full-state simulation. The corresponding gate operations are designed for practical implementations, which are relieved from the longtime compression and decompression. Second, for the dense Hadamard gate, which is definitely used to construct the superposition, we design a customized structure for significant memory saving as a regularity-oriented simulation. Meanwhile, an ondemand amplitude updating process is optimized for execution acceleration. Experiments show that our compressed structures increase the number of qubits from 17 to 35, and achieve up to 6.9 × acceleration for QNN.
AB - With the development of quantum computing, quantum processor demonstrates the potential supremacy in specific applications, such as Grover's database search and popular quantum neural networks (QNNs). For better calibrating the quantum algorithms and machines, quantum circuit simulation on classical computers becomes crucial. However, as the number of quantum bits (qubits) increases, the memory requirement grows exponentially. In order to reduce memory usage and accelerate simulation, we propose a multi-level optimization, namely Mera, by exploring memory and computation redundancy. First, for a large number of sparse quantum gates, we propose two compressed structures for low-level full-state simulation. The corresponding gate operations are designed for practical implementations, which are relieved from the longtime compression and decompression. Second, for the dense Hadamard gate, which is definitely used to construct the superposition, we design a customized structure for significant memory saving as a regularity-oriented simulation. Meanwhile, an ondemand amplitude updating process is optimized for execution acceleration. Experiments show that our compressed structures increase the number of qubits from 17 to 35, and achieve up to 6.9 × acceleration for QNN.
KW - Quantum circuit simulation
KW - acceleration
KW - algorithm optimization
KW - memory reduction
KW - redundancy exploration
UR - https://www.scopus.com/pages/publications/85217054525
U2 - 10.1109/ICCD63220.2024.00087
DO - 10.1109/ICCD63220.2024.00087
M3 - 会议稿件
AN - SCOPUS:85217054525
T3 - Proceedings - IEEE International Conference on Computer Design: VLSI in Computers and Processors
SP - 525
EP - 533
BT - Proceedings - 2024 IEEE 42nd International Conference on Computer Design, ICCD 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 18 November 2024 through 20 November 2024
ER -