TY - GEN
T1 - Hierarchical memory-constrained operator scheduling of neural architecture search networks
AU - Wang, Zihan
AU - Wan, Chengcheng
AU - Chen, Yuting
AU - Lin, Ziyi
AU - Jiang, He
AU - Qiao, Lei
N1 - Publisher Copyright:
© 2022 ACM.
PY - 2022/7/10
Y1 - 2022/7/10
N2 - Neural Architecture Search (NAS) is widely used in industry, searching for neural networks meeting task requirements. Meanwhile, it faces a challenge in scheduling networks satisfying memory constraints. This paper proposes HMCOS that performs hierarchical memory-constrained operator scheduling of NAS networks: given a network, HMCOS constructs a hierarchical computation graph and employs an iterative scheduling algorithm to progressively reduce peak memory footprints. We evaluate HMCOS against RPO and Serenity (two popular scheduling techniques). The results show that HMCOS outperforms existing techniques in supporting more NAS networks, reducing 8.7∼42.4% of peak memory footprints, and achieving 137 - 283x of speedups in scheduling.
AB - Neural Architecture Search (NAS) is widely used in industry, searching for neural networks meeting task requirements. Meanwhile, it faces a challenge in scheduling networks satisfying memory constraints. This paper proposes HMCOS that performs hierarchical memory-constrained operator scheduling of NAS networks: given a network, HMCOS constructs a hierarchical computation graph and employs an iterative scheduling algorithm to progressively reduce peak memory footprints. We evaluate HMCOS against RPO and Serenity (two popular scheduling techniques). The results show that HMCOS outperforms existing techniques in supporting more NAS networks, reducing 8.7∼42.4% of peak memory footprints, and achieving 137 - 283x of speedups in scheduling.
UR - https://www.scopus.com/pages/publications/85137537213
U2 - 10.1145/3489517.3530472
DO - 10.1145/3489517.3530472
M3 - 会议稿件
AN - SCOPUS:85137537213
T3 - Proceedings - Design Automation Conference
SP - 493
EP - 498
BT - Proceedings of the 59th ACM/IEEE Design Automation Conference, DAC 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 59th ACM/IEEE Design Automation Conference, DAC 2022
Y2 - 10 July 2022 through 14 July 2022
ER -