跳到主要导航 跳到搜索 跳到主要内容

OPASS: Orchestrating TVM's Passes for Lowering Memory Footprints of Computation Graphs

  • Pengbo Nie
  • , Zihan Wang
  • , Chengcheng Wan
  • , Ziyi Lin
  • , He Jiang
  • , Jianjun Zhao
  • , Yuting Chen*
  • *此作品的通讯作者
  • Shanghai Jiao Tong University
  • Alibaba Group Holding Ltd.
  • Dalian University of Technology
  • Kyushu University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Deep learning (DL) compilers, such as TVM and TensorFlow, encompass a variety of passes for optimizing computation graphs (i.e., DL models). Despite the efforts on developing optimization passes, it remains a challenge in arranging these passes - most compilers employ fixed pass sequences that do not fit with computation graphs of diverse structures; on the other hand, optimization passes have cascade effects, making the structures of graphs under compilation volatile and as well making it difficult to generate optimal sequences for graphs. Inspired by recent progresses on static computing memory footprints (i.e., memory usages) of computation graphs, we introduce in this paper OPASS, a novel approach to orchestrating TVM's optimization passes for lowering memory footprints of computation graphs, and finally allowing the graphs to run on memory-constrained devices. The key idea is, given a computation graph G, to optimize the graph heuristically and iteratively: OPASS learns the effects of passes on the graph; it then optimizes G iteratively - each iteration picks up a pass by the reduction of the memory footprint of G and as well the implicit effects of the pass for further optimizations, letting the pass be applied. We evaluate OPASS on Rebench (a suite of computation graphs) and two real-world models (Transformer and ResNet). The results clearly show the strength of OPASS: it outperforms TVM's default sequence by 1.77× in reducing graphs' memory footprints, with affordable costs; it also offers extra memory reductions of 5∼ 12% by catching the implicit effects of passes. Furthermore, OPASS helps analyze positive/negative effects of passes to graphs' memory footprints, providing TVM developers with best practices for designing optimization pass sequences.

源语言英语
主期刊名Proceedings - 2024 IEEE International Conference on Software Maintenance and Evolution, ICSME 2024
出版商Institute of Electrical and Electronics Engineers Inc.
175-186
页数12
ISBN(电子版)9798350395686
DOI
出版状态已出版 - 2024
活动40th IEEE International Conference on Software Maintenance and Evolution, ICSME 2024 - Flagstaff, 美国
期限: 6 10月 202411 10月 2024

出版系列

姓名Proceedings - 2024 IEEE International Conference on Software Maintenance and Evolution, ICSME 2024

会议

会议40th IEEE International Conference on Software Maintenance and Evolution, ICSME 2024
国家/地区美国
Flagstaff
时期6/10/2411/10/24

指纹

探究 'OPASS: Orchestrating TVM's Passes for Lowering Memory Footprints of Computation Graphs' 的科研主题。它们共同构成独一无二的指纹。

引用此