跳到主要导航 跳到搜索 跳到主要内容

Toward the Predictability of Dynamic Real-Time DNN Inference

  • Weiguang Pang
  • , Xu Jiang*
  • , Mingsong Lv
  • , Teng Gao
  • , Di Liu
  • , Wang Yi
  • *此作品的通讯作者
  • Key Laboratory of Medical Image Computing (Northeastern University)
  • Hong Kong Polytechnic University
  • Yunnan University
  • Uppsala University

科研成果: 期刊稿件文章同行评审

摘要

Deep neural networks (DNNs) have been widely used in many cyber-physical systems (CPSs). However, it is still a challenging work to deploy DNNs in real-time systems. In particular, the execution time of DNN inference must be predictable, s.t. it could be known whether the runtime inference can complete within a required timing constraint. Moreover, the timing constraints may change dynamically with the runtime environment in many embedded applications, such as autonomous cars. A possible way to meet such dynamic real-time requirements is to execute different subnetworks of a DNN at runtime. However, improper construction of subnetworks may not only introduce unpredictable inference time, s.t. the real-timing constraints could be violated unexpectedly, but also has poor compatibility with the well-optimized machine learning framework (e.g., TensorFlow). In this article, we study the predictability when executing different subnetworks of a DNN. In particular, we present a featurewise runtime adaptation framework for DNN inference, which is implemented and validated on NVIDIA Jetson TX2 and Nano with TensorFlow. The experimental results show that our method can achieve predictable inference time in comparison with the state-of-the-art methods.

源语言英语
页(从-至)2849-2862
页数14
期刊IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
41
9
DOI
出版状态已出版 - 1 9月 2022
已对外发布

指纹

探究 'Toward the Predictability of Dynamic Real-Time DNN Inference' 的科研主题。它们共同构成独一无二的指纹。

引用此