跳到主要导航 跳到搜索 跳到主要内容

Estimation-Based Strategy Generation for Deep Neural Network Model Compression

  • Hongkai Wang
  • , Jun Feng
  • , Shuai Zhao
  • , Yidan Wang
  • , Dong Mao
  • , Zuge Chen
  • , Gongwu Ke
  • , Gaoli Wang
  • , Youqun Long
  • Information and Telecommunication Branch
  • East China Normal University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Compressing the neural network can significantly reduce its computational complexity, save resources and speed up inference time. However, current compression methods, whether used individually or in combination, often neglect the issue of compression strategy generation, making it challenging to obtain compressed models with the smallest accuracy degradation that meet the user's deployment requirements. This paper proposes a method for automatically generating compression strategy, aiming to achieve high-performance models that meet deployment requirements with minimal accuracy degradation. Firstly, we design a predictor to estimate the compression performance of the model if it is compressed by different compression methods such as distillation, pruning and quantization. This includes estimating the model size, the number of parameters, computational complexity, and memory access of the model after compression. Then a computational method for estimating the inference time of the model after compression is discussed. Based on the estimated results, user requirements and hardware parameters, a method for automatically generating compression strategy is designed, which outputs suitable combinations of compression methods and compression parameter settings. Experimental results on commonly used convolutional neural networks and Jetson Nano development board validated the effectiveness of the proposed method.

源语言英语
主期刊名2023 IEEE 6th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2023
出版商Institute of Electrical and Electronics Engineers Inc.
1009-1015
页数7
ISBN(电子版)9798350325485
DOI
出版状态已出版 - 2023
活动6th IEEE International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2023 - Haikou, 中国
期限: 18 8月 202320 8月 2023

出版系列

姓名2023 IEEE 6th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2023

会议

会议6th IEEE International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2023
国家/地区中国
Haikou
时期18/08/2320/08/23

指纹

探究 'Estimation-Based Strategy Generation for Deep Neural Network Model Compression' 的科研主题。它们共同构成独一无二的指纹。

引用此