
Automatic Hyper-Parameter Search for Vision Transformer Pruning

  • Jun Feng
  • Shuai Zhao
  • Liangying Peng
  • Sichen Pan
  • Hao Chen
  • Zhongxu Li
  • Gongwu Ke
  • Gaoli Wang
  • Youqun Long
  • Information and Telecommunication Branch
  • East China Normal University

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

In recent years, the high computational cost of the popular Vision Transformer (ViT) has made it difficult to deploy on lightweight devices. As a result, many pruning techniques have been developed to reduce the size and complexity of ViT models. However, most of these techniques prune the model as a whole and apply a uniform pruning ratio to all modules, without considering the differences among them. We observe that using different pruning ratios for the Multi-Head Self-Attention (MHSA) and Feed-Forward Network (FFN) modules can improve compression performance for ViT. Based on this observation, we propose a new compression algorithm that applies a distinct pruning ratio to each of these modules and automatically searches for the optimal pruning ratio parameters. To further enhance the precision of this algorithm, we introduce an improved approach that employs iterative pruning and binary search to identify the optimal parameters at a finer granularity, thereby minimizing the model's accuracy loss during pruning. We evaluated our approach on two commonly used datasets, CIFAR-10 and Mini-ImageNet, and compared it with the state-of-the-art (SOTA) method CP-ViT, which uses a fixed pruning ratio. At nearly the same pruned-model accuracy, our method achieved a significant reduction in FLOPs, requiring 56.91% of the FLOPs of the fixed-pruning-ratio method on CIFAR-10. These results demonstrate that our method can be more effective in reducing model complexity while maintaining accuracy.
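
The abstract mentions a binary search over pruning ratios but does not spell out the procedure. The following is a minimal sketch of that general idea only, not the authors' algorithm: it assumes accuracy does not increase as the pruning ratio grows, and it searches for the largest ratio that keeps accuracy above a chosen floor. The function `search_pruning_ratio`, the accuracy floor, and the two toy accuracy curves are hypothetical stand-ins for pruning and evaluating a real ViT.

```python
from typing import Callable


def search_pruning_ratio(
    accuracy_at: Callable[[float], float],
    min_accuracy: float,
    lo: float = 0.0,
    hi: float = 1.0,
    tol: float = 1e-3,
) -> float:
    """Return (to within tol) the largest ratio with accuracy >= min_accuracy.

    Assumption: accuracy_at is non-increasing in the pruning ratio.
    """
    if accuracy_at(lo) < min_accuracy:
        raise ValueError("even the smallest ratio violates the accuracy floor")
    if accuracy_at(hi) >= min_accuracy:
        return hi
    # Invariant: accuracy_at(lo) >= min_accuracy > accuracy_at(hi).
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if accuracy_at(mid) >= min_accuracy:
            lo = mid  # mid is still acceptable, try pruning more
        else:
            hi = mid  # mid prunes too much, back off
    return lo


# Hypothetical accuracy curves: in practice each call would prune one
# module type at the given ratio and evaluate the resulting model.
def toy_mhsa_accuracy(ratio: float) -> float:
    return 0.90 - 0.10 * ratio


def toy_ffn_accuracy(ratio: float) -> float:
    return 0.90 - 0.25 * ratio


if __name__ == "__main__":
    floor = 0.85  # hypothetical accuracy floor
    print("MHSA ratio:", round(search_pruning_ratio(toy_mhsa_accuracy, floor), 3))
    print("FFN ratio:", round(search_pruning_ratio(toy_ffn_accuracy, floor), 3))
```

With these toy curves the two module types end up with different ratios, which is the situation the abstract describes. The sketch searches each module in isolation; how the paper combines the two ratios and interleaves the search with iterative pruning is not stated in the abstract.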

Original language: English
Title of host publication: 2023 IEEE 6th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2023
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 606-611
Number of pages: 6
ISBN (Electronic): 9798350325485
DOI
Publication status: Published - 2023
Event: 6th IEEE International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2023 - Haikou, China
Duration: 18 Aug 2023 to 20 Aug 2023

Publication series

Name: 2023 IEEE 6th International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2023

Conference

Conference: 6th IEEE International Conference on Pattern Recognition and Artificial Intelligence, PRAI 2023
Country/Territory: China
City: Haikou
Period: 18/08/23 to 20/08/23
