跳到主要导航 跳到搜索 跳到主要内容

Towards convolutional neural networks compression via global error reconstruction

  • Shaohui Lin
  • , Rongrong Ji*
  • , Xiaowei Guo
  • , Xuelong Li
  • *此作品的通讯作者

科研成果: 期刊稿件会议文章同行评审

摘要

In recent years, convolutional neural networks (CNNs) have achieved remarkable success in various applications such as image classification, object detection, object parsing and face alignment. Such CNN models are extremely powerful to deal with massive amounts of training data by using millions and billions of parameters. However, these models are typically deficient due to the heavy cost in model storage, which prohibits their usage on resource-limited applications like mobile or embedded devices. In this paper, we target at compressing CNN models to an extreme without significantly losing their discriminability. Our main idea is to explicitly model the output reconstruction error between the original and compressed CNNs, which error is minimized to pursuit a satisfactory rate-distortion after compression. In particular, a global error reconstruction method termed GER is presented, which firstly leverages an SVD-based low-rank approximation to coarsely compress the parameters in the fully connected layers in a layerwise manner. Subsequently, such layer-wise initial compressions are jointly optimized in a global perspective via back-propagation. The proposed GER method is evaluated on the ILSVRC2012 image classification benchmark, with implementations on two widely-adopted convolutional neural networks, i.e., the AlexNet and VGGNet-19. Comparing to several state-of-the-art and alternative methods of CNN compression, the proposed scheme has demonstrated the best rate-distortion performance on both networks.

源语言英语
页(从-至)1753-1759
页数7
期刊IJCAI International Joint Conference on Artificial Intelligence
2016-January
出版状态已出版 - 2016
已对外发布
活动25th International Joint Conference on Artificial Intelligence, IJCAI 2016 - New York, 美国
期限: 9 7月 201615 7月 2016

指纹

探究 'Towards convolutional neural networks compression via global error reconstruction' 的科研主题。它们共同构成独一无二的指纹。

引用此