跳到主要导航 跳到搜索 跳到主要内容

Heterogeneous FPGA-based cost-optimal design for timing-constrained CNNs

  • Chongqing University
  • University of Pittsburgh
  • University of California at Irvine

科研成果: 期刊稿件文章同行评审

摘要

Field programmable gate array (FPGA) has been one of the most popular platforms to implement convolutional neural networks (CNNs) due to its high performance and cost efficiency; however, limited by the on-chip resources, the existing single-FPGA architectures cannot fully exploit the parallelism in CNNs. In this paper, we explore heterogeneous FPGA-based designs to effectively leverage both task and data parallelism, such that the resultant system can achieve the minimum cost while satisfying timing constraints. In order to maximize the task parallelism, we investigate two critical problems: 1) buffer placement, where to place buffers to partition CNNs into pipeline stages and 2) task assignment, what type of FPGA to implement different CNN layers. We first formulate the system-level optimization problem with a mixed integer linear programming model. Then, we propose an efficient dynamic programming algorithm to obtain the optimal solutions. On top of that, we devise an efficient algorithm that exploits data parallelism within CNN layers to further improve cost efficiency. Evaluations on well-known CNNs demonstrate that the proposed techniques can obtain an average of 30.82% reduction in system cost under the same timing constraint, and an average of 1.5 times speedup in performance under the same cost budget, compared with the state-of-the-art techniques.

源语言英语
文章编号8412611
页(从-至)2542-2554
页数13
期刊IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
37
11
DOI
出版状态已出版 - 11月 2018

指纹

探究 'Heterogeneous FPGA-based cost-optimal design for timing-constrained CNNs' 的科研主题。它们共同构成独一无二的指纹。

引用此