Heterogeneous FPGA-based cost-optimal design for timing-constrained CNNs

Weiwen Jiang, Edwin Hsing Mean Sha, Qingfeng Zhuge*, Lei Yang, Xianzhang Chen, Jingtong Hu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

31 Scopus citations

Abstract

Field programmable gate array (FPGA) has been one of the most popular platforms to implement convolutional neural networks (CNNs) due to its high performance and cost efficiency; however, limited by the on-chip resources, the existing single-FPGA architectures cannot fully exploit the parallelism in CNNs. In this paper, we explore heterogeneous FPGA-based designs to effectively leverage both task and data parallelism, such that the resultant system can achieve the minimum cost while satisfying timing constraints. In order to maximize the task parallelism, we investigate two critical problems: 1) buffer placement, where to place buffers to partition CNNs into pipeline stages and 2) task assignment, what type of FPGA to implement different CNN layers. We first formulate the system-level optimization problem with a mixed integer linear programming model. Then, we propose an efficient dynamic programming algorithm to obtain the optimal solutions. On top of that, we devise an efficient algorithm that exploits data parallelism within CNN layers to further improve cost efficiency. Evaluations on well-known CNNs demonstrate that the proposed techniques can obtain an average of 30.82% reduction in system cost under the same timing constraint, and an average of 1.5 times speedup in performance under the same cost budget, compared with the state-of-the-art techniques.

Original languageEnglish
Article number8412611
Pages (from-to)2542-2554
Number of pages13
JournalIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Volume37
Issue number11
DOIs
StatePublished - Nov 2018

Keywords

  • Convolutional neural networks (CNNs)
  • heterogeneous field programmable gate array (FPGA) cluster
  • optimization algorithms
  • partitioning and mapping

Fingerprint

Dive into the research topics of 'Heterogeneous FPGA-based cost-optimal design for timing-constrained CNNs'. Together they form a unique fingerprint.

Cite this