FPGA-accelerated for constrained high dispersal network

  • Yanliang Chen
  • , Minghua Zhu*
  • , Bo Xiao
  • , Dan Meng
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

In recent years, the Deep Neural Network (DNN) has been successfully used in image classification. Most of existing DNN often need to learn a very large set of parameters, which require a huge amount of computational resources and time to train these model parameters using the gradient descent and back-propagation procedure. To solve this issue, the PCANet has been developed for high efficient design and training of the DNN. Compared with traditional DNN, PCANet has simpler structure and better performance, which makes it attractive for hardware design. To overcome the limitations of PCANet and significantly improve its performance, we have proposed a novel model named Constrained High Dispersal Network (CHDNet) which is a variant of PCANet. In this paper, we implement the CHDNet on the Xilinx ZYNQ FPGA to ensure the instantaneity of the system with lower power than personal computer needed by taking advantage of the algorithmic parallelism and ZYNQ architecture. Our experimental results over two major datasets, the MNIST dataset for handwritten digits recognition, and the Extended Yale B dataset for face recognition, demonstrate that our model of implementation on FPGA is more than 15x faster than software implementation on PC (Intel i7-4720HQ, 2.6GHz).

Original languageEnglish
Title of host publicationProceedings - 15th IEEE International Symposium on Parallel and Distributed Processing with Applications and 16th IEEE International Conference on Ubiquitous Computing and Communications, ISPA/IUCC 2017
EditorsGregorio Martinez, Richard Hill, Geoffrey Fox, Peter Mueller, Guojun Wang
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages840-845
Number of pages6
ISBN (Electronic)9781538637906
DOIs
StatePublished - 25 May 2018
Event15th IEEE International Symposium on Parallel and Distributed Processing with Applications and 16th IEEE International Conference on Ubiquitous Computing and Communications, ISPA/IUCC 2017 - Guangzhou, China
Duration: 12 Dec 201715 Dec 2017

Publication series

NameProceedings - 15th IEEE International Symposium on Parallel and Distributed Processing with Applications and 16th IEEE International Conference on Ubiquitous Computing and Communications, ISPA/IUCC 2017

Conference

Conference15th IEEE International Symposium on Parallel and Distributed Processing with Applications and 16th IEEE International Conference on Ubiquitous Computing and Communications, ISPA/IUCC 2017
Country/TerritoryChina
CityGuangzhou
Period12/12/1715/12/17

Keywords

  • Deep neural network
  • FPGA
  • High level Synthesis
  • Image classification

Fingerprint

Dive into the research topics of 'FPGA-accelerated for constrained high dispersal network'. Together they form a unique fingerprint.

Cite this