A Hierarchical and Reconfigurable Process Element Design for Quantized Neural Networks

Yu Guang Chen, Chi Wei Hsu, Hung Yi Chiang, Tsung Han Hsieh, Jing Yang Jou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Convolution neural networks are very popular for various applications. However, data size and accuracy are the two major concerns to perform efficient and effective computations. In conventional CNN models, 32bits data are frequently used to maintain high accuracy. However, performing a bunch of 32bits multiply-and-accumulate (MAC) operations causes significant computing efforts as well as power consumptions. Therefore, recently researchers develop various methods to reduce data size and speed up calculations. Quantization is one of the techniques which reduces the number of the bits of data and the computational complexity at the cost of accuracy loss. To provide better computation effort and accuracy trade-off, different bitwidth may be applied to different layers within a CNN model. Therefore, a flexible Processing Element (PE) which can support operations of different bitwidth is in demand. In this paper, we propose a hierarchal PE structure that can support 8bits x 8bits, 8bits x 4bits, 4bits x 4bits and 2bits x 2bits operations. The structure applies the concept of reconfiguration and can avoid the redundant hardware for reconfiguration. Moreover, the concept of pipelining is also adopted in our design to provide better efficiency. The experimental results show that in 2bits x 2bits PE, we can achieve area reduction by 57% and 68% compared to a Precision-Scalable accelerator and Bit Fusion, respectively.

Original languageEnglish
Title of host publicationProceedings - 34th IEEE International System-on-Chip Conference, SOCC 2021
EditorsGang Qu, Jinjun Xiong, Danella Zhao, Venki Muthukumar, Md Farhadur Reza, Ramalingam Sridhar
PublisherIEEE Computer Society
Pages278-283
Number of pages6
ISBN (Electronic)9781665429313
DOIs
StatePublished - 2021
Event34th IEEE International System-on-Chip Conference, SOCC 2021 - Virtual, Online, United States
Duration: 14 Sep 202117 Sep 2021

Publication series

NameInternational System on Chip Conference
Volume2021-September
ISSN (Print)2164-1676
ISSN (Electronic)2164-1706

Conference

Conference34th IEEE International System-on-Chip Conference, SOCC 2021
Country/TerritoryUnited States
CityVirtual, Online
Period14/09/2117/09/21

Keywords

  • Processing Element (PE)
  • Quantized Neural Networks (QNN)
  • Reconfigurable Design

Fingerprint

Dive into the research topics of 'A Hierarchical and Reconfigurable Process Element Design for Quantized Neural Networks'. Together they form a unique fingerprint.

Cite this