Projects per year
Abstract
Convolution neural networks are very popular for various applications. However, data size and accuracy are the two major concerns to perform efficient and effective computations. In conventional CNN models, 32bits data are frequently used to maintain high accuracy. However, performing a bunch of 32bits multiply-and-accumulate (MAC) operations causes significant computing efforts as well as power consumptions. Therefore, recently researchers develop various methods to reduce data size and speed up calculations. Quantization is one of the techniques which reduces the number of the bits of data and the computational complexity at the cost of accuracy loss. To provide better computation effort and accuracy trade-off, different bitwidth may be applied to different layers within a CNN model. Therefore, a flexible Processing Element (PE) which can support operations of different bitwidth is in demand. In this paper, we propose a hierarchal PE structure that can support 8bits x 8bits, 8bits x 4bits, 4bits x 4bits and 2bits x 2bits operations. The structure applies the concept of reconfiguration and can avoid the redundant hardware for reconfiguration. Moreover, the concept of pipelining is also adopted in our design to provide better efficiency. The experimental results show that in 2bits x 2bits PE, we can achieve area reduction by 57% and 68% compared to a Precision-Scalable accelerator and Bit Fusion, respectively.
Original language | English |
---|---|
Title of host publication | Proceedings - 34th IEEE International System-on-Chip Conference, SOCC 2021 |
Editors | Gang Qu, Jinjun Xiong, Danella Zhao, Venki Muthukumar, Md Farhadur Reza, Ramalingam Sridhar |
Publisher | IEEE Computer Society |
Pages | 278-283 |
Number of pages | 6 |
ISBN (Electronic) | 9781665429313 |
DOIs | |
State | Published - 2021 |
Event | 34th IEEE International System-on-Chip Conference, SOCC 2021 - Virtual, Online, United States Duration: 14 Sep 2021 → 17 Sep 2021 |
Publication series
Name | International System on Chip Conference |
---|---|
Volume | 2021-September |
ISSN (Print) | 2164-1676 |
ISSN (Electronic) | 2164-1706 |
Conference
Conference | 34th IEEE International System-on-Chip Conference, SOCC 2021 |
---|---|
Country/Territory | United States |
City | Virtual, Online |
Period | 14/09/21 → 17/09/21 |
Keywords
- Processing Element (PE)
- Quantized Neural Networks (QNN)
- Reconfigurable Design
Fingerprint
Dive into the research topics of 'A Hierarchical and Reconfigurable Process Element Design for Quantized Neural Networks'. Together they form a unique fingerprint.Projects
- 2 Finished
-
-
Machine Learning Based Negative-Bias Temperature Instability (Nbti) Detection and Mitigation for Heterogeneous Multi-Core Systems(1/2)
Chen, Y.-G. (PI)
1/08/20 → 31/07/21
Project: Research