TY - GEN
T1 - Low Cost Early Exit Decision Unit Design for CNN Accelerator
AU - Kim, Geonho
AU - Park, Jongsun
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/10/21
Y1 - 2020/10/21
N2 - Early exit has been studied as a way to reduce the complex computation of convolutional neural networks. However, in order to determine whether to exit early in a conventional CNN accelerator, there is a problem that a unit for computing softmax layer having a large hardware overhead is required. To solve this problem, we propose a low cost early exit decision unit. The proposed architecture uses only fully-connected (FC) layer outputs to make early exit decisions, so the computation of the softmax layer is not necessary. Our implementation results show an energy reduction of 68% with an accuracy drop of less than 0.3%.
AB - Early exit has been studied as a way to reduce the complex computation of convolutional neural networks. However, in order to determine whether to exit early in a conventional CNN accelerator, there is a problem that a unit for computing softmax layer having a large hardware overhead is required. To solve this problem, we propose a low cost early exit decision unit. The proposed architecture uses only fully-connected (FC) layer outputs to make early exit decisions, so the computation of the softmax layer is not necessary. Our implementation results show an energy reduction of 68% with an accuracy drop of less than 0.3%.
KW - CNN accelerator
KW - early exit
KW - softmax
UR - http://www.scopus.com/inward/record.url?scp=85100773253&partnerID=8YFLogxK
U2 - 10.1109/ISOCC50952.2020.9333079
DO - 10.1109/ISOCC50952.2020.9333079
M3 - Conference contribution
AN - SCOPUS:85100773253
T3 - Proceedings - International SoC Design Conference, ISOCC 2020
SP - 127
EP - 128
BT - Proceedings - International SoC Design Conference, ISOCC 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 17th International System-on-Chip Design Conference, ISOCC 2020
Y2 - 21 October 2020 through 24 October 2020
ER -