iSPADE: End-to-end Sparse Architecture for Dense DNN Acceleration via Inverted-bit Representation

Dongjun Kim, Han Cho, Jongsun Park

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

While recent cutting-edge deep neural network (DNN) models, such as large language models (LLMs), demonstrate remarkable capabilities, their inherent dense data characteristics limit the performance and energy gains achievable through sparse acceleration. In this paper, we introduce the iSPADE architecture, which sparsifies end-to-end execution of dense DNNs to directly adapt the advantages of sparse acceleration without applying accuracy-sensitive techniques such as pruning. First, we propose inverted-bit representation to eliminate repetitive sign bits in 2's complement representation. Leveraging the inverted-bit representation that generates a significant number of zero bits, we propose data packing and computation skipping techniques to reduce both redundant data movement and computation. Finally, we present an iSPADE bit-slice hardware architecture that efficiently supports and accelerates the proposed sparse dataflow. In the evaluation results, we assess performance across general DNN workloads using 8 popular DNNs. iSPADE achieves 4.1X and 4.5X improvements in energy efficiency and speedup, respectively, over the previous state-of-the-art bit-slice accelerators, and it realizes a 1.7X reduction in memory footprint.

Original languageEnglish
Title of host publicationProceedings of the 29th International Symposium on Low Power Electronics and Design, ISLPED 2024
PublisherAssociation for Computing Machinery, Inc
ISBN (Electronic)9798400706882
DOIs
Publication statusPublished - 2024 Aug 5
Event29th ACM/IEEE International Symposium on Low Power Electronics and Design, ISLPED 2024 - Newport Beach, United States
Duration: 2024 Aug 52024 Aug 7

Publication series

NameProceedings of the 29th International Symposium on Low Power Electronics and Design, ISLPED 2024

Conference

Conference29th ACM/IEEE International Symposium on Low Power Electronics and Design, ISLPED 2024
Country/TerritoryUnited States
CityNewport Beach
Period24/8/524/8/7

Bibliographical note

Publisher Copyright:
© 2024 Copyright is held by the owner/author(s). Publication rights licensed to ACM.

Keywords

  • binary representation
  • deep neural network
  • sparse acceleration

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'iSPADE: End-to-end Sparse Architecture for Dense DNN Acceleration via Inverted-bit Representation'. Together they form a unique fingerprint.

Cite this