DPP-ViT: Dynamic Patch Pruning for Low Complexity Vision Transformer Accelerator

  • Han Cho*
  • , Joongho Jo*
  • , Seung Eon Hwang
  • , Jongsun Park
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

While vision transformers excel in various computer vision tasks, their high computational cost limits use on resource-constrained devices, highlighting the need for complexity reduction. In this paper, we present dynamic patch pruning for low complexities of vision transformers (DPP-ViT). To identify relatively more important patches, the column-wise accumulations of attention maps are computed and those are used as importance scores. Through DPP-ViT with block-wise importance score thresholds, our approach considers image-wise difficulties and block-wise sensitivities, removing the patches that contribute the least to accuracies. Additionally, we present a reconfigurable accelerator that dynamically changes dataflow and PE structure by applying pruning-aware row-level reconfiguration. DPP-ViT achieves 47% computation reduction with a minor -0.25% degradation on DeiT-B model in ImageNet top-1 accuracy. The proposed reconfigurable accelerator also achieves 47.96×/ 4.36×/ 1.47× speed-ups compared to EdgeCPU, EdgeGPU, and vision transformer accelerator ViTCoD.

Original languageEnglish
Title of host publicationAICAS 2025 - 2025 7th IEEE International Conference on Artificial Intelligence Circuits and Systems, Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331524241
DOIs
Publication statusPublished - 2025
Event7th IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2025 - Bordeaux, France
Duration: 2025 Apr 282025 Apr 30

Publication series

NameAICAS 2025 - 2025 7th IEEE International Conference on Artificial Intelligence Circuits and Systems, Proceedings

Conference

Conference7th IEEE International Conference on Artificial Intelligence Circuits and Systems, AICAS 2025
Country/TerritoryFrance
CityBordeaux
Period25/4/2825/4/30

Bibliographical note

Publisher Copyright:
© 2025 IEEE.

Keywords

  • neural network
  • patch pruning
  • reconfigurable accelerator
  • vision transformer

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Vision and Pattern Recognition
  • Hardware and Architecture
  • Electrical and Electronic Engineering
  • Instrumentation

Fingerprint

Dive into the research topics of 'DPP-ViT: Dynamic Patch Pruning for Low Complexity Vision Transformer Accelerator'. Together they form a unique fingerprint.

Cite this