Poster: Adaptive In-Network Inference using Early-Exits

Heewon Kim, Seongyeon Yoon, Sangheon Pack

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In-network (or on-path) inference over programmable data planes allows fast and low-overhead inference in deep neural networks. In this work, we propose an adaptive approach to strike the balance between accuracy and processing cost. To be specific, the confidence score is evaluated at the end of each layer, and an early exit is triggered if the confidence score is sufficiently high. We implement this early-exit scheme over BMv2 software switches and the results demonstrate that the proposed scheme successfully controls the trade-off by making use of the confidence score.

Original languageEnglish
Title of host publicationEuroP4 2023 - Proceedings of the 6th International Workshop on P4 in Europe
PublisherAssociation for Computing Machinery, Inc
Pages61-64
Number of pages4
ISBN (Electronic)9798400704468
DOIs
Publication statusPublished - 2023 Dec 8
Event6th International Workshop on P4 in Europe, EuroP4 2023, co-located with ACM CoNEXT 2023 - Paris, France
Duration: 2023 Dec 8 → …

Publication series

NameEuroP4 2023 - Proceedings of the 6th International Workshop on P4 in Europe

Conference

Conference6th International Workshop on P4 in Europe, EuroP4 2023, co-located with ACM CoNEXT 2023
Country/TerritoryFrance
CityParis
Period23/12/8 → …

Bibliographical note

Publisher Copyright:
© 2023 ACM.

Keywords

  • early-exit
  • in-network intelligence
  • programmable data plane

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Hardware and Architecture
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Poster: Adaptive In-Network Inference using Early-Exits'. Together they form a unique fingerprint.

Cite this