Poster: Adaptive In-Network Inference using Early-Exits

Heewon Kim, Seongyeon Yoon, Sangheon Pack

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    In-network (or on-path) inference over programmable data planes allows fast and low-overhead inference in deep neural networks. In this work, we propose an adaptive approach to strike the balance between accuracy and processing cost. To be specific, the confidence score is evaluated at the end of each layer, and an early exit is triggered if the confidence score is sufficiently high. We implement this early-exit scheme over BMv2 software switches and the results demonstrate that the proposed scheme successfully controls the trade-off by making use of the confidence score.

    Original languageEnglish
    Title of host publicationEuroP4 2023 - Proceedings of the 6th International Workshop on P4 in Europe
    PublisherAssociation for Computing Machinery, Inc
    Pages61-64
    Number of pages4
    ISBN (Electronic)9798400704468
    DOIs
    Publication statusPublished - 2023 Dec 8
    Event6th International Workshop on P4 in Europe, EuroP4 2023, co-located with ACM CoNEXT 2023 - Paris, France
    Duration: 2023 Dec 8 → …

    Publication series

    NameEuroP4 2023 - Proceedings of the 6th International Workshop on P4 in Europe

    Conference

    Conference6th International Workshop on P4 in Europe, EuroP4 2023, co-located with ACM CoNEXT 2023
    Country/TerritoryFrance
    CityParis
    Period23/12/8 → …

    Bibliographical note

    Publisher Copyright:
    © 2023 ACM.

    Keywords

    • early-exit
    • in-network intelligence
    • programmable data plane

    ASJC Scopus subject areas

    • Computer Networks and Communications
    • Hardware and Architecture
    • Electrical and Electronic Engineering

    Fingerprint

    Dive into the research topics of 'Poster: Adaptive In-Network Inference using Early-Exits'. Together they form a unique fingerprint.

    Cite this