Abstract
Recently, the increase of low-latency applications has led to the need for faster data transmission. Meanwhile, in-network inference, which utilizes neural networks (NNs) in the data plane for traffic analysis (e.g., traffic classification, in-trusion detection), is gaining attention for its line-rate processing capability. This approach offers a solution for low-latency tasks by enabling simultaneous traffic analysis and packet processing within the network. However, existing approaches primarily focus on traffic analysis and overhead reduction without considering flow priorities, which can not satisfy sufficient performance for diverse applications. To address this, we propose rank-based traffic classification that considers both classification accuracy and quality of service (QoS) of each flow. First, we suggest a utility function that represents both task accuracy and the degree of satisfaction about flow requirements. Next, we use early-exit inference to meet flow requirements by deciding whether to stop inference procedure or continue to the subsequent steps based on the utility function. Experimental results show that our approach reduces flow completion time compared with baseline, while achieving comparable classification accuracy.
| Original language | English |
|---|---|
| Title of host publication | ICTC 2024 - 15th International Conference on ICT Convergence |
| Subtitle of host publication | AI-Empowered Digital Innovation |
| Publisher | IEEE Computer Society |
| Pages | 2160-2161 |
| Number of pages | 2 |
| ISBN (Electronic) | 9798350364637 |
| DOIs | |
| Publication status | Published - 2024 |
| Event | 15th International Conference on Information and Communication Technology Convergence, ICTC 2024 - Jeju Island, Korea, Republic of Duration: 2024 Oct 16 → 2024 Oct 18 |
Publication series
| Name | International Conference on ICT Convergence |
|---|---|
| ISSN (Print) | 2162-1233 |
| ISSN (Electronic) | 2162-1241 |
Conference
| Conference | 15th International Conference on Information and Communication Technology Convergence, ICTC 2024 |
|---|---|
| Country/Territory | Korea, Republic of |
| City | Jeju Island |
| Period | 24/10/16 → 24/10/18 |
Bibliographical note
Publisher Copyright:© 2024 IEEE.
Keywords
- Adaptive Inference
- Early-Exit
- In-Network Intelligence
- Programmable Data Plane
- QoS
ASJC Scopus subject areas
- Information Systems
- Computer Networks and Communications