Abstract
Deep neural networks (DNNs) are widely used to meet the needs of service consumers. However, most research on DNN focuses on the use of single-task learning models. In this approach, each task necessitates distinct inference resources, which results in the allocation of separate computing resources. This increases the possibility of failing to meet the deadline for task completion due to insufficient computing resources, especially as the task request rate increases. To tackle this challenge, we propose a resource-efficient and adaptive model selection (REMS) scheme that adaptively selects a multi-task learning model (MTL) and single-task learning (STL) model. We formulate the model selection problem as a Markov decision process (MDP) to minimize resource consumption while satisfying latency requirements. The optimal policy can be obtained by converting the MDP problem into Q-learning. The evaluation results demonstrate that REMS achieves a significant performance improvement compared to the existing scheme.
Original language | English |
---|---|
Title of host publication | ICTC 2023 - 14th International Conference on Information and Communication Technology Convergence |
Subtitle of host publication | Exploring the Frontiers of ICT Innovation |
Publisher | IEEE Computer Society |
Pages | 491-493 |
Number of pages | 3 |
ISBN (Electronic) | 9798350313277 |
DOIs | |
Publication status | Published - 2023 |
Event | 14th International Conference on Information and Communication Technology Convergence, ICTC 2023 - Jeju Island, Korea, Republic of Duration: 2023 Oct 11 → 2023 Oct 13 |
Publication series
Name | International Conference on ICT Convergence |
---|---|
ISSN (Print) | 2162-1233 |
ISSN (Electronic) | 2162-1241 |
Conference
Conference | 14th International Conference on Information and Communication Technology Convergence, ICTC 2023 |
---|---|
Country/Territory | Korea, Republic of |
City | Jeju Island |
Period | 23/10/11 → 23/10/13 |
Bibliographical note
Publisher Copyright:© 2023 IEEE.
Keywords
- Core network
- Q-learning
- computing resource
- multi-task learning
ASJC Scopus subject areas
- Information Systems
- Computer Networks and Communications