TY - GEN
T1 - Combinatorial multi-armed bandits in cognitive radio networks
T2 - 8th International Conference on Information and Communication Technology Convergence, ICTC 2017
AU - Kang, Sunjung
AU - Joo, Changhee
N1 - Funding Information:
This work was supported by the research fund of the Signal Intelligence Research Center supervised by the Defense Acquisition Program Administration and Agency for Defense Development of Korea.
Publisher Copyright:
© 2017 IEEE.
PY - 2017/12/12
Y1 - 2017/12/12
N2 - The combinatorial multi-armed bandit (MAB) problem can be used to formulate sequential decision problems with an exploration-exploitation tradeoff. Dynamic spectrum access (DSA) in cognitive radio (CR) networks is one of its important applications. In this work, we briefly overview combinatorial MAB problems and their possible applications to CR networks. We first investigate the standard MAB problem, in which a single player, at each round, either explores an arm to gather information to improve its decision strategy or exploits an arm based on the information it has collected. Then, we study the taxonomy of combinatorial MAB problems, in particular for multi-player scenarios with independent and identically distributed (i.i.d.) rewards. Finally, we discuss limitations of existing works and interesting open problems.
AB - The combinatorial multi-armed bandit (MAB) problem can be used to formulate sequential decision problems with an exploration-exploitation tradeoff. Dynamic spectrum access (DSA) in cognitive radio (CR) networks is one of its important applications. In this work, we briefly overview combinatorial MAB problems and their possible applications to CR networks. We first investigate the standard MAB problem, in which a single player, at each round, either explores an arm to gather information to improve its decision strategy or exploits an arm based on the information it has collected. Then, we study the taxonomy of combinatorial MAB problems, in particular for multi-player scenarios with independent and identically distributed (i.i.d.) rewards. Finally, we discuss limitations of existing works and interesting open problems.
KW - Cognitive radio networks
KW - Combinatorial multi-armed bandits
KW - Multi-armed bandits
UR - http://www.scopus.com/inward/record.url?scp=85046892612&partnerID=8YFLogxK
U2 - 10.1109/ICTC.2017.8190862
DO - 10.1109/ICTC.2017.8190862
M3 - Conference contribution
AN - SCOPUS:85046892612
T3 - International Conference on Information and Communication Technology Convergence: ICT Convergence Technologies Leading the Fourth Industrial Revolution, ICTC 2017
SP - 1086
EP - 1088
BT - International Conference on Information and Communication Technology Convergence
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 18 October 2017 through 20 October 2017
ER -