Online Learning with Regularized Knowledge Gradients

Donghun Lee, Warren B. Powell

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

We introduce a simple and effective regularization of knowledge gradient (KG) and use it to present the first sublinear regret bound result for KG-based algorithms. We construct online learning with regularized knowledge gradients (ORKG) algorithm with independent Gaussian belief model, and prove that ORKG algorithm achieves sublinear regret upper bound with high probability facing bounded independent Gaussian multi-armed bandit (MAB) problems. The theoretical properties of regularized KG and ORKG algorithm are analyzed, and the empirical characteristics of ORKG algorithm are empirically validated with MAB benchmark simulations. ORKG algorithm shows top-tier performance comparable to select MAB algorithms with provable regret bounds.

Original languageEnglish
Title of host publicationAdvances in Knowledge Discovery and Data Mining - 26th Pacific-Asia Conference, PAKDD 2022, Proceedings
EditorsJoão Gama, Tianrui Li, Yang Yu, Enhong Chen, Yu Zheng, Fei Teng
PublisherSpringer Science and Business Media Deutschland GmbH
Pages328-339
Number of pages12
ISBN (Print)9783031059353
DOIs
Publication statusPublished - 2022
Event26th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2022 - Chengdu, China
Duration: 2022 May 162022 May 19

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13281 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference26th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2022
Country/TerritoryChina
CityChengdu
Period22/5/1622/5/19

Bibliographical note

Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Keywords

  • Knowledge gradient
  • Online learning
  • Regret analysis

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Online Learning with Regularized Knowledge Gradients'. Together they form a unique fingerprint.

Cite this