Empirical Evaluation of Acquisition Functions for Bayesian Optimization-Based Configuration Tuning of Apache Spark Applications

Research output: Contribution to journalArticlepeer-review

Abstract

The execution time of an Apache Spark application is heavily influenced by its configuration settings. Accordingly, Bayesian Optimization (BO) is commonly used for automated tuning, employing the acquisition function, Expected Improvement (EI). However, existing works did not compare the performance to the other acquisition functions empirically. In this paper, we show that EI may not work well for Spark applications due to a huge search space compared to the other optimization problems. In addition, we demonstrate the performance of BO based on Probability of Improvement (PI), which achieves exploration via rich random initialization and exploitation via the PI acquisition function. Through the experimental evaluations, we show that the PI-based BO outperforms the EI-based BO in both optimal time and optimization cost.

Original languageEnglish
Pages (from-to)1246-1249
Number of pages4
JournalIEICE Transactions on Information and Systems
VolumeE108
Issue number10
DOIs
Publication statusPublished - 2025 Oct 1

Bibliographical note

Publisher Copyright:
Copyright © 2025 The Institute of Electronics, Information and Communication Engineers.

Keywords

  • Bayesian optimization
  • apache spark
  • configuration tuning

ASJC Scopus subject areas

  • Software
  • Hardware and Architecture
  • Computer Vision and Pattern Recognition
  • Electrical and Electronic Engineering
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Empirical Evaluation of Acquisition Functions for Bayesian Optimization-Based Configuration Tuning of Apache Spark Applications'. Together they form a unique fingerprint.

Cite this