Abstract
The execution time of an Apache Spark application is heavily influenced by its configuration settings. Accordingly, Bayesian Optimization (BO) is commonly used for automated tuning, employing the acquisition function, Expected Improvement (EI). However, existing works did not compare the performance to the other acquisition functions empirically. In this paper, we show that EI may not work well for Spark applications due to a huge search space compared to the other optimization problems. In addition, we demonstrate the performance of BO based on Probability of Improvement (PI), which achieves exploration via rich random initialization and exploitation via the PI acquisition function. Through the experimental evaluations, we show that the PI-based BO outperforms the EI-based BO in both optimal time and optimization cost.
| Original language | English |
|---|---|
| Pages (from-to) | 1246-1249 |
| Number of pages | 4 |
| Journal | IEICE Transactions on Information and Systems |
| Volume | E108 |
| Issue number | 10 |
| DOIs | |
| Publication status | Published - 2025 Oct 1 |
Bibliographical note
Publisher Copyright:Copyright © 2025 The Institute of Electronics, Information and Communication Engineers.
Keywords
- Bayesian optimization
- apache spark
- configuration tuning
ASJC Scopus subject areas
- Software
- Hardware and Architecture
- Computer Vision and Pattern Recognition
- Electrical and Electronic Engineering
- Artificial Intelligence
Fingerprint
Dive into the research topics of 'Empirical Evaluation of Acquisition Functions for Bayesian Optimization-Based Configuration Tuning of Apache Spark Applications'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS