Offline Model-based Optimization for Real-World Molecular Discover

  • Dong Hee Shin
  • , Young Han Son
  • , Hyun Jung Lee
  • , Deok Joong Lee
  • , Tae Eui Kam*
  • *Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

Abstract

Molecular discovery has attracted significant attention in scientific fields for its ability to generate novel molecules with desirable properties. Although numerous methods have been developed to tackle this problem, most rely on an online setting that requires repeated online evaluation of candidate molecules using the oracle. However, in real-world molecular discovery, the oracle is often represented by wet lab experiments, making this online setting impractical due to the significant time and resource demands. To fill this gap, we propose the Molecular Stitching (MolStitch) framework, which utilizes a fixed offline dataset to explore and optimize molecules without the need for repeated oracle evaluations. Specifically, Mol-Stitch leverages existing molecules from the offline dataset to generate novel ‘stitched molecules’ that combine their desirable properties. These stitched molecules are then used as training samples to fine-tune the generative model using preference optimization techniques. Experimental results on various offline multi-objective molecular optimization problems validate the effectiveness of MolStitch. The source code is available online.

Original languageEnglish
Pages (from-to)55205-55254
Number of pages50
JournalProceedings of Machine Learning Research
Volume267
Publication statusPublished - 2025
Event42nd International Conference on Machine Learning, ICML 2025 - Vancouver, Canada
Duration: 2025 Jul 132025 Jul 19

Bibliographical note

Publisher Copyright:
© 2025, ML Research Press. All rights reserved.

ASJC Scopus subject areas

  • Software
  • Control and Systems Engineering
  • Statistics and Probability
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Offline Model-based Optimization for Real-World Molecular Discover'. Together they form a unique fingerprint.

Cite this