NBDI: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations

  • Myunsoo Kim
  • , Hayeong Lee
  • , Seong Woong Shim
  • , Junho Seo
  • , Byung Jun Lee*
  • *Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

Abstract

Intelligent agents are able to make decisions based on different levels of granularity and duration. Recent advances in skill learning enabled the agent to solve complex, long-horizon tasks by effectively guiding the agent in choosing appropriate skills. However, the practice of using fixed-length skills can easily result in skipping valuable decision points, which ultimately limits the potential for further exploration and faster policy learning. In this work, we propose to learn a simple and effective termination condition that identifies decision points through a state-action novelty module that leverages agent experience data. Our approach, Novelty-based Decision Point Identification (NBDI), outperforms previous baselines in complex, long-horizon tasks, and remains effective even in the presence of significant variations in the environment configurations of downstream tasks, highlighting the importance of decision point identification in skill learning.

Original languageEnglish
Pages (from-to)30437-30461
Number of pages25
JournalProceedings of Machine Learning Research
Volume267
Publication statusPublished - 2025
Event42nd International Conference on Machine Learning, ICML 2025 - Vancouver, Canada
Duration: 2025 Jul 132025 Jul 19

Bibliographical note

Publisher Copyright:
© 2025, ML Research Press. All rights reserved.

ASJC Scopus subject areas

  • Software
  • Control and Systems Engineering
  • Statistics and Probability
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'NBDI: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations'. Together they form a unique fingerprint.

Cite this