Skip to main navigation Skip to search Skip to main content

DynaMIX: Sample-Efficient Multi Agent Reinforcement Learning with Multi-step Temporal Forward Dynamics Modeling

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In multi-agent reinforcement Learning (MARL), improving sample efficiency poses a significant challenge. Existing methods such as QMIX primarily focus on modeling agent cooperation but still face limitations, including the requirement for extensive interaction data and long training times. To address these challenges, existing studies have investigated the use of auxiliary tasks based on representation learning, including self-supervised learning approaches. In this paper, we propose DynaMIX, which incorporates a multi-step temporal forward dynamics modeling (MTFDM) as an auxiliary task for QMIX. DynaMIX forecasts the future state beyond the immediate next state. This enables explicit learning of environmental dynamics, allowing for better understanding and adaptation to complex multi-agent interactions. Experimental results on the StarCraft II micromanagement benchmark demonstrate that DynaMIX significantly improves sample efficiency under limited interactions compared to QMIX.

Original languageEnglish
Title of host publicationAdvances and Trends in Artificial Intelligence. Theory and Applications - 38th International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2025, Proceedings
EditorsHamido Fujita, Yutaka Watanobe, Moonis Ali, Yinglin Wang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages188-193
Number of pages6
ISBN (Print)9789819688883
DOIs
Publication statusPublished - 2026
Event38th International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2025 - Kitakyushu, Japan
Duration: 2025 Jul 12025 Jul 4

Publication series

NameLecture Notes in Computer Science
Volume15706 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference38th International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2025
Country/TerritoryJapan
CityKitakyushu
Period25/7/125/7/4

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2026.

Keywords

  • Forward Dynamics Modeling
  • Multi-agent Reinforcement Learning
  • Representation Learning
  • Sample Efficiency
  • StarCraft II

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'DynaMIX: Sample-Efficient Multi Agent Reinforcement Learning with Multi-step Temporal Forward Dynamics Modeling'. Together they form a unique fingerprint.

Cite this