Abstract
In multi-agent reinforcement Learning (MARL), improving sample efficiency poses a significant challenge. Existing methods such as QMIX primarily focus on modeling agent cooperation but still face limitations, including the requirement for extensive interaction data and long training times. To address these challenges, existing studies have investigated the use of auxiliary tasks based on representation learning, including self-supervised learning approaches. In this paper, we propose DynaMIX, which incorporates a multi-step temporal forward dynamics modeling (MTFDM) as an auxiliary task for QMIX. DynaMIX forecasts the future state beyond the immediate next state. This enables explicit learning of environmental dynamics, allowing for better understanding and adaptation to complex multi-agent interactions. Experimental results on the StarCraft II micromanagement benchmark demonstrate that DynaMIX significantly improves sample efficiency under limited interactions compared to QMIX.
| Original language | English |
|---|---|
| Title of host publication | Advances and Trends in Artificial Intelligence. Theory and Applications - 38th International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2025, Proceedings |
| Editors | Hamido Fujita, Yutaka Watanobe, Moonis Ali, Yinglin Wang |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 188-193 |
| Number of pages | 6 |
| ISBN (Print) | 9789819688883 |
| DOIs | |
| Publication status | Published - 2026 |
| Event | 38th International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2025 - Kitakyushu, Japan Duration: 2025 Jul 1 → 2025 Jul 4 |
Publication series
| Name | Lecture Notes in Computer Science |
|---|---|
| Volume | 15706 LNAI |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 38th International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2025 |
|---|---|
| Country/Territory | Japan |
| City | Kitakyushu |
| Period | 25/7/1 → 25/7/4 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2026.
Keywords
- Forward Dynamics Modeling
- Multi-agent Reinforcement Learning
- Representation Learning
- Sample Efficiency
- StarCraft II
ASJC Scopus subject areas
- Theoretical Computer Science
- General Computer Science
Fingerprint
Dive into the research topics of 'DynaMIX: Sample-Efficient Multi Agent Reinforcement Learning with Multi-step Temporal Forward Dynamics Modeling'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS