CtoD-MAT: Bridging Centralized and Decentralized Execution in Multi-Agent Reinforcement Learning (Student Abstract)

Shota Takayama; Katsuhide Fujita

AAAI 2026

Abstract

Although centralized training with centralized execution (CTCE) excels at multi-agent coordination, its reliance on global information limits its use in the real world. Conversely, the more practical centralized training with decentralized execution (CTDE) paradigm often struggles with complex coordination. This paper bridges this gap by introducing the Centralized-to-Decentralized (CtoD) learning concept: a novel framework for transferring the knowledge of a powerful centralized policy into a robust, practical decentralized policy. Our method, CtoD-MAT, realizes this transition through a curriculum that gradually shifts agents from centralized to decentralized control. A key innovation is our dynamic scheduling mechanism, featuring a mediator module, which ensures robust and effective knowledge transfer. On challenging SMAC benchmarks, we demonstrate that CtoD-MAT produces competitive decentralized policies, notably solving complex coordination tasks that are difficult for standard CTDE methods.
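To make the idea of a curriculum that gradually shifts agents from centralized to decentralized control more concrete, the following is a minimal sketch and not the authors' implementation. It assumes a fixed linear schedule over training steps, whereas the abstract describes a dynamic schedule with a mediator module whose details are not given here. The function names `num_decentralized` and `assign_modes`, and the labels `"centralized"` and `"decentralized"`, are hypothetical and introduced only for illustration.

```python
import random


def num_decentralized(step: int, total_steps: int, n_agents: int) -> int:
    """Number of agents acting in decentralized mode at a given training step.

    Assumption: a simple linear schedule, going from 0 agents at step 0
    to all agents at total_steps. This is a stand-in for the paper's
    dynamic scheduling, which is not specified in the abstract.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Clamp training progress to [0, 1] so steps beyond the schedule stay fully decentralized.
    progress = min(max(step / total_steps, 0.0), 1.0)
    return int(progress * n_agents)


def assign_modes(step: int, total_steps: int, n_agents: int,
                 rng: random.Random) -> list[str]:
    """Pick which agents act from local observations only at this step.

    The remaining agents keep using the centralized policy.
    Agents are chosen uniformly at random (an illustrative choice).
    """
    k = num_decentralized(step, total_steps, n_agents)
    chosen = set(rng.sample(range(n_agents), k))
    return ["decentralized" if i in chosen else "centralized"
            for i in range(n_agents)]


if __name__ == "__main__":
    rng = random.Random(0)
    for step in (0, 250, 500, 750, 1000):
        print(step, assign_modes(step, 1000, 4, rng))
```

In this sketch every agent starts under centralized control and the share of decentralized agents grows with training progress, so that by the end of the schedule all agents act without global information.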



Topics

Artificial Intelligence > Core AI > Multi-Agent Systems
Reinforcement Learning > Methods > Multi-Agent Systems

Keywords

multi-agent reinforcement learning, knowledge transfer, centralized training, decentralized execution
