This document discusses using deep reinforcement learning to coordinate control of multiple agents. Specifically, it proposes using decentralized neural networks, with each agent having its own network, rather than a single network controlling all agents. This decentralized approach led to improved performance, with policies up to twice as good as the single network approach. It also allowed for faster training times since the agent networks could be trained in parallel. The decentralized control achieved better scaling to complex multi-agent tasks compared to the traditional centralized approach.
Related topics: