Home /Research /Sparse Discrete Communication Learning for Multi-Agent Cooperation Through Backpropagation
LEARNING

Sparse Discrete Communication Learning for Multi-Agent Cooperation Through Backpropagation

Benjamin Freed, Rohan James, Guillaume Sartoretti, Howie Choset

Year
2020
Citations
11

Abstract

Recent approaches to multi-agent reinforcement learning (MARL) with inter-agent communication have often overlooked important considerations of real-world communication networks, such as limits on bandwidth. In this paper, we propose an approach to learning sparse discrete communication through backpropagation in the context of MARL, in which agents are incentivized to communicate as little as possible while still achieving high reward. Building on top of our prior work on differentiable discrete communication learning, we develop a regularization-inspired message-length penalty term, that encourages agents to send shorter messages and avoid unnecessary communications. To this end, we introduce a variable-length message code that provides agents with a general means of modulating message length while keeping the overall learning objective differentiable. We present simulation results on a partially-observable robot navigation task, where we first show how our approach allows learning of sparse communication behavior while still solving the task. We finally demonstrate our approach can even learn an effective sparse communication behavior from demonstrations of an expert (potentially communication-free) policy.

Keywords

Computer scienceDifferentiable functionReinforcement learningArtificial intelligenceBackpropagationTask (project management)Context (archaeology)Distributed computingMachine learningArtificial neural network

Related papers

Browse all LEARNING papers