Decentralization of Multiagent Policies by Learning What to Communicate

James Paulos, Steven W. Chen, Daigo Shishika, Vijay Kumar

发表年份: 2019
引用次数: 4

摘要

Effective communication is required for teams of robots to solve sophisticated collaborative tasks. In practice it is typical for both the encoding and semantics of communication to be manually defined by an expert; this is true regardless of whether the behaviors themselves are bespoke, optimization based, or learned. We present an agent architecture and training methodology using neural networks to learn task-oriented communication semantics based on the example of a communication-unaware expert policy. A perimeter defense game illustrates the system's ability to handle dynamically changing numbers of agents and its graceful degradation in performance as communication constraints are tightened or the expert's observability assumptions are broken.

关键词

BespokeComputer scienceTask (project management)Semantics (computer science)ObservabilityArtificial intelligenceEncoding (memory)Human–computer interactionDistributed computingRobot

Decentralization of Multiagent Policies by Learning What to Communicate

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory