Multi-agent Reinforcement Learning with Deep Networks for Diverse Q-Vectors

Zhenglong Luo, Zhiyong Chen, James Welsh

Year of publication: 2024
Access: Open access

Abstract

Multi-agent reinforcement learning (MARL) has become a significant research topic due to its ability to facilitate learning in complex environments. In multi-agent tasks, the state-action value, commonly referred to as the Q-value, can vary among agents because each agent receives its own reward, so the values form a Q-vector with one entry per agent. Determining an optimal policy is challenging, as it involves more than just maximizing a single Q-value. Various notions of an optimal policy, such as a Nash equilibrium, have been studied in this context. Algorithms like Nash Q-learning and Nash Actor-Critic have shown effectiveness in these scenarios. This paper extends this research by proposing a deep Q-network (DQN) algorithm capable of learning various Q-vectors using Max, Nash, and Maximin strategies. The effectiveness of this approach is demonstrated in an environment where dual robotic arms collaborate to lift a pot.
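The abstract names the Max, Nash, and Maximin strategies without defining them, so the following is only an illustrative sketch of one common reading of each, for two agents and a single state with tabular Q-values. The function names, the array layout (`q[a1, a2]`), the restriction to pure-strategy Nash equilibria, and the example numbers are all assumptions made for illustration; this is not the paper's DQN algorithm.

```python
import numpy as np

# Hypothetical Q-vector for one state: q1[a1, a2] and q2[a1, a2] are the
# Q-values of agent 1 and agent 2 for the joint action (a1, a2).
# The numbers form a prisoner's-dilemma-style example (assumed, not from the paper).
q1 = np.array([[3.0, 0.0],
               [5.0, 1.0]])
q2 = q1.T.copy()

def max_value(q):
    """Max (optimistic): the best entry over all joint actions."""
    return float(q.max())

def maximin_value(q, own_axis):
    """Maximin (pessimistic): best own action against the worst-case
    action of the other agent. own_axis is 0 for agent 1, 1 for agent 2."""
    other_axis = 1 - own_axis
    worst_case = q.min(axis=other_axis)  # one value per own action
    return float(worst_case.max())

def pure_nash(q1, q2):
    """Pure-strategy Nash equilibria: joint actions from which neither
    agent can improve its own Q-value by deviating alone."""
    best1 = q1 == q1.max(axis=0, keepdims=True)  # agent 1 best response per a2
    best2 = q2 == q2.max(axis=1, keepdims=True)  # agent 2 best response per a1
    rows, cols = np.where(best1 & best2)
    return [(int(i), int(j)) for i, j in zip(rows, cols)]

print("Max value, agent 1:", max_value(q1))
print("Maximin value, agent 1:", maximin_value(q1, own_axis=0))
print("Pure Nash joint actions:", pure_nash(q1, q2))
```

In this toy example the three strategies disagree: the optimistic Max value for agent 1 comes from a joint action that is not an equilibrium, which illustrates why choosing a policy from a Q-vector involves more than maximizing a single Q-value.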

Keywords

cs.AI, cs.MA
