Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous Control

Takuya Kanazawa, Haiyan Wang, Chetan Gupta

Published: 2022
Citations: 4

Abstract

Uncertainty quantification is one of the central challenges for machine learning in real-world applications. In reinforcement learning, an agent confronts two kinds of uncertainty, called epistemic uncertainty and aleatoric uncertainty. Disentangling and evaluating these uncertainties simultaneously stands a chance of improving the agent's final performance, accelerating training, and facilitating quality assurance after deployment. In this work, we propose an uncertainty-aware reinforcement learning algorithm for continuous control tasks that extends the Deep Deterministic Policy Gradient algorithm (DDPG). It exploits epistemic uncertainty to accelerate exploration and aleatoric uncertainty to learn a risk-sensitive policy. We conduct numerical experiments showing that our variant of DDPG outperforms vanilla DDPG without uncertainty estimation in benchmark tasks on robotic control and power-grid optimization.
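
As a rough illustration of the two kinds of uncertainty named in the abstract, the sketch below assumes an ensemble of distributional critics, each of which outputs a set of return quantiles for one state-action pair. This is not the authors' implementation: the function names, the variance-based decomposition, the CVaR risk measure, and the `alpha` and `beta` parameters are all assumptions chosen for illustration. Disagreement between critics stands in for epistemic uncertainty, and the spread within each critic's predicted distribution stands in for aleatoric uncertainty.

```python
import numpy as np


def decompose_uncertainty(quantiles):
    """Split uncertainty for one state-action pair (illustrative only).

    quantiles: array of shape (n_critics, n_quantiles) holding each
    critic's predicted return quantiles. Hypothetical input format.
    """
    quantiles = np.asarray(quantiles, dtype=float)
    per_critic_mean = quantiles.mean(axis=1)
    # Epistemic: how much the critics disagree about the expected return.
    epistemic = per_critic_mean.var()
    # Aleatoric: average spread inside each critic's return distribution.
    aleatoric = quantiles.var(axis=1).mean()
    return epistemic, aleatoric


def cvar(quantiles, alpha=0.25):
    """Risk-sensitive value: mean of the lowest alpha-fraction of the
    pooled quantiles (an assumed risk measure, not taken from the paper)."""
    pooled = np.sort(np.asarray(quantiles, dtype=float).ravel())
    k = max(1, int(np.ceil(alpha * pooled.size)))
    return pooled[:k].mean()


def exploration_score(quantiles, beta=1.0):
    """Optimistic score: mean return plus a bonus that grows with
    epistemic uncertainty (beta is a hypothetical weight)."""
    epistemic, _ = decompose_uncertainty(quantiles)
    return np.asarray(quantiles, dtype=float).mean() + beta * np.sqrt(epistemic)


# Two critics, two quantiles each.
example = [[0.0, 2.0], [2.0, 4.0]]
epistemic, aleatoric = decompose_uncertainty(example)
```

In a scheme like this, an actor could be trained against `cvar` to prefer actions with a better worst-case return, while `exploration_score` could guide data collection toward actions the ensemble disagrees on. How the paper combines these signals is not stated in the abstract.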

Keywords

Reinforcement learning, Benchmark (surveying), Uncertainty quantification, Computer science, Control (management), Exploit, Artificial intelligence, Measurement uncertainty, Grid, Software deployment
