LEARNING

UNICON: Uncertainty-Conditioned Policy for Robust Behavior in Unfamiliar Scenarios

Chan Kim, Jaekyung Cho, Hyung-Suk Yoon, Seung-Woo Seo, Seong-Woo Kim

Published: 2022
Citations: 3

Abstract

Deep reinforcement learning has been used to solve complex tasks in various fields, particularly in robotic control. However, agents trained with deep reinforcement learning tend to take overconfident actions, even when the input state is far from the state distribution seen during training. This limits the use of deep reinforcement learning in real-world environments, because overconfident actions in unlearned situations can lead to catastrophic events, such as the collision of an autonomous vehicle. To address this, agents should know “what they do not know” and choose actions by considering not only the state but also its uncertainty. In this study, we propose a novel uncertainty-conditioned policy (UNICON), inspired by the human behavior of changing policies according to uncertainty, e.g., slowing a car on a narrow road that has never been visited before. Our experimental results demonstrate that the proposed method is robust to unfamiliar scenarios not seen during training.
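The idea of conditioning an action on both the state and its uncertainty can be illustrated with a small sketch. This is not the paper's method: the abstract does not say how uncertainty is estimated or how the policy is trained. The sketch assumes, purely for illustration, that uncertainty is measured as disagreement among an ensemble of predictions, and it replaces the learned policy with a hand-written rule that lowers speed as uncertainty grows. All names (`ensemble_uncertainty`, `uncertainty_conditioned_speed`, `sensitivity`) and all numbers are hypothetical.

```python
import statistics


def ensemble_uncertainty(predictions):
    """Assumed uncertainty estimate: spread (population std. dev.) of the
    predictions made by several independently trained models."""
    return statistics.pstdev(predictions)


def uncertainty_conditioned_speed(nominal_speed, uncertainty, sensitivity=2.0):
    """Hand-written stand-in for a policy that takes (state, uncertainty).

    The commanded speed shrinks as uncertainty grows:
        speed = nominal_speed / (1 + sensitivity * uncertainty)
    A learned uncertainty-conditioned policy would replace this rule.
    """
    if uncertainty < 0:
        raise ValueError("uncertainty must be non-negative")
    return nominal_speed / (1.0 + sensitivity * uncertainty)


# Hypothetical ensemble outputs for a familiar and an unfamiliar state.
familiar_predictions = [0.50, 0.51, 0.49]    # members agree
unfamiliar_predictions = [0.10, 0.90, 0.50]  # members disagree

u_familiar = ensemble_uncertainty(familiar_predictions)
u_unfamiliar = ensemble_uncertainty(unfamiliar_predictions)

speed_familiar = uncertainty_conditioned_speed(10.0, u_familiar)
speed_unfamiliar = uncertainty_conditioned_speed(10.0, u_unfamiliar)

print(f"familiar:   uncertainty={u_familiar:.3f}, speed={speed_familiar:.2f}")
print(f"unfamiliar: uncertainty={u_unfamiliar:.3f}, speed={speed_unfamiliar:.2f}")
```

In this toy setting the unfamiliar state produces larger ensemble disagreement and therefore a lower commanded speed, which mirrors the "slow down on an unvisited narrow road" intuition from the abstract.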

Keywords

Reinforcement learning, Artificial intelligence, Action (physics), Computer science, State (computer science), Reinforcement, Control (management), Robotics, Machine learning, Robot
