Adaptive Variance for Changing Sparse-Reward Environments

Xingyu Lin, Pengsheng Guo, Carlos Florensa, David Held

Year: 2019
Access: Open access

Abstract

Robots that are trained to perform a task in a fixed environment often fail when facing unexpected changes to the environment due to a lack of exploration. We propose a principled way to adapt the policy for better exploration in changing sparse-reward environments. Unlike previous works which explicitly model environmental changes, we analyze the relationship between the value function and the optimal exploration for a Gaussian-parameterized policy and show that our theory leads to an effective strategy for adjusting the variance of the policy, enabling fast adapt to changes in a variety of sparse-reward environments.

Keywords

cs.ROcs.AI

Adaptive Variance for Changing Sparse-Reward Environments

Abstract

Keywords

Related papers

A dual-loop framework for manufacturability-aware topology optimization of electric vehicle structures via wire arc additive manufacturing

Geometric digital twin: A digital and intelligent model for aero-engine assembly accuracy prediction

Revolutionizing Industries Through AI-Driven Robotics

Design and dynamic performance prediction of a novel large-aperture offset-feed deployable antenna