首页 /研究 /Novelty-based Sample Reuse for Continuous Robotics Control
LEARNING

Novelty-based Sample Reuse for Continuous Robotics Control

Ke Duan, Kai Yang, Houde Liu, Xueqian Wang

发表年份
2024
访问权限
开放获取

摘要

In reinforcement learning, agents collect state information and rewards through environmental interactions, essential for policy refinement. This process is notably time-consuming, especially in complex robotic simulations and real-world applications. Traditional algorithms usually re-engage with the environment after processing a single batch of samples, thereby failing to fully capitalize on historical data. However, frequently observed states, with reliable value estimates, require minimal updates; in contrast, rare observed states necessitate more intensive updates for achieving accurate value estimations. To address uneven sample utilization, we propose Novelty-guided Sample Reuse (NSR). NSR provides extra updates for infrequent, novel states and skips additional updates for frequent states, maximizing sample use before interacting with the environment again. Our experiments show that NSR improves the convergence rate and success rate of algorithms without significantly increasing time consumption. Our code is publicly available at https://github.com/ppksigs/NSR-DDPG-HER.

关键词

cs.ROcs.LG

相关论文

查看 LEARNING 分类全部论文