MANIPULATION

Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning

Dhruv Shah, Peng Xu, Yao Lu, Ted Xiao, Alexander Toshev, Sergey Levine, Brian Ichter

Year of publication: 2021
Access: Open access

Abstract

Reinforcement learning can train policies that effectively perform complex tasks. However, for long-horizon tasks, the performance of these methods degrades with horizon, often necessitating reasoning over and chaining lower-level skills. Hierarchical reinforcement learning aims to enable this by providing a bank of low-level skills as action abstractions. Hierarchies can further improve on this by abstracting the state space as well. We posit that a suitable state abstraction should depend on the capabilities of the available lower-level policies. We propose Value Function Spaces: a simple approach that produces such a representation by using the value functions corresponding to each lower-level skill. These value functions capture the affordances of the scene, thus forming a representation that compactly abstracts task-relevant information and robustly ignores distractors. Empirical evaluations for maze-solving and robotic manipulation tasks demonstrate that our approach improves long-horizon performance and enables better zero-shot generalization than alternative model-free and model-based methods.
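The core idea in the abstract, representing a state by the values that each low-level skill assigns to it, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `vfs_embedding`, the two toy skills, and the hand-written value functions are all hypothetical, and in practice the value functions would be learned alongside the skills rather than written by hand.

```python
from typing import Callable, List, Sequence

State = Sequence[float]
ValueFn = Callable[[State], float]


def vfs_embedding(state: State, skill_value_fns: Sequence[ValueFn]) -> List[float]:
    """Stack one value estimate per skill into a single vector.

    The result has one entry per skill, regardless of how large the raw
    state is. (Hypothetical helper, for illustration only.)
    """
    return [float(value_fn(state)) for value_fn in skill_value_fns]


def _clip01(x: float) -> float:
    """Clamp a number to the interval [0, 1]."""
    return max(0.0, min(1.0, x))


# Toy stand-ins for learned value functions (assumption: values in [0, 1],
# read here as "how promising is this skill from this state").
def value_reach_right(state: State) -> float:
    return _clip01(state[0])  # depends only on the x coordinate


def value_reach_top(state: State) -> float:
    return _clip01(state[1])  # depends only on the y coordinate


SKILL_VALUE_FNS = [value_reach_right, value_reach_top]

# Two raw states that differ only in a third, task-irrelevant entry.
z_a = vfs_embedding([0.25, 0.75, 0.0], SKILL_VALUE_FNS)
z_b = vfs_embedding([0.25, 0.75, 99.0], SKILL_VALUE_FNS)
```

In this toy setup the third state entry plays the role of a distractor: no skill's value depends on it, so `z_a` and `z_b` coincide. A higher-level planner or policy would then operate on these short vectors instead of the raw state.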

Keywords

cs.LG, cs.AI, cs.RO
