LEARNING
DOP: Deep Optimistic Planning with Approximate Value Function Evaluation
Francesco Riccio, Roberto Capobianco, Daniele Nardi
- 发表年份
- 2018
- 引用次数
- 4
摘要
Research on reinforcement learning has demonstrated promising results in manifold applications and domains. Still, efficiently learning effective robot behaviors is very difficult, due to unstructured scenarios, high uncertainties, and large state dimensionality (e.g. multi-agent systems or hyper-redundant robots). To alleviate this problem, we present DOP, a deep model-based reinforcement learning algorithm, that attacks the curse of dimensionality and reduces the computational demand of the planning process while achieving good performance.
关键词
Reinforcement learningComputer scienceMonte Carlo tree searchCurse of dimensionalityArtificial intelligenceRobotExploitState spaceBellman equationTask (project management)
相关论文
OTHER
📊 26,957 引用
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
PERCEPTION
📊 22,245 引用
Artificial intelligence: a modern approach
1995
OTHER
📊 18,993 引用
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
SWARM
📊 14,853 引用
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002