DOP: Deep Optimistic Planning with Approximate Value Function Evaluation

Francesco Riccio, Roberto Capobianco, Daniele Nardi

发表年份: 2018
引用次数: 4

摘要

Research on reinforcement learning has demonstrated promising results in manifold applications and domains. Still, efficiently learning effective robot behaviors is very difficult, due to unstructured scenarios, high uncertainties, and large state dimensionality (e.g. multi-agent systems or hyper-redundant robots). To alleviate this problem, we present DOP, a deep model-based reinforcement learning algorithm, that attacks the curse of dimensionality and reduces the computational demand of the planning process while achieving good performance.

关键词

Reinforcement learningComputer scienceMonte Carlo tree searchCurse of dimensionalityArtificial intelligenceRobotExploitState spaceBellman equationTask (project management)

DOP: Deep Optimistic Planning with Approximate Value Function Evaluation

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory