Bellman equation

Related papers: 20

Top Cited Papers

High-Dimensional Continuous Control Using Generalized Advantage Estimation

John Schulman, Philipp Moritz, Sergey Levine, Michael I. Jordan, Pieter Abbeel

Citations: 1750 • 2015

Natural Actor-Critic

Jan Peters, Stefan Schaal

Citations: 751 • 2008

A Generalized Path Integral Control Approach to Reinforcement Learning

Evangelos A. Theodorou, Jonas Buchli, Stefan Schaal

Citations: 449 • 2010

Anytime Point-Based Approximations for Large POMDPs

Joëlle Pineau, Geoff Gordon, Sebastian Thrun

Citations: 373 • 2006

Natural Actor-Critic

Jan Peters, Sethu Vijayakumar, Stefan Schaal

Citations: 312 • 2005

Motion planning under uncertainty using iterative local optimization in belief space

Jur van den Berg, Sachin Patil, Ron Alterovitz

Citations: 305 • 2012

Finding Approximate POMDP solutions Through Belief Compression

Nicholas Roy, Geoffrey J. Gordon, Sebastian Thrun

Citations: 253 • 2005

Point-Based Value Iteration for Continuous POMDPs

Josep M. Porta, Nikos Vlassis, Matthijs T. J. Spaan, Pascal Poupart

Citations: 246 • 2006

Model-Free reinforcement learning with continuous action in practice

Thomas Degris, Patrick M. Pilarski, Richard S. Sutton

Citations: 234 • 2012

Planning in the Presence of Cost Functions Controlled by an Adversary

H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum

Citations: 228 • 2018

Practical Reinforcement Learning in Continuous Spaces

William D. Smart, Leslie Pack Kaelbling

Citations: 211 • 2000

An efficiently solvable quadratic program for stabilizing dynamic locomotion

Citations: 183 • 2014

Inverse Optimal Control with Linearly-Solvable MDPs

Krishnamurthy Dvijotham, Emanuel Todorov

Citations: 119 • 2010

Bisimulation Metrics for Continuous Markov Decision Processes

Norm Ferns, Prakash Panangaden, Doina Precup

Citations: 100 • 2011

Learning to Drive a Real Car in 20 Minutes

Martin Riedmiller, Mike Montemerlo, Hendrik Dahlkamp

Citations: 92 • 2007

Parametric POMDPs for planning in continuous state spaces

Alex Brooks, Alexei Makarenko, Stefan B. Williams, Hugh Durrant‐Whyte

Citations: 90 • 2006

Safety Augmented Value Estimation From Demonstrations (SAVED): Safe Deep Model-Based RL for Sparse Cost Robotic Tasks

Brijen Thananjeyan, Ashwin Balakrishna, Ugo Rosolia, Felix Li, Rowan McAllister, Joseph E. Gonzalez, Sergey Levine, Francesco Borrelli

Citations: 90 • 2020

Bridging Hamilton-Jacobi Safety Analysis and Reinforcement Learning

Jaime F. Fisac, Neil F. Lugovoy, Vicenç Rubies-Royo, Shromona Ghosh, Claire J. Tomlin

Citations: 85 • 2019

Jumping over obstacles with MIT Cheetah 2

Hae-Won Park, Patrick M. Wensing, Sangbae Kim

Citations: 77 • 2020

A Deep Q-network (DQN) Based Path Planning Method for Mobile Robots

Siyu Zhou, Xin Liu, Yingfu Xu, Jifeng Guo

Citations: 75 • 2018