Bellman equation
Related papers: 20
Top Researchers
Top Cited Papers
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman, Philipp Moritz, Sergey Levine, Michael I. Jordan, Pieter Abbeel
Citations: 1750 • 2015
Natural Actor-Critic
Jan Peters, Stefan Schaal
Citations: 751 • 2008
A Generalized Path Integral Control Approach to Reinforcement Learning
Evangelos A. Theodorou, Jonas Buchli, Stefan Schaal
Citations: 449 • 2010
Anytime Point-Based Approximations for Large POMDPs
Joëlle Pineau, Geoff Gordon, Sebastian Thrun
Citations: 373 • 2006
Natural Actor-Critic
Jan Peters, Sethu Vijayakumar, Stefan Schaal
Citations: 312 • 2005
Motion planning under uncertainty using iterative local optimization in belief space
Jur van den Berg, Sachin Patil, Ron Alterovitz
Citations: 305 • 2012
Finding Approximate POMDP solutions Through Belief Compression
Nicholas Roy, Geoffrey J. Gordon, Sebastian Thrun
Citations: 253 • 2005
Point-Based Value Iteration for Continuous POMDPs
Josep M. Porta, Nikos Vlassis, Matthijs T. J. Spaan, Pascal Poupart
Citations: 246 • 2006
Model-Free reinforcement learning with continuous action in practice
Thomas Degris, Patrick M. Pilarski, Richard S. Sutton
Citations: 234 • 2012
Planning in the Presence of Cost Functions Controlled by an Adversary
H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum
Citations: 228 • 2018
Practical Reinforcement Learning in Continuous Spaces
William D. Smart, Leslie Pack Kaelbling
Citations: 211 • 2000
An efficiently solvable quadratic program for stabilizing dynamic locomotion
Citations: 183 • 2014
Inverse Optimal Control with Linearly-Solvable MDPs
Krishnamurthy Dvijotham, Emanuel Todorov
Citations: 119 • 2010
Bisimulation Metrics for Continuous Markov Decision Processes
Norm Ferns, Prakash Panangaden, Doina Precup
Citations: 100 • 2011
Learning to Drive a Real Car in 20 Minutes
Martin Riedmiller, Mike Montemerlo, Hendrik Dahlkamp
Citations: 92 • 2007
Parametric POMDPs for planning in continuous state spaces
Alex Brooks, Alexei Makarenko, Stefan B. Williams, Hugh Durrant‐Whyte
Citations: 90 • 2006
Safety Augmented Value Estimation From Demonstrations (SAVED): Safe Deep Model-Based RL for Sparse Cost Robotic Tasks
Brijen Thananjeyan, Ashwin Balakrishna, Ugo Rosolia, Felix Li, Rowan McAllister, Joseph E. Gonzalez, Sergey Levine, Francesco Borrelli
Citations: 90 • 2020
Bridging Hamilton-Jacobi Safety Analysis and Reinforcement Learning
Jaime F. Fisac, Neil F. Lugovoy, Vicenç Rubies-Royo, Shromona Ghosh, Claire J. Tomlin
Citations: 85 • 2019
Jumping over obstacles with MIT Cheetah 2
Hae-Won Park, Patrick M. Wensing, Sangbae Kim
Citations: 77 • 2020
A Deep Q-network (DQN) Based Path Planning Method for Mobile Robots
Siyu Zhou, Xin Liu, Yingfu Xu, Jifeng Guo
Citations: 75 • 2018