Markov decision process
相关论文数: 20
顶级研究者
最高引用论文
Point-based value iteration: an anytime algorithm for POMDPs
Joëlle Pineau, Geoff Gordon, Sebastian Thrun
引用数: 934 • 2003
Reinforcement learning for robots using neural networks
Long-Ji Lin
引用数: 887 • 1992
SARSOP: Efficient Point-Based POMDP Planning by Approximating Optimally Reachable Belief Spaces
Hanna Kurniawati, David Hsu, Wee Sun Lee
引用数: 782 • 2008
Learning to Track: Online Multi-object Tracking by Decision Making
Xiang Yu, Alexandre Alahi, Silvio Savarese
引用数: 716 • 2015
Learning policies for partially observable environments: Scaling up
Michael L. Littman, Anthony R. Cassandra, Leslie Pack Kaelbling
引用数: 662 • 1995
Acting under uncertainty: discrete Bayesian models for mobile-robot navigation
Anthony R. Cassandra, Leslie Pack Kaelbling, James Kurien
引用数: 468 • 2002
Anytime Point-Based Approximations for Large POMDPs
Joëlle Pineau, Geoff Gordon, Sebastian Thrun
引用数: 373 • 2006
Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving
Shai Shalev‐Shwartz, Shaked Shammah, Amnon Shashua
引用数: 367 • 2016
Intention-aware online POMDP planning for autonomous driving in a crowd
Haoyu Bai, Shaojun Cai, Nan Ye, David Hsu, Wee Sun Lee
引用数: 331 • 2015
Motion planning under uncertainty using iterative local optimization in belief space
Jur van den Berg, Sachin Patil, Ron Alterovitz
引用数: 305 • 2012
Autonomous helicopter control using reinforcement learning policy search methods
J. Andrew Bagnell, Jeff Schneider
引用数: 278 • 2002
Finding Approximate POMDP solutions Through Belief Compression
Nicholas Roy, Geoffrey J. Gordon, Sebastian Thrun
引用数: 253 • 2005
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment
Haoran Li, Qichao Zhang, Dongbin Zhao
引用数: 247 • 2019
Temporal abstraction in reinforcement learning
Doina Precup, Richard S. Sutton
引用数: 247 • 2000
Point-Based Value Iteration for Continuous POMDPs
Josep M. Porta, Nikos Vlassis, Matthijs T. J. Spaan, Pascal Poupart
引用数: 246 • 2006
Parameter-exploring policy gradients
Frank Sehnke, Christian Osendorfer, Thomas Rückstieß, Alex Graves, Jan Peters, Jürgen Schmidhuber
引用数: 245 • 2009
A Gentle Introduction to Reinforcement Learning and its Application in Different Fields
Muddasar Naeem, Syed Tahir Hussain Rizvi, Antonio Coronato
引用数: 241 • 2020
Planning under Uncertainty for Robotic Tasks with Mixed Observability
Sylvie C. W. Ong, Shao Wei Png, David Hsu, Wee Sun Lee
引用数: 238 • 2010
The Stochastic Motion Roadmap: A Sampling Framework for Planning with Markov Motion Uncertainty
Ron Alterovitz, Thierry Siméon, Ken Goldberg
引用数: 236 • 2007
Planning in the Presence of Cost Functions Controlled by an Adversary
H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum
引用数: 228 • 2018