Markov decision process

顶级研究者

Sebastian Thrun

研究机构: —

Richard M. Murray

研究机构: —

Marco Dorigo

研究机构: —

Lydia E. Kavraki

研究机构: —

Daniela Rus

研究机构: —

Steven M. LaValle

研究机构: —

Marc Peter Deisenroth

研究机构: —

Hugh Durrant‐Whyte

研究机构: —

Wolfram Burgard

研究机构: —

Seth Hutchinson

研究机构: —

顶尖机构

Carnegie Mellon UniversityUS39 篇论文 Massachusetts Institute of TechnologyUS24 篇论文 Boston UniversityUS17 篇论文 National University of SingaporeSG13 篇论文 The University of Texas at AustinUS13 篇论文 Centre National de la Recherche ScientifiqueFR11 篇论文 University of OxfordGB11 篇论文 Stanford UniversityUS10 篇论文

最高引用论文

Point-based value iteration: an anytime algorithm for POMDPs

Joëlle Pineau, Geoff Gordon, Sebastian Thrun

引用数: 934 • 2003

Reinforcement learning for robots using neural networks

Long-Ji Lin

引用数: 887 • 1992

SARSOP: Efficient Point-Based POMDP Planning by Approximating Optimally Reachable Belief Spaces

Hanna Kurniawati, David Hsu, Wee Sun Lee

引用数: 782 • 2008

Learning to Track: Online Multi-object Tracking by Decision Making

Xiang Yu, Alexandre Alahi, Silvio Savarese

引用数: 716 • 2015

Learning policies for partially observable environments: Scaling up

Michael L. Littman, Anthony R. Cassandra, Leslie Pack Kaelbling

引用数: 662 • 1995

Acting under uncertainty: discrete Bayesian models for mobile-robot navigation

Anthony R. Cassandra, Leslie Pack Kaelbling, James Kurien

引用数: 468 • 2002

Anytime Point-Based Approximations for Large POMDPs

Joëlle Pineau, Geoff Gordon, Sebastian Thrun

引用数: 373 • 2006

Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving

Shai Shalev‐Shwartz, Shaked Shammah, Amnon Shashua

引用数: 367 • 2016

Intention-aware online POMDP planning for autonomous driving in a crowd

Haoyu Bai, Shaojun Cai, Nan Ye, David Hsu, Wee Sun Lee

引用数: 331 • 2015

Motion planning under uncertainty using iterative local optimization in belief space

Jur van den Berg, Sachin Patil, Ron Alterovitz

引用数: 305 • 2012

Autonomous helicopter control using reinforcement learning policy search methods

J. Andrew Bagnell, Jeff Schneider

引用数: 278 • 2002

Finding Approximate POMDP solutions Through Belief Compression

Nicholas Roy, Geoffrey J. Gordon, Sebastian Thrun

引用数: 253 • 2005

Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment

Haoran Li, Qichao Zhang, Dongbin Zhao

引用数: 247 • 2019

Temporal abstraction in reinforcement learning

Doina Precup, Richard S. Sutton

引用数: 247 • 2000

Point-Based Value Iteration for Continuous POMDPs

Josep M. Porta, Nikos Vlassis, Matthijs T. J. Spaan, Pascal Poupart

引用数: 246 • 2006

Parameter-exploring policy gradients

Frank Sehnke, Christian Osendorfer, Thomas Rückstieß, Alex Graves, Jan Peters, Jürgen Schmidhuber

引用数: 245 • 2009

A Gentle Introduction to Reinforcement Learning and its Application in Different Fields

Muddasar Naeem, Syed Tahir Hussain Rizvi, Antonio Coronato

引用数: 241 • 2020

Planning under Uncertainty for Robotic Tasks with Mixed Observability

Sylvie C. W. Ong, Shao Wei Png, David Hsu, Wee Sun Lee

引用数: 238 • 2010

The Stochastic Motion Roadmap: A Sampling Framework for Planning with Markov Motion Uncertainty

Ron Alterovitz, Thierry Siméon, Ken Goldberg

引用数: 236 • 2007

Planning in the Presence of Cost Functions Controlled by an Adversary

H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum

引用数: 228 • 2018

Markov decision process

顶级研究者

顶尖机构

最高引用论文

相关技术