Policy learning

Related papers: 20

Top Cited Papers

A Survey on Policy Search for Robotics

Marc Peter Deisenroth

Citations: 684 • 2011

Data-Efficient Hierarchical Reinforcement Learning

Ofir Nachum, Shixiang Gu, Honglak Lee, Sergey Levine

Citations: 265 • 2018

Transfer via inter-task mappings in policy search reinforcement learning

Matthew E. Taylor, Shimon Whiteson, Peter Stone

Citations: 133 • 2007

Multi-task policy search for robotics

Marc Peter Deisenroth, Péter Englert, Jan Peters, Dieter Fox

Citations: 121 • 2014

Effect of human guidance and state space size on Interactive Reinforcement Learning

Halit Bener Suay, Sonia Chernova

Citations: 118 • 2011

Interactive Learning from Policy-Dependent Human Feedback

James MacGlashan, Mark K. Ho, Robert Loftin, Bei Peng, Guan Wang, David L. Roberts, Matthew E. Taylor, Michael L. Littman

Citations: 108 • 2017

Preference-Based Policy Learning

Riad Akrour, Marc Schoenauer, Michèle Sébag

Citations: 83 • 2011

Explanation-Based Reward Coaching to Improve Human Performance via Reinforcement Learning

Aaquib Tabrez, Shivendra Agrawal, Bradley Hayes

Citations: 62 • 2019

Stochastic Abstract Policies: Generalizing Knowledge to Improve Reinforcement Learning

Marcelo Li Koga, Valdinei Freire, Anna Helena Reali Costa

Citations: 45 • 2014

A residual reinforcement learning method for robotic assembly using visual and force information

Zhuangzhuang Zhang, Yizhao Wang, Zhinan Zhang, Lihui Wang, Huang Huang, Qixin Cao

Citations: 40 • 2023

Any-point Trajectory Modeling for Policy Learning

Xingyu Lin, John So, Kai Chen, Qi Dou, Yang Gao, Pieter Abbeel

Citations: 40 • 2024

Reinforcement Learning for Pivoting Task

Rika Antonova, Silvia Cruciani, Christian Smith, Danica Kragić

Citations: 36 • 2017

Affordance Learning from Play for Sample-Efficient Policy Learning

Jessica Borja-Diaz, Oier Mees, Gabriel Kalweit, Lukás Hermann, Joschka Boedecker, Wolfram Burgard

Citations: 29 • 2022

Velocity adaptation for self-improvement of skills learned from user demonstrations

Bojan Nemec, Andrej Gams, Aleš Ude

Citations: 28 • 2013

GAPLE: Generalizable Approaching Policy LEarning for Robotic Object Searching in Indoor Environment

Xin Ye, Zhe Lin, Joon‐Young Lee, Jianming Zhang, Shibin Zheng, Yezhou Yang

Citations: 26 • 2019

Transfer Learning for Policy Search Methods

Shimon Whiteson

Citations: 25 • 2006

Sample and time efficient policy learning with CMA-ES and Bayesian Optimisation

Léni K. Le Goff, Edgar Buchanan, Emma Hart, A. E. Eiben, Wei Li, Matteo De Carlo, Matthew F. Hale, Mike Angus, Robert Woolley, Jon Timmis, Alan Winfield, Andrew M. Tyrrell

Citations: 20 • 2020

Learning policies for attentional control

Luiz Marcos Garcia Gonçalves, Gilson A. Giraldi, Antonio A. F. Oliveira, Roderic A. Grupen

Citations: 15 • 2003

Interaction-Aware Multi-Agent Reinforcement Learning for Mobile Agents with Individual Goals

Anahita Mohseni-Kabir, David Isele, Kikuo Fujimura

Citations: 14 • 2019

Learning Environmental Calibration Actions for Policy Self-Evolution

Chao Zhang, Yang Yu, Zhi‐Hua Zhou

Citations: 13 • 2018