Trust region

Related papers: 20

Most cited papers

Trust Region Policy Optimization

John Schulman, Sergey Levine, Philipp Moritz, Michael I. Jordan, Pieter Abbeel

Citations: 3141 • 2015

Guided Policy Search via Approximate Mirror Descent

William Montgomery, Sergey Levine

Citations: 83 • 2016

An incremental trust-region method for robust online sparse least-squares estimation

David M. Rosen, Michael Kaess, John J. Leonard

Citations: 73 • 2012

A hybrid conjugate gradient based approach for solving unconstrained optimization and motion control problems

Auwal Bala Abubakar, Poom Kumam, Maulana Malik, Abdulkarim Hassan Ibrahim

Citations: 51 • 2021

Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies

Kaiqing Zhang, Alec Koppel, Hao Zhu, Tamer Başar

Citations: 44 • 2019

A new trust region–sequential quadratic programming approach for nonlinear systems based on nonlinear model predictive control

Zhongbo Sun, Yifang Sun, Y. Li, K.P. Liu

Citations: 41 • 2018

Optimizing Expectations: From Deep Reinforcement Learning to Stochastic Computation Graphs

John Schulman

Citations: 37 • 2016

TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning

Dohyeong Kim, Songhwai Oh

Citations: 21 • 2022

Reinforcement Learning for UAV Attitude Control

William R. Koch, Renato Mancuso, Richard West, Azer Bestavros

Citations: 19 • 2019

Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value At Risk

Dohyeong Kim, Songhwai Oh

Citations: 17 • 2022

A Differentiable Augmented Lagrangian Method for Bilevel Nonlinear Optimization

Benoit Landry, Zachary Manchester, Marco Pavone

Citations: 16 • 2019

Guided Policy Search as Approximate Mirror Descent

William Montgomery, Sergey Levine

Citations: 15 • 2016

Stochastic Variance Reduction for Policy Gradient Estimation

Tian-Bing Xu, Qiang Liu, Jian Peng

Citations: 10 • 2017

Trust dampening and trust promoting: A dual-pathway of trust calibration in human-robot interaction

Xinyu Huang, Ye Li

Citations: 7 • 2024

Two steps natural actor critic learning for underwater cable tracking

Andrés El-Fakdi, Marc Carreras, Enric Galceran

Citations: 7 • 2010

Bayesian Optimization Based Trust Model for Human Multi-robot Collaborative Motion Tasks in Offroad Environments

Huanfei Zheng, Jonathon M. Smereka, Dariusz Mikulski, Yue Wang

Citations: 6 • 2023

Smoothing Policies and Safe Policy Gradients

Matteo Papini, Matteo Pirotta, Marcello Restelli

Citations: 6 • 2019

Hindsight Trust Region Policy Optimization

Hanbo Zhang, Xuguang Lan, David Hsu, Nanning Zheng

Citations: 5 • 2021

A Fast and Robust Algorithm for General Inequality/Equality Constrained Minimum-Time Problems

B.J. Driessen, Nader Sadegh, Gordon G. Parker, G. Richard Eisler

Citations: 5 • 1999

Deep Black-Box Reinforcement Learning with Movement Primitives

Fabian Otto, Onur Çelik, Hongyi Zhou, Hanna Ziesche, Ngo Anh Vien, Gerhard Neumann

Citations: 5 • 2022