Thompson sampling
相关论文数: 11
顶级研究者
最高引用论文
Batched Gaussian Process Bandit Optimization via Determinantal Point\n Processes
Tarun Kathuria, Amit Deshpande, Pushmeet Kohli
引用数: 39 • 2016
Constrained stochastic optimal control with learned importance sampling: A path integral approach
Jan Carius, René Ranftl, Farbod Farshidian, Marco Hutter
引用数: 24 • 2021
Regret bounds for meta Bayesian optimization with an unknown Gaussian\n process prior
Zi Wang, Beomjoon Kim, Leslie Pack Kaelbling
引用数: 24 • 2018
Off-Policy Evaluation via Off-Policy Classification
Alexander Irpan, Kanishka Rao, Konstantinos Bousmalis, C.J. Harris, Julian Ibarz, Sergey Levine
引用数: 15 • 2019
Off-Policy Evaluation via Off-Policy Classification
Alex Irpan, Kanishka Rao, Konstantinos Bousmalis, C.J. Harris, Julian Ibarz, Sergey Levine
引用数: 13 • 2019
Intelligent mapping for autonomous robotic survey
David Wettergreen, David R. Thompson
引用数: 8 • 2008
Contributions to Multi-Armed Bandits : Risk-Awareness and Sub-Sampling for Linear Contextual Bandits
Nicolas Galichet
引用数: 8 • 2015
Multi-Armed Bandit Algorithms for Spare Time Planning of a Mobile Service Robot
Max Korein, Manuela Veloso
引用数: 4 • 2018
Demand-Aware Multi-Robot Task Scheduling with Mixed Reality Simulation
Ajay Kumar Sandula, Arushi Khokhar, Debasish Ghose, Pradipta Biswas
引用数: 2 • 2023
Multi-Armed Bandit Algorithms for a Mobile Service Robot's Spare Time in a Structured Environment
Max Korein, Manuela Veloso
引用数: 2 • 2018
USHER: Unbiased Sampling for Hindsight Experience Replay
Liam Schramm, Yunfu Deng, Edgar Granados, Abdeslam Boularias
引用数: 2 • 2022