Thompson sampling

相关论文数: 11

最高引用论文

Batched Gaussian Process Bandit Optimization via Determinantal Point\n Processes

Tarun Kathuria, Amit Deshpande, Pushmeet Kohli

引用数: 39 • 2016

Constrained stochastic optimal control with learned importance sampling: A path integral approach

Jan Carius, René Ranftl, Farbod Farshidian, Marco Hutter

引用数: 24 • 2021

Regret bounds for meta Bayesian optimization with an unknown Gaussian\n process prior

Zi Wang, Beomjoon Kim, Leslie Pack Kaelbling

引用数: 24 • 2018

Off-Policy Evaluation via Off-Policy Classification

Alexander Irpan, Kanishka Rao, Konstantinos Bousmalis, C.J. Harris, Julian Ibarz, Sergey Levine

引用数: 15 • 2019

Off-Policy Evaluation via Off-Policy Classification

Alex Irpan, Kanishka Rao, Konstantinos Bousmalis, C.J. Harris, Julian Ibarz, Sergey Levine

引用数: 13 • 2019

Intelligent mapping for autonomous robotic survey

David Wettergreen, David R. Thompson

引用数: 8 • 2008

Contributions to Multi-Armed Bandits : Risk-Awareness and Sub-Sampling for Linear Contextual Bandits

Nicolas Galichet

引用数: 8 • 2015

Multi-Armed Bandit Algorithms for Spare Time Planning of a Mobile Service Robot

Max Korein, Manuela Veloso

引用数: 4 • 2018

Demand-Aware Multi-Robot Task Scheduling with Mixed Reality Simulation

Ajay Kumar Sandula, Arushi Khokhar, Debasish Ghose, Pradipta Biswas

引用数: 2 • 2023

Multi-Armed Bandit Algorithms for a Mobile Service Robot's Spare Time in a Structured Environment

Max Korein, Manuela Veloso

引用数: 2 • 2018

USHER: Unbiased Sampling for Hindsight Experience Replay

Liam Schramm, Yunfu Deng, Edgar Granados, Abdeslam Boularias

引用数: 2 • 2022