

Multifidelity Reinforcement Learning With Gaussian Processes: Model-Based and Model-Free Algorithms

Varun Suryan, Nahush Gondhalekar, Pratap Tokekar

Year of publication: 2020
Citations: 11

Abstract

We study the problem of reinforcement learning (RL) using as few real-world samples as possible. A naive application of RL can be inefficient in large and continuous state spaces. We present two versions of multifidelity RL (MFRL), model-based and model-free, that leverage Gaussian processes (GPs) to learn the optimal policy in a real-world environment. In the MFRL framework, an agent uses multiple simulators of the real environment to perform actions. With increasing fidelity in a simulator chain, the number of samples used in successively higher simulators can be reduced. By incorporating GPs in the MFRL framework, we empirically observe up to a 40% reduction in the number of samples for model-based RL and a 60% reduction for the model-free version. We examine the performance of our algorithms through simulations and real-world experiments for navigation with a ground robot.
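
The abstract does not say how the GPs decide when fewer higher-fidelity samples are needed. The sketch below is one plausible illustration and not the authors' algorithm. It fits a GP to samples from a low-fidelity simulator and uses the posterior variance to flag inputs that would still need data from a higher-fidelity level. The kernel, the hyperparameters, the `needs_more_samples` rule, and its threshold are all assumptions made for illustration.

```python
import numpy as np

def rbf_kernel(a, b, length_scale=1.0, signal_var=1.0):
    """Squared-exponential kernel between two sets of 1-D inputs."""
    sq_dist = (a[:, None] - b[None, :]) ** 2
    return signal_var * np.exp(-0.5 * sq_dist / length_scale**2)

def gp_posterior(x_train, y_train, x_query, noise_var=1e-4):
    """Return the GP posterior mean and variance at x_query."""
    k_tt = rbf_kernel(x_train, x_train) + noise_var * np.eye(len(x_train))
    k_qt = rbf_kernel(x_query, x_train)
    mean = k_qt @ np.linalg.solve(k_tt, y_train)
    # The prior variance is signal_var = 1.0 at every query point.
    var = 1.0 - np.sum(k_qt * np.linalg.solve(k_tt, k_qt.T).T, axis=1)
    return mean, np.maximum(var, 0.0)

def needs_more_samples(var, threshold=0.1):
    """Hypothetical rule: flag query points whose variance is too high."""
    return var > threshold

# Samples gathered in a (hypothetical) low-fidelity simulator.
x_train = np.array([0.0, 0.5, 1.0])
y_train = np.sin(x_train)

# One query near the data, one far from it.
x_query = np.array([0.5, 5.0])

mean, var = gp_posterior(x_train, y_train, x_query)
flags = needs_more_samples(var)
```

Under these assumptions, the query at 0.5 sits on a training point and is treated as known, while the query at 5.0 is far from all data and is flagged for further sampling.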

Keywords

Reinforcement learning, Leverage (statistics), Computer science, Global Positioning System, Fidelity, Robot, Reduction (mathematics), Gaussian process, Gaussian, Artificial intelligence

