Model-based Ensemble Reinforcement Learning with Soft Proximal Policy Optimization

Dazi Li, Fuqiang Zhu

Year of publication: 2021
Citations: 4

Abstract

Model-free reinforcement learning is widely used in games, robot control, and other fields, and has achieved good results. However, these algorithms need a large number of samples to perform well, which limits their application in real-world domains. Model-based reinforcement learning can use fewer samples, but requires careful tuning. In this article, we use a neural network ensemble model to learn the dynamics, and incorporate model predictive control as the basic control framework. The learned dynamics model is also used to train an initial model-free neural network, combining sample efficiency with task performance. We evaluated our method on the MuJoCo experimental platform. The results show that, compared with other model-free and model-based methods, our approach achieves better task performance while retaining excellent sample efficiency.
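The combination of an ensemble dynamics model with model predictive control described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: linear least-squares models fitted on bootstrap resamples stand in for the neural network ensemble, random shooting is assumed as the MPC planner (the abstract does not name one), and the function names `fit_ensemble`, `predict`, and `mpc_action`, the action range [-1, 1], and all default parameters are hypothetical. The model-free training stage is omitted.

```python
import numpy as np

def fit_ensemble(states, actions, next_states, n_models=5, seed=0):
    """Fit an ensemble of linear dynamics models on bootstrap resamples.

    Assumption: linear models replace the paper's neural networks.
    Each model predicts the state change (next_state - state).
    """
    rng = np.random.default_rng(seed)
    # Inputs are [state, action, 1]; the trailing 1 is a bias term.
    X = np.hstack([states, actions, np.ones((len(states), 1))])
    Y = next_states - states
    models = []
    for _ in range(n_models):
        idx = rng.integers(0, len(X), size=len(X))  # bootstrap sample
        W, *_ = np.linalg.lstsq(X[idx], Y[idx], rcond=None)
        models.append(W)
    return models

def predict(models, state, action):
    """Predict the next state as the mean of the ensemble members."""
    x = np.concatenate([state, action, [1.0]])
    deltas = np.stack([x @ W for W in models])
    return state + deltas.mean(axis=0)

def mpc_action(models, state, reward_fn, action_dim,
               horizon=10, n_candidates=200, rng=None):
    """Random-shooting MPC (an assumed planner, not taken from the paper).

    Samples action sequences uniformly in [-1, 1], rolls each out through
    the learned model, and returns the first action of the best sequence.
    """
    if rng is None:
        rng = np.random.default_rng(0)
    seqs = rng.uniform(-1.0, 1.0, size=(n_candidates, horizon, action_dim))
    returns = np.zeros(n_candidates)
    for i, seq in enumerate(seqs):
        s = np.array(state, dtype=float)
        for a in seq:
            s_next = predict(models, s, a)
            returns[i] += reward_fn(s, a, s_next)
            s = s_next
    return seqs[np.argmax(returns), 0]
```

In a control loop, `mpc_action` would be called at every step with the current state, the chosen action executed in the environment, and the resulting transition added to the dataset used to refit the ensemble.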

Keywords

Reinforcement learning, Computer science, Artificial neural network, Task (project management), Artificial intelligence, Machine learning, Ensemble learning, Sample (material), Control (management), Engineering
