
Model-based Ensemble Reinforcement Learning with Soft Proximal Policy Optimization

Dazi Li, Fuqiang Zhu

Publication year
2021
Citations
4

Abstract

Model-free reinforcement learning is now widely used in games, robot control, and other fields, and has achieved good results. However, these algorithms need a large number of samples to perform well, which limits their application in real-world domains. Model-based reinforcement learning needs fewer samples but requires careful tuning. In this article, we learn the dynamics with a neural network ensemble model and use model predictive control as the basic control framework. The learned dynamics model is also used to train an initial model-free neural network, combining sample efficiency with performance. We evaluated our method on the MuJoCo experimental platform. The results show that, compared with other model-free and model-based methods, our approach achieves better task performance while remaining highly sample-efficient.
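
The idea of planning through a learned ensemble can be illustrated with a toy sketch. The code below is not the authors' implementation: it assumes a hypothetical 1-D environment (`true_step`), replaces the neural networks with bootstrapped linear models, uses random-shooting model predictive control with a squared-state cost, and leaves out the model-free (soft PPO) stage entirely. All function names and parameters are made up for illustration.

```python
import random


def true_step(s, a, rng):
    """Hypothetical 1-D environment (an assumption): s' = 0.9*s + 0.5*a + noise."""
    return 0.9 * s + 0.5 * a + rng.gauss(0.0, 0.01)


def fit_linear(data):
    """Least-squares fit of s' ~ w_s*s + w_a*a via the 2x2 normal equations."""
    sss = sum(s * s for s, a, y in data)
    saa = sum(a * a for s, a, y in data)
    ssa = sum(s * a for s, a, y in data)
    ssy = sum(s * y for s, a, y in data)
    say = sum(a * y for s, a, y in data)
    det = sss * saa - ssa * ssa
    w_s = (ssy * saa - say * ssa) / det
    w_a = (say * sss - ssy * ssa) / det
    return w_s, w_a


def fit_ensemble(data, n_models, rng):
    """Train each ensemble member on its own bootstrap resample of the data."""
    models = []
    for _ in range(n_models):
        boot = [rng.choice(data) for _ in data]
        models.append(fit_linear(boot))
    return models


def predict(models, s, a):
    """Ensemble prediction: the mean of the members' next-state predictions."""
    preds = [w_s * s + w_a * a for w_s, w_a in models]
    return sum(preds) / len(preds)


def mpc_action(models, s, rng, horizon=5, n_candidates=300):
    """Random-shooting MPC: sample action sequences, roll each out through
    the learned model, and return the first action of the cheapest one."""
    best_cost, best_first = float("inf"), 0.0
    for _ in range(n_candidates):
        seq = [rng.uniform(-1.0, 1.0) for _ in range(horizon)]
        sim, cost = s, 0.0
        for a in seq:
            sim = predict(models, sim, a)
            cost += sim * sim  # cost: squared distance of the state from 0
        if cost < best_cost:
            best_cost, best_first = cost, seq[0]
    return best_first


def run_demo(seed=0, steps=30):
    rng = random.Random(seed)
    # Collect random transitions, then fit the ensemble on them.
    data = []
    for _ in range(200):
        s, a = rng.uniform(-5.0, 5.0), rng.uniform(-1.0, 1.0)
        data.append((s, a, true_step(s, a, rng)))
    models = fit_ensemble(data, 5, rng)
    # Control the real system, replanning at every step (receding horizon).
    s = 5.0
    for _ in range(steps):
        s = true_step(s, mpc_action(models, s, rng), rng)
    return models, s


if __name__ == "__main__":
    models, final_state = run_demo()
    print("ensemble weights:", models)
    print("final state:", final_state)
```

Only the first action of the best sequence is executed before replanning, which is what makes this a receding-horizon controller. In a fuller version, the same learned model could also generate rollouts to pretrain a policy network, which is the role the abstract assigns to the model-free component.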

Keywords

Reinforcement learning; Computer science; Artificial neural network; Task (project management); Artificial intelligence; Machine learning; Ensemble learning; Sample (material); Control (management); Engineering
