Shared Trained Models Selection and Management for Transfer Reinforcement Learning in Open IoT
Hirofumi Noguchi, Takuma Isoda, Seisuke Arai
- Year
- 2021
- Citations
- 1
Abstract
This paper proposes a new paradigm that utilizes trained models shared by multiple systems and also proposes a mandatory method to achieve it. Reinforcement learning and transfer learning are powerful technologies to automatically build systems composed of shared devices with various combinations of installation positions and device types. However, current works are premised on an individual system, and learning knowledge is not shared across multiple systems. Our proposed method enables systems to select and transfer an appropriate trained model from shared models across multiple systems. The trained model is selected on the basis of the similarity of input data of learning. The proposed method enables individual systems to complete learning with a small number of demonstrations while maintaining an appropriate number of trained models in the learning model pool. Moreover, since the proposed method does not depend on a specific learning algorithm, it can be applied to various systems. In the simulation of the robot navigation task, the selection of the transfer model by the proposed method increased the sum of rewards by 32% compared with randomly selecting the transfer model. In addition, in experiments on multiple successive learning requests, the proposed method updated the learning model pool and increased the sum of rewards of all systems by 34% compared with the case without transfer learning.
Keywords
Related papers
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002