Reinforcement learning-based collision avoidance: impact of reward function and knowledge transfer

Xiongqing Liu, Yan Jin

Year of publication: 2020
Citations: 14

Abstract

Collision avoidance for robots and vehicles in unpredictable environments is a challenging task. Various control strategies have been developed that let an agent (i.e., a robot or vehicle) sense the environment, assess the situation, and select the optimal actions to avoid collision and accomplish its mission. In our research on autonomous ships, we take a machine learning approach to collision avoidance. Because steering data from human ship masters is scarce, collision avoidance knowledge has to be acquired through reinforcement learning (RL). Given that a learned neural network tends to be a black box, a method is needed for designing an agent's behavior so that the desired knowledge is captured. Furthermore, RL on complex tasks can be time-consuming or infeasible, so a multi-stage learning method is needed in which agents learn from simple tasks and then transfer the learned knowledge to closely related but more complex tasks. In this paper, we explore ways of designing agent behaviors by tuning reward functions and devise a transfer RL method for multi-stage knowledge acquisition. Results from simulation-based agent training show that it is important to understand the role of each component of a reward function and of the various design parameters in transfer RL. The appropriate settings of these parameters depend on the complexity of the tasks and the similarities between them.
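
The abstract does not list the reward terms used in the paper, so the following is only a minimal illustrative sketch of what "tuning the components of a reward function" can look like. Every name here (`RewardWeights`, `reward`, `safe_dist`) and every numeric weight is a hypothetical assumption, not the authors' formulation.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class RewardWeights:
    """Hypothetical weights; the paper's actual terms and values are not given."""
    progress: float = 1.0     # reward per unit of distance closed toward the goal
    collision: float = -10.0  # one-off penalty returned when a collision occurs
    proximity: float = -0.5   # scale of the penalty for entering the safety zone
    step: float = -0.01       # small per-step cost that discourages loitering


def reward(prev_goal_dist, goal_dist, obstacle_dist, collided,
           w=RewardWeights(), safe_dist=2.0):
    """Combine the assumed components into one scalar reward for a single step."""
    if collided:
        return w.collision
    # Progress term: positive when the agent moved closer to the goal.
    r = w.progress * (prev_goal_dist - goal_dist) + w.step
    # Proximity term: grows linearly as the agent intrudes into the safety zone.
    if obstacle_dist < safe_dist:
        r += w.proximity * (safe_dist - obstacle_dist) / safe_dist
    return r


# Same transition scored under two weightings: a stronger proximity penalty
# makes a close pass unattractive even though it shortens the path to the goal.
default_w = RewardWeights()
cautious_w = replace(default_w, proximity=-5.0)
r_default = reward(10.0, 9.0, 1.0, False, default_w)
r_cautious = reward(10.0, 9.0, 1.0, False, cautious_w)
```

Under these assumed weights the close pass earns a positive reward with the default setting and a negative one with the cautious setting, which is the kind of behavioral shift that adjusting a single reward component is meant to produce.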

Keywords

Reinforcement learning, Collision avoidance, Computer science, Task (project management), Function (biology), Robot, Collision, Artificial intelligence, Artificial neural network, Transfer of learning
