首页 /研究 /Inverse reinforcement learning-based time-dependent A* planner for human-aware robot navigation with local vision
LEARNING

Inverse reinforcement learning-based time-dependent A* planner for human-aware robot navigation with local vision

Shiying Sun, Xiaoguang Zhao, Qian‐Zhong Li, Min Tan

发表年份
2020
引用次数
25

摘要

In an environment where robots coexist with humans, mobile robots should be human-aware and comply with humans' behavioural norms so as to not disturb humans' personal space and activities. In this work, we propose an inverse reinforcement learning-based time-dependent A* planner for human-aware robot navigation with local vision. In this method, the planning process of time-dependent A* is regarded as a Markov decision process and the cost function of the time-dependent A* is learned using the inverse reinforcement learning via capturing humans' demonstration trajectories. With this method, a robot can plan a path that complies with humans' behaviour patterns and the robot's kinematics. When constructing feature vectors of the cost function, considering the local vision characteristics, we propose a visual coverage feature for enabling robots to learn from how humans move in a limited visual field. The effectiveness of the proposed method has been validated by experiments in real-world scenarios: using this approach robots can effectively mimic human motion patterns when avoiding pedestrians; furthermore, in a limited visual field, robots can learn to choose a path that enables them to have the larger visual coverage which shows a better navigation performance.

关键词

Artificial intelligenceRobotReinforcement learningComputer scienceMotion planningMobile robotMarkov decision processComputer visionRobot learningProcess (computing)

相关论文

查看 LEARNING 分类全部论文