Optimal Path Finding using Iterative SARSA
Prajval Mohan, Lakshya Sharma, Pranav Narayan
- 发表年份
- 2021
- 引用次数
- 13
摘要
This paper presents a novel and state-of-the-art algorithm named Iterative SARSA to effectively determine the optimal trajectory for an autonomous mobile robot present in an unknown environment. Additionally, a detailed comparative analysis of the proposed algorithm is provided along with other traditional reinforcement learning algorithms using apropos parameters such as path length, computational time, and execution risk. The traditional algorithms used here are Q-learning and SARSA (State-Action-Reward-State-Action). Based on the calculation of next step, these algorithms use either of two reinforcement learning methods - the on-policy or off-policy. While SARSA and Iterative SARSA use the on-policy method, Q-learning utilizes the off-policy method. Optimized trajectory planning along with obstacle avoidance has always been a challenging yet foundational component of various principal applications. Being one of the most primary algorithms of machine learning, any development using Iterative SARSA should render a greater applicative scope.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002