LEARNING
Continuous reinforcement learning via advantage value difference reward shaping: A proximal policy optimization perspective
Jiawei Lin, Xuekai Wei, Weizhi Xian, Jielu Yan, Leong Hou U, Zhaowei Shang, Mingliang Zhou
- Year
- 2025
- Citations
- 6
Keywords
Computer scienceReinforcement learningPerspective (graphical)Temporal difference learningValue (mathematics)ReinforcementArtificial intelligenceMathematical optimizationMachine learningSocial psychology
Related papers
OTHER
📊 26,957 cites
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
PERCEPTION
📊 22,245 cites
Artificial intelligence: a modern approach
1995
OTHER
📊 18,993 cites
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
SWARM
📊 14,853 cites
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002