
Continuous reinforcement learning via advantage value difference reward shaping: A proximal policy optimization perspective

Jiawei Lin, Xuekai Wei, Weizhi Xian, Jielu Yan, Leong Hou U, Zhaowei Shang, Mingliang Zhou

Year: 2025
Citations: 6

Keywords

Computer science, Reinforcement learning, Perspective (graphical), Temporal difference learning, Value (mathematics), Reinforcement, Artificial intelligence, Mathematical optimization, Machine learning, Social psychology
