LEARNING
Reinforcement Learning and Control
Alishba Imran, Keerthana Gopalakrishnan
- Publication year
- 2025
- Citations
- 6
Abstract
This chapter explores reinforcement learning (RL) as a method for enabling robots to autonomously collect data and refine their skills through interaction with their environment. It covers key RL concepts, including Markov decision processes (MDPs), model-free and model-based approaches, and techniques such as reinforcement learning from human feedback (RLHF) and direct preference optimization (DPO) for aligning models with human preferences. The chapter also discusses challenges such as data scarcity and reward design, and highlights future directions in sample efficiency, transfer learning, and sim-to-real adaptation for improving RL in robotics.
Keywords
Reinforcement, Reinforcement learning, Control (management), Psychology, Computer science, Artificial intelligence, Social psychology