LEARNING
Sample-efficient backtrack temporal difference deep reinforcement learning
Qi Liu, Pengbin Chen, Ke Lin, Kaidong Zhao, Jinliang Ding, Yanjie Li
- 发表年份
- 2025
- 引用次数
- 18
关键词
Reinforcement learningTemporal difference learningControl (management)Sampling (signal processing)Bellman equationAction (physics)PrioritizationValue (mathematics)Representation (politics)
相关论文
PERCEPTION
📊 22,245 引用
Artificial intelligence: a modern approach
1995
OTHER
📊 18,993 引用
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
LEARNING
📊 8,465 引用
The Organization of Behavior
D. O. Hebb
2005
LEARNING
📊 7,678 引用
Fractional Brownian Motions, Fractional Noises and Applications
Benoît B. Mandelbrot, John W. Van Ness
1968