LEARNING
Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results
Sridhar Mahadevan
- Year
- 1996
- Citations
- 20
- Access
- Open access
Keywords
Reinforcement learningComputer scienceConvergence (economics)Asynchronous communicationMetric (unit)Temporal difference learningLearning automataQ-learningArtificial intelligencePerformance metric
Related papers
OTHER
📊 26,957 cites
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
PERCEPTION
📊 22,245 cites
Artificial intelligence: a modern approach
1995
OTHER
📊 18,993 cites
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
SWARM
📊 14,853 cites
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002