Autonomous reinforcement of behavioral sequences in neural dynamics

Sohrob Kazerounian, Matthew Luciw, Mathis Richter, Yulia Sandamirskaya

发表年份: 2013
引用次数: 7

摘要

We introduce a dynamic neural algorithm called Dynamic Neural (DN) SARSA(λ) for learning a behavioral sequence from delayed reward. DN-SARSA(λ) combines Dynamic Field Theory models of behavioral sequence representation, classical reinforcement learning, and a computational neuroscience model of working memory, called Item and Order working memory, which serves as an eligibility trace. DN-SARSA(λ) is implemented on both a simulated and real robot that must learn a specific rewarding sequence of elementary behaviors from exploration. Results show DN-SARSA(λ) performs on the level of the discrete SARSA(λ), validating the feasibility of general reinforcement learning without compromising neural dynamics.

关键词

Reinforcement learningComputer scienceArtificial intelligenceSequence (biology)TRACE (psycholinguistics)ReinforcementRepresentation (politics)Behavioral modelingRobotArtificial neural network

Autonomous reinforcement of behavioral sequences in neural dynamics

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory