首页 /研究 /Re-purposing Compact Neuronal Circuit Policies to Govern Reinforcement Learning Tasks.

LEARNING

Re-purposing Compact Neuronal Circuit Policies to Govern Reinforcement Learning Tasks.

Ramin Hasani, Mathias Lechner, Alexander Amini, Daniela Rus, Radu Grosu

发表年份: 2018
引用次数: 5

摘要

We propose an effective method for creating interpretable control agents, by \textit{re-purposing} the function of a biological neural circuit model, to govern simulated and real world reinforcement learning (RL) test-beds. Inspired by the structure of the nervous system of the soil-worm, \emph{C. elegans}, we introduce \emph{Neuronal Circuit Policies} (NCPs) as a novel recurrent neural network instance with liquid time-constants, universal approximation capabilities and interpretable dynamics. We theoretically show that they can approximate any finite simulation time of a given continuous n-dimensional dynamical system, with $n$ output units and some hidden units. We model instances of the policies and learn their synaptic and neuronal parameters to control standard RL tasks and demonstrate its application for autonomous parking of a real rover robot on a pre-defined trajectory. For reconfiguration of the \emph{purpose} of the neural circuit, we adopt a search-based RL algorithm. We show that our neuronal circuit policies perform as good as deep neural network policies with the advantage of realizing interpretable dynamics at the cell-level. We theoretically find bounds for the time-varying dynamics of the circuits, and introduce a novel way to reason about networks' dynamics.

关键词

Reinforcement learningControl reconfigurationComputer scienceArtificial neural networkTrajectoryFunction (biology)Control (management)Dynamics (music)Controller (irrigation)Control theory (sociology)

Re-purposing Compact Neuronal Circuit Policies to Govern Reinforcement Learning Tasks.

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory