Autonomous learning of active multi-scale binocular vision

Luca Lonini, Yu Zhao, Pramod Chandrashekhariah, Bertram E. Shi, Jochen Triesch

Year of publication: 2013
Citations: 24

Abstract

We present a method for autonomously learning representations of visual disparity between images from the left and right eyes, as well as appropriate vergence movements to fixate objects with both eyes. A sparse coding model (perception) encodes sensory information using binocular basis functions, while a reinforcement learner (behavior) generates eye movements according to the sensed disparity. Perception and behavior develop in parallel by minimizing the same cost function: the reconstruction error of the stimulus by the generative model. To cope efficiently with multiple disparity ranges, sparse coding models are learnt at multiple scales, encoding disparities at various resolutions. Similarly, vergence commands are defined on a logarithmic scale to allow both coarse and fine actions. We demonstrate the efficacy of the proposed method using the humanoid robot iCub. We show that the model is fully self-calibrating and does not require any prior information about the camera parameters or the system dynamics.
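
The two ideas named in the abstract, a logarithmically spaced vergence action set and a reward equal to the negative reconstruction error summed over scales, can be illustrated with a minimal sketch. Everything concrete here is an assumption for illustration and is not taken from the paper: the function names, the step sizes (0.1 to 6.4 in arbitrary angular units), the doubling factor, and the use of greedy matching pursuit over unit-norm basis columns as the sparse coder.

```python
import numpy as np

def log_vergence_actions(min_step=0.1, max_step=6.4, factor=2.0):
    """Symmetric action set: zero plus +/- magnitudes spaced by a constant ratio.

    min_step, max_step and factor are illustrative values, not the paper's.
    """
    n = int(round(np.log(max_step / min_step) / np.log(factor))) + 1
    mags = min_step * factor ** np.arange(n)
    return np.concatenate([-mags[::-1], [0.0], mags])

def reconstruction_error(patch, basis, n_active=3):
    """Normalized residual energy after greedy matching pursuit.

    basis: (dim, n_atoms) array whose columns are assumed to be unit-norm.
    """
    patch = np.asarray(patch, dtype=float)
    energy = float(patch @ patch)
    if energy == 0.0:
        return 0.0  # an all-zero patch is reconstructed trivially
    residual = patch.copy()
    for _ in range(n_active):
        coeffs = basis.T @ residual          # projection onto every atom
        k = int(np.argmax(np.abs(coeffs)))   # best-matching atom
        residual -= coeffs[k] * basis[:, k]  # remove its contribution
    return float(residual @ residual) / energy

def reward(patches_by_scale, bases_by_scale, n_active=3):
    """Negative reconstruction error summed over all scales."""
    return -sum(
        reconstruction_error(p, b, n_active)
        for p, b in zip(patches_by_scale, bases_by_scale)
    )
```

With these hypothetical defaults, `log_vergence_actions()` yields 15 commands: small steps for fine correction near fixation and large steps for coarse jumps. A reinforcement learner would pick one command per step and receive `reward(...)` computed from the binocular patches seen after the movement.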

Keywords

iCub; Computer science; Artificial intelligence; Neural coding; Computer vision; Binocular disparity; Coding (social sciences); Generative model; Perception; Gaze
