首页 /研究 /Autonomous learning of active multi-scale binocular vision
PERCEPTION

Autonomous learning of active multi-scale binocular vision

Luca Lonini, Yu Zhao, Pramod Chandrashekhariah, Bertram E. Shi, Jochen Triesch

发表年份
2013
引用次数
24

摘要

We present a method for autonomously learning representations of visual disparity between images from left and right eye, as well as appropriate vergence movements to fixate objects with both eyes. A sparse coding model (perception) encodes sensory information using binocular basis functions, while a reinforcement learner (behavior) generates the eye movement, according to the sensed disparity. Perception and behavior develop in parallel, by minimizing the same cost function: the reconstruction error of the stimulus by the generative model. In order to efficiently cope with multiple disparity ranges, sparse coding models are learnt at multiple scales, encoding disparities at various resolutions. Similarly, vergence commands are defined on a logarithmic scale to allow both coarse and fine actions. We demonstrate the efficacy of the proposed method using the humanoid robot iCub. We show that the model is fully self-calibrating and does not require any prior information about the camera parameters or the system dynamics.

关键词

iCubComputer scienceArtificial intelligenceNeural codingComputer visionBinocular disparityCoding (social sciences)Generative modelPerceptionGaze

相关论文

查看 PERCEPTION 分类全部论文