Out in the World: What Did The Robot Hear AndSee?

Lijin Aryananda

Year: 2005
Citations: 3

Abstract

It is now well established that joint attention is a key capability for socially interacting robots (Brooks et al., 1999, Kaplan and Hafner, 2004, Scassellati, 1999, Itti, 2003). It is also a key component for epigenetic robotic applications in general. This subject has been widely discussed and we present here one specific technical improvement for joint attention which relies on image segmentation. In the new Talking Robots (Baillie, 2004) experiment that we have started, following a successful reimplementation of the Sony’s Talking Heads (Steels, 1998) experiment on Aibo ERS7, we try to have two robots inter- acting to evolve a shared repertoire of syn- chronized behaviors, or “games”. This leads to a dynamic version of the Talk- ing Heads, where the interaction protocol, or language game, is not predefined in the agents. As usually with experiments involving symbol grounding and social interaction, and for these experiments in particular, it is es- sential that the two robots can establish joint attention and share a common representa- tion of their surrounding environment, while they are looking at the same scene. Many techniques can be used to achieve this stable shared representation. We have chosen to fo- cus on segmentation-based algorithms, which provide interesting shapes together with vec- tors of features that can be easily extracted and proved useful in the context of our exper- iment. However, when using segmentation algo- rithms, a slight change in the viewpoint, or even the residual camera noise, is enough to significantly change the result of the image partition, possibly leading to completely dif- ferent perceptions of the same scene. We have developed an original measure to assess the stability of a segmentation algo- rithm and we have used it on a set of algo- rithms to automatically determine the most stable partition for a given scene. This ap- proach is different from classical methods used to estimate the quality of a segmentation al- gorithm, where the result of the algorithm is compared to an ideal perfect segmentation done by hand. In our approach, the measure is done automatically, involves only stability considerations and could lead to interesting improvements whenever joint attention using segmentation is required. We quickly present in the poster the back- ground of the Talking Robots experiment and why joint attention and image segmentation stability is an important issue for us. We in- troduce then two stability measures and show some results on natural scenes from our ex- periments in the lab and test scenes used to control image parameters. The influence of several image characterizations (noise, num- ber of objects, luminosity,...) is carefully re- viewed. An example (fig.1) is given below, showing the influence of noise on typical im- ages. Using the fact that a given algorithm can be ranked according to its stability score, which is calculated online assuming that the scene itself is static, a general method of algorithm switching is introduced and used in the ex- periment with different kind of algorithms: region growing, recursive histogram splitting, CSC (Priese and Rehrmann, 1993) and split & merge (CIS). We show how this method significantly improves the convergence speed in the experiment and conclude on the gen- erality of our approach to facilitate certain aspects of joint attention, when it relies on segmentation.

Keywords

RobotComputer scienceSegmentationArtificial intelligenceContext (archaeology)Human–computer interactionKey (lock)Computer vision

Out in the World: What Did The Robot Hear AndSee?

Abstract

Keywords

Related papers

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory