Home /Research /Exploiting cross-modal rhythm for robot perception of objects
PERCEPTION

Exploiting cross-modal rhythm for robot perception of objects

Artur Arsénio, Paul Fitzpatrick

Year
2004
Citations
16

Abstract

Why: Objects that move rhythmically are common and important in the kinds of workplaces where we might employ a humanoid robot. The humanoid form is often argued for so that the robot can interact well with tools designed for humans, and such tools are typically used in a repetitive manner, whether the sound is generated by physical abrasion or collision: hammers, chisels, saws etc. We also work with the perception of toys designed for infants – rattles, bells etc. – which could have utility for entertainment/pet robotics. The advantage of combining rhythmic information across acoustic and visual senses is that these senses have complementary properties. Since sound waves disperse more readily than light, vision retains more spatial structure – but for the same reason it is sensitive to occlusion and the relative angle of the robot’s sensors, while auditory perception is quite robust to these factors. The spatial trajectory of a moving object can be recovered quite straightforwardly from visual analysis, but not from sound. However, the trajectory in itself is not very revealing about the nature of the object. We use the trajectory to extract visual and acoustic features – patches of pixels, and sound frequency bands – that are likely to be associated with the object. Both can be used for recognition. Sound features are easier to use since they are relatively insensitive to spatial parameters such as the relative position and pose of the object and the robot. How: For each object moving visually, fragments of the concurrent sound input are taken for periods of that object, aligned, and compared. If the fragments are consistent, with sound and vision in phase with each other, then the visual trajectory and the sound are considered bound. The relationship betweenobject motion andthe sound generatedvaries inanobject-specific way. The hammer causes sound when changing direction after striking an object. The bell typically causes sound at either extreme of motion. A toy truck causes sound while moving rapidly with wheels spinning; it is quiet when changing direction.

Keywords

Computer visionArtificial intelligenceRobotComputer sciencePerceptionTrajectoryObject (grammar)Humanoid robotRoboticsAcoustics

Related papers

Browse all PERCEPTION papers