Toward Interactive Grounded Language Acqusition

Thomas Kollar, Jayant Krishnamurthy, Grant P. Strimel

发表年份: 2013
引用次数: 22
访问权限: 开放获取

摘要

This paper addresses the problem of enabling robots to interactively learn visual and spatial models from multi-modal interactions involving speech, gesture and images. Our approach, called Logical Semantics with Perception (LSP), provides a natural and intuitive interface by significantly reducing the amount of supervision that a human is required to provide. This paper demonstrates LSP in an interactive setting. Given speech and gesture input, LSP is able to learn object and relation classifiers for objects like mugs and relations like left and right. We extend LSP to generate complex natural language descriptions of selected objects using adjectives, nouns and relations, such as "the orange mug to the right of the green book." Furthermore, we extend LSP to incorporate determiners (e.g., "the") into its training procedure, enabling the model to generate acceptable relational language 20% more often than the unaugmented model.

关键词

Computer scienceGrounded theoryLinguisticsHuman–computer interactionSociologyQualitative researchAnthropologyPhilosophy

Toward Interactive Grounded Language Acqusition

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory