Toward Interactive Grounded Language Acquisition
Thomas Kollar, Jayant Krishnamurthy, Grant P. Strimel
- Publication year
- 2013
- Citations
- 22
- Access
- Open access
Abstract
This paper addresses the problem of enabling robots to interactively learn visual and spatial models from multi-modal interactions involving speech, gesture and images. Our approach, called Logical Semantics with Perception (LSP), provides a natural and intuitive interface by significantly reducing the amount of supervision that a human is required to provide. This paper demonstrates LSP in an interactive setting. Given speech and gesture input, LSP is able to learn object and relation classifiers for objects like mugs and relations like left and right. We extend LSP to generate complex natural language descriptions of selected objects using adjectives, nouns and relations, such as "the orange mug to the right of the green book." Furthermore, we extend LSP to incorporate determiners (e.g., "the") into its training procedure, enabling the model to generate acceptable relational language 20% more often than the unaugmented model.