Home /Research /Enhancing Human Action Recognition through Spatio-temporal Feature Learning and Semantic Rules
PERCEPTION

Enhancing Human Action Recognition through Spatio-temporal Feature Learning and Semantic Rules

Karinne Ramírez-Amaro, Eun‐Sol Kim, Jiseob Kim, Byoung‐Tak Zhang, Michael Beetz, Gordon Cheng

Year
2013
Citations
8

Abstract

Abstract — In this paper, we present a two-stage framework that deal with the problem of automatically extract human activities from videos. First, for action recognition we employ an unsupervised state-of-the-art learning algorithm based on Independent Subspace Analysis (ISA). This learning algorithm extracts spatio-temporal features directly from video data and it is computationally more efficient and robust than other unsupervised methods. Nevertheless, when applying this one-stage state-of-the-art action recognition technique on the ob-servations of human everyday activities, it can only reach an accuracy rate of approximately 25%. Hence, we propose to enhance this process with a second stage, which define a new method to automatically generate semantic rules that can reason about human activities. The obtained semantic rules enhance the human activity recognition by reducing the com-plexity of the perception system and they allow the possibility of domain change, which can great improve the synthesis of robot behaviors. The proposed method was evaluated under two complex and challenging scenarios: making a pancake and making a sandwich. The difficulty of these scenarios is that they contain finer and more complex activities than the well known data sets (Hollywood2, KTH, etc). The results show benefits of two stages method, the accuracy of action recognition was significantly improved compared to a single-stage method (above 87 % compared to human expert). This indicates the improvement of the framework using the reasoning engine for the automatic extraction of human activities from observations, thus, providing a rich mechanism for transferring a wide range of human skills to humanoid robots. I.

Keywords

Computer scienceArtificial intelligenceProcess (computing)Action (physics)Machine learningFeature (linguistics)Subspace topologyPattern recognition (psychology)Domain (mathematical analysis)Feature extraction

Related papers

Browse all PERCEPTION papers