Compositional Structure Learning for Action Understanding

Ran Xu, Gang Chen, Caiming Xiong, Wei Chen, Jason J. Corso

Year: 2014
Access: Open access

Abstract

The action understanding literature has focused predominantly on classification; however, many applications, such as mobile robotics and video search, demand richer action understanding, with solutions to classification, localization, and detection. In this paper, we propose a compositional model that leverages a new mid-level representation called compositional trajectories and a locally articulated spatiotemporal deformable parts model (LASTDPM) for full action understanding. Our method is advantageous in capturing the variable structure of dynamic human activity over a long range. First, the compositional trajectories capture long-range, frequently co-occurring groups of trajectories in space-time and represent them in discriminative hierarchies, where human motion is largely separated from camera motion; second, LASTDPM learns a structured model with multi-layer deformable parts to capture multiple levels of articulated motion. We implement our methods and demonstrate state-of-the-art performance on all three problems: action detection, localization, and recognition.

Keywords

cs.CV
