Discovering discriminative action parts from mid-level video representations 论文

2012引用 239

Human Pose and Action RecognitionVideo Analysis and SummarizationVideo Surveillance and Tracking Methods

Video Surveillance and Tracking Methods Human Pose and Action Recognition Video Analysis and Summarization

作者

摘要

We describe a mid-level approach for action recognition. From an input video, we extract salient spatio-temporal structures by forming clusters of trajectories that serve as candidates for the parts of an action. The assembly of these clusters into an action class is governed by a graphical model that incorporates appearance and motion constraints for the individual parts and pairwise constraints for the spatio-temporal dependencies among them. During training, we estimate the model parameters discriminatively. During classification, we efficiently match the model to a video using discrete optimization. We validate the model's classification ability in standard benchmark datasets and illustrate its potential to support a fine-grained analysis that not only gives a label to a video, but also identifies and localizes its constituent parts.

作者查看全部 (3)

Stefano Soatto

I. Kokkinos

Michalis Raptis

Discovering discriminative action parts from mid-level video representations 论文

详细信息

摘要

作者查看全部 (3)

相关技术查看全部 (2)

相关事件

相关文章