Zhen, Xiantong and Shao, Ling (2013) Spatio-temporal steerable pyramid for human action recognition. In: 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 22nd - 26th April 2013, Shanghai, China.
Full text not available from this repository. (Request a copy)Abstract
In this paper, we propose a novel holistic representation based on the spatio-temporal steerable pyramid (STSP) for human action recognition. The spatio-temporal Laplacian pyramid provides an effective technique for multi-scale analysis of video sequences. By decomposing spatio-temporal volumes into band-passed sub-volumes, spatio-temporal patterns residing in different scales will be nicely localized. Then three-dimensional separable steerable filters are conducted on each of the sub-volume to capture the spatio-temporal orientation information efficiently. The outputs of the quadrature pair of steerable filters are squared and summed to yield a more robust measure of motion energy. To make the representation invariant to shifting and applicable with coarsely-extracted bounding boxes for the performed actions, max pooling operations are employed between responses of the filtering at adjacent scales, and over spatio-temporal local neighborhoods. Taking advantage of multi-scale and multi-orientation analysis and feature pooling, STSP produces a compact but informative and invariant representation of human actions. We conduct extensive experiments on the KTH, IXMAS and HMDB51 datasets, and the proposed STSP achieves comparable results with the state-of-the-art methods.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Uncontrolled Keywords: | filtering theory, image motion analysis, image representation, image sequences, object recognition, video signal processing |
Subjects: | G400 Computer Science |
Department: | Faculties > Engineering and Environment > Computer and Information Sciences |
Depositing User: | Paul Burns |
Date Deposited: | 16 Jun 2015 12:45 |
Last Modified: | 13 Oct 2019 00:33 |
URI: | http://nrl.northumbria.ac.uk/id/eprint/22949 |
Downloads
Downloads per month over past year