Loading...
Supervised spatio-temporal kernel descriptor for human action recognition from RGB-depth videos
Asadi Aghbolaghi, M ; Sharif University of Technology
449
Viewed
- Type of Document: Article
- DOI: 10.1007/s11042-017-5017-y
- Abstract:
- One of the most challenging tasks in computer vision is human action recognition. The recent development of depth sensors has created new opportunities in this field of research. In this paper, a novel supervised spatio-temporal kernel descriptor (SSTKDes) is proposed from RGB-depth videos to establish a discriminative and compact feature representation of actions. To enhance the descriptive and discriminative ability of the descriptor, extracted primary kernel-based features are transformed into a new space by exploiting a supervised training strategy; i.e., large margin nearest neighbor (LMNN). The LMNN highly reduces the error of a nearest neighbor classifier by minimizing the intra-class variations and maximizing the inter-class distances. Subsequently, the efficient match kernel (EMK) is used to abstract the mid-level kernel features for a more efficient classification. The proposed approach is evaluated on five public benchmark datasets. The experimental evaluations demonstrate that the proposed method achieves superior performance to the state-of-the-art methods. © 2017 Springer Science+Business Media, LLC
- Keywords:
- Action recognition ; EMK ; Supervised kernel descriptor ; Hardware ; Action recognition ; Human-action recognition ; Kernel descriptor ; Large margin nearest neighbors ; LMNN ; Nearest neighbor classifier ; RGB-D video ; State-of-the-art methods ; Multimedia systems
- Source: Multimedia Tools and Applications ; 2017 , Pages 1-21 ; 13807501 (ISSN)
- URL: https://link.springer.com/article/10.1007/s11042-017-5017-y