
Semantic segmentation of RGB-D images using 3D and local neighbouring features

Fooladgar, F ; Sharif University of Technology | 2015

  1. Type of Document: Article
  2. DOI: 10.1109/DICTA.2015.7371307
  3. Publisher: Institute of Electrical and Electronics Engineers Inc., 2015
  4. Abstract:
  5. 3D scene understanding is one of the most important problems in computer vision. Although considerable attention has been devoted to 2D scene understanding over the past decades, now, with the development of depth sensors (such as Microsoft Kinect), 3D scene understanding has become a very challenging task. Traditionally, scene understanding has been treated as the semantic labeling of each image pixel. Semantic labeling of RGB-D images has not attained success comparable to that of RGB semantic labeling, owing to the lack of a challenging dataset. With the introduction of the NYU-V2 RGB-D dataset, it became possible to propose novel methods that improve labeling accuracy. In this paper, a semantic segmentation algorithm for RGB-D images is presented. The proposed algorithm focuses on the feature description and classification steps. In the feature description step, the more discriminative features from RGB images and the 3D point cloud data are grouped with local neighboring features to incorporate their context into the classification step. In the classification step, a pairwise multi-class conditional random field framework is used, in which the unary potential function is taken as the probabilistic output of a random forest classifier. The proposed algorithm is evaluated on the NYU-V2 dataset, and its performance is compared with that of other methods in the literature. The proposed algorithm achieves state-of-the-art results on the NYU-V2 dataset.
  6. Keywords:
  7. RGB-D image segmentation ; Semantic scene labeling ; Algorithms ; Classification (of information) ; Computer vision ; Decision trees ; Image processing ; Semantics ; Three dimensional computer graphics ; 3D features ; Conditional random field ; Discriminative features ; Feature description ; Probabilistic output ; Random forest classifier ; Scene understanding ; Semantic segmentation ; Image segmentation
  8. Source: 2015 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2015, 23 November 2015 through 25 November 2015 ; 2015 ; 9781467367950 (ISBN)
  9. URL: http://ieeexplore.ieee.org/document/7371307/?reload=true
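
The classification step described in the abstract (a pairwise conditional random field whose unary potential comes from the probabilistic output of a classifier) can be illustrated with a minimal sketch. Everything beyond that one sentence is an assumption made for illustration: the per-pixel probabilities stand in for a random forest's output, the pairwise term is a plain Potts penalty on a 4-connected grid, inference uses iterated conditional modes (ICM), and the function names `unary_from_probs`, `energy`, and `icm` are hypothetical. The paper's actual pairwise potential and inference method are not given in this record.

```python
import math


def unary_from_probs(probs, eps=1e-9):
    """Convert per-pixel class probabilities (assumed classifier output)
    into unary costs, -log p. probs[y][x] is a list of class probabilities."""
    return [[[-math.log(max(p, eps)) for p in pixel] for pixel in row]
            for row in probs]


def energy(labels, unary, weight):
    """Total energy: unary costs plus a Potts penalty (assumed pairwise term)
    for every pair of 4-connected neighbours with different labels."""
    h, w = len(labels), len(labels[0])
    total = 0.0
    for y in range(h):
        for x in range(w):
            total += unary[y][x][labels[y][x]]
            if x + 1 < w and labels[y][x] != labels[y][x + 1]:
                total += weight
            if y + 1 < h and labels[y][x] != labels[y + 1][x]:
                total += weight
    return total


def icm(unary, weight, max_sweeps=10):
    """Approximate minimisation of the energy by iterated conditional modes:
    start from the per-pixel best label, then repeatedly relabel each pixel
    to minimise its own unary cost plus disagreement with its neighbours."""
    h, w = len(unary), len(unary[0])
    n_classes = len(unary[0][0])
    labels = [[min(range(n_classes), key=lambda k: unary[y][x][k])
               for x in range(w)] for y in range(h)]
    for _ in range(max_sweeps):
        changed = False
        for y in range(h):
            for x in range(w):
                neighbours = [labels[ny][nx]
                              for ny, nx in ((y - 1, x), (y + 1, x),
                                             (y, x - 1), (y, x + 1))
                              if 0 <= ny < h and 0 <= nx < w]

                def local_cost(k):
                    disagreements = sum(1 for n in neighbours if n != k)
                    return unary[y][x][k] + weight * disagreements

                best = min(range(n_classes), key=local_cost)
                if best != labels[y][x]:
                    labels[y][x] = best
                    changed = True
        if not changed:
            break
    return labels


if __name__ == "__main__":
    # Toy 3x3 image, two classes; the centre pixel weakly prefers class 1.
    probs = [[[0.9, 0.1] for _ in range(3)] for _ in range(3)]
    probs[1][1] = [0.4, 0.6]
    unary = unary_from_probs(probs)
    print(icm(unary, weight=0.0))  # unary only
    print(icm(unary, weight=1.0))  # with neighbour smoothing
```

With `weight=0.0` the result is just the per-pixel most probable label; a positive weight lets confident neighbours override a weak, isolated prediction, which is the role of context in a pairwise model of this kind.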