Loading...
Search for:
fooladgar--fahimeh
0.053 seconds
Learning strengths and weaknesses of classifiers for RGB-D semantic segmentation
, Article 9th Iranian Conference on Machine Vision and Image Processing, 18 November 2015 through 19 November 2015 ; Volume 2016-February , 2015 , Pages 176-179 ; 21666776 (ISSN) ; 9781467385398 (ISBN) ; Kasaei, S ; Sharif University of Technology
IEEE Computer Society
2015
Abstract
3D scene understanding is an open challenge in the field of computer vision. Most of the focus is on 2D methods in which the semantic labeling of each RGB pixel is considered. But, in this paper, the 3D semantic labeling of RGB-D images is considered. In the proposed method, to extract some meaningful features, the superpixel generation algorithm is applied to the RGB image to segment it into a set of disjoint pixels. After that, the set of three powerful classifiers are utilized to semantically label each superpixel. In the proposed method, the probability outputs of these classifiers are concatenated as the novel feature vector for each superpixel. Consequently, to analyze the strengths...
Semantic segmentation of RGB-D images using 3D and local neighbouring features
, Article 2015 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2015, 23 November 2015 through 25 November 2015 ; 2015 ; 9781467367950 (ISBN) ; Kasaei, S ; Sharif University of Technology
Institute of Electrical and Electronics Engineers Inc
2015
Abstract
3D scene understanding is one of the most important problems in the field of computer vision. Although, in the past decades, considerable attention has been devoted on the 2D scene understanding problem, now with the development of the depth sensors (like Microsoft Kinect), the 3D scene understanding has become a very challenging task. Traditionally, the scene understanding problem was considered as the semantic labeling of each image pixel. Semantic labeling of RGB-D images has not attained a comparable success, as the RGB semantic labeling, due to the lack of a challenging dataset. With the introduction of an RGB-D dataset, called NYU-V2, it became possible to propose a novel method to...
3M2RNet: Multi-modal multi-resolution refinement network for semantic segmentation
, Article Computer Vision Conference, CVC 2019, 25 April 2019 through 26 April 2019 ; Volume 944 , 2020 , Pages 544-557 ; Kasaei, S ; Sharif University of Technology
Springer Verlag
2020
Abstract
One of the most important steps towards 3D scene understanding is the semantic segmentation of images. The 3D scene understanding is considered as the crucial requirement in computer vision and robotic applications. With the availability of RGB-D cameras, it is desired to improve the accuracy of the scene understanding process by exploiting the depth along with appearance features. One of the main problems in RGB-D semantic segmentation is how to fuse or combine these two modalities to achieve more advantages of the common and specific features of each modality. Recently, the methods that encounter deep convolutional neural networks have reached the state-of-the-art results in dense...
Lightweight residual densely connected convolutional neural network
, Article Multimedia Tools and Applications ; Volume 79, Issue 35-36 , 2020 , Pages 25571-25588 ; Kasaei, S ; Sharif University of Technology
Springer
2020
Abstract
Extremely efficient convolutional neural network architectures are one of the most important requirements for limited-resource devices (such as embedded and mobile devices). The computing power and memory size are two important constraints of these devices. Recently, some architectures have been proposed to overcome these limitations by considering specific hardware-software equipment. In this paper, the lightweight residual densely connected blocks are proposed to guaranty the deep supervision, efficient gradient flow, and feature reuse abilities of convolutional neural network. The proposed method decreases the cost of training and inference processes without using any special...
A survey on indoor RGB-D semantic segmentation: from hand-crafted features to deep convolutional neural networks
, Article Multimedia Tools and Applications ; Volume 79, Issue 7-8 , 2020 , Pages 4499-4524 ; Kasaei, S ; Sharif University of Technology
Springer
2020
Abstract
Semantic segmentation is one of the most important tasks in the field of computer vision. It is the main step towards scene understanding. With the advent of RGB-Depth sensors, such as Microsoft Kinect, nowadays RGB-Depth images are easily available. This has changed the landscape of some tasks such as semantic segmentation. As the depth images are independent of illumination, the combination of depth and RGB images can improve the quality of semantic labeling. The related research has been divided into two main categories, based on the usage of hand-crafted features and deep learning. Although the state-of-the-art results are mainly achieved by deep learning methods, traditional methods...
Supervised Semantic Segmentation of RGB-Depth Images
, Ph.D. Dissertation Sharif University of Technology ; Kasaei, Shohreh (Supervisor)
Abstract
The labeling process is one of the most important tasks in the field of computer vision. The dense labeling problem is the main step towards 2D and 3D scene understanding. The main goal of dense labeling is to label all pixels of images that are known as a semantic segmentation of images in the related literature. Although the state-of-the-art results are mainly achieved by deep learning methods, traditional methods had also been at the center of attention for some years. In the last decades, convolutional neural networks have changed the landscape of visual recognition tasks such as labeling and semantic segmentation. The most important issues in deep learning models are the hardware and...
Geometrical analysis of localization error in stereo vision systems
, Article IEEE Sensors Journal ; Volume 13, Issue 11 , 2013 , Pages 4236-4246 ; 1530437X (ISSN) ; Samavi, S ; Soroushmehr, S. M. R ; Shirani, S ; Sharif University of Technology
2013
Abstract
Determining an object location in a specific region is an important task in many machine vision applications. Different parameters affect the accuracy of the localization process. The quantization process in charge-coupled device of a camera is one of the sources of error that causes estimation rather than identifying the exact position of the observed object. A cluster of points, in the field of view of a camera are mapped into a pixel. These points form an uncertainty region. In this paper, we present a geometrical model to analyze the volume of this uncertainty region as a criterion for object localization error. The proposed approach models the field of view of each pixel as an oblique...
Principles of Designing a Chair for Iranian Kitchen, Concerning Kansei Engineering Approach
, M.Sc. Thesis Sharif University of Technology ; Eshraghniaye Jahromi, Abdolhamid (Supervisor) ; Sadeghi Naeini, Hassan ($item.subfieldsMap.e)
Abstract
Iranian women spend a lot of times in the kitchen area in order to have good effects on the family. Unfortunately, we see many of them are suffering from body injuries and particularly musculoskeletal disorders in their middle ages. Many factors cause such problems among which awkward postures are mentioned and with identifying these postures, corrective solutions could be proposed.In this study, firstly, relationship of body injuries with working in the kitchen area is considered, awkward postures are identified and expedient actions are suggested. The aim of this study is chosen designing a suitable task chair for kitchen among the proposed solutions. Then, chair specifications are...