Sharif Digital Repository / Sharif University of Technology / Search result

Weakly Supervised Semantic Segmentation Using Deep Neural Networks

, M.Sc. Thesis Sharif University of Technology Khairi Atani, Masoud (Author) ; Kasaei, Shohreh (Supervisor)

Abstract

Semantic segmentation which is the classification of every pixel in an input image is a fundamental task in the fields of computer vision and scene understanding. Applications of semantic segmentation include usage in autonomous vehicles and robotics. Since in this task dense annotation of images in the dataset is needed, recent methods have been proposed to utilize weakly-supervised and semi-supervised learning using data with weak labels and unlabeled data respectively. Because the amount of fully labeled data might not be sufficient in such methods, some papers have proposed to employ depth input data due to its rich geometrical and local information when available. In this research, an...

محتواي کتاب

Semi-supervised parallel shared encoders for speech emotion recognition

, Article Digital Signal Processing: A Review Journal ; Volume 118 , 2021 ; 10512004 (ISSN) Pourebrahim, Y ; Razzazi, F ; Sameti, H ; Sharif University of Technology

Elsevier Inc 2021

Abstract

Supervised speech emotion recognition requires a large number of labeled samples that limit its use in practice. Due to easy access to unlabeled samples, a new semi-supervised method based on auto-encoders is proposed in this paper for speech emotion recognition. The proposed method performed the classification operation by extracting the information contained in unlabeled samples and combining it with the information in labeled samples. In addition, it employed maximum mean discrepancy cost function to reduce the distribution difference when the labeled and unlabeled samples were gathered from different datasets. Experimental results obtained on different emotional speech datasets...

One step toward a richer model of unsupervised grammar induction

, Article International Conference on Recent Advances in Natural Language Processing, RANLP 2005, 21 September 2005 through 23 September 2005 ; Volume 2005-January , 2005 , Pages 197-203 ; 13138502 (ISSN) ; 9549174336 (ISBN) Feili, H ; Ghassem Sani, G. R ; Angelova G ; Bontcheva K ; Mitkov R ; Nicolov N ; Nikolov N ; Sharif University of Technology

Association for Computational Linguistics (ACL) 2005

Abstract

Probabilistic Context-Free Grammars (PCFGs) are useful tools for syntactic analysis of natural languages. Availability of large Treebank has encouraged many researchers to use PCFG in language modeling. Automatic learning of PCFGs is divided into three different categories, based on the needed data set for the training phase: supervised, semi-supervised and unsupervised. Most current inductive methods are supervised, which need a bracketed data set in the training phase. However, lack of this kind of data set in many languages, has encouraged us to pay more attention to unsupervised approaches. So far, unsupervised approaches have achieved little success. By considering a history-based...

Continual Learning Using Unsupervised Data

, M.Sc. Thesis Sharif University of Technology Ameli Kalkhoran, Amir Hossein (Author) ; Soleymani Baghshah, Mahdieh (Supervisor)

Abstract

The existing continual learning methods are mainly focused on fully-supervised scenarios and are still not able to take advantage of unlabeled data available in the environment. Some recent works tried to investigate semi-supervised continual learning (SSCL) settings in which the unlabeled data are available, but it is only from the same distribution as the labeled data. This assumption is still not general enough for real-world applications and restricts the utilization of unsupervised data. In this work, we introduce Open-Set Semi-Supervised Continual Learning (OSSCL), a more realistic semi-supervised continual learning setting in which out-of-distribution (OoD) unlabeled samples in the...

محتواي کتاب

Image Annotation Using Semi-supervised Learning

, Ph.D. Dissertation Sharif University of Technology Amiri, Hamid (Author) ; Jamzad, Mansour (Supervisor)

Abstract

Aautomatic image annotation that assigns some labels to input images and provides a textual description for the contents of images has become an active field in machine vision community. To design an annotation system, we need a dataset that contains images and labels for them. However, a large amount of manual efforts is required to annotate all images in a dataset. To reduce the demand of annotation systems on the labeled images, one solution is to exploit useful information embedded into the unlabeled images and incorporate them into learning process. In machine learning community, semi-supervised learning (SSL) has been introduced with the aim of incorporating unlabeled samples into the...

محتواي کتاب

Context-based Persian Grapheme-to-Phoneme Conversion using Sequence-to-Sequence Models

, M.Sc. Thesis Sharif University of Technology Rahmati, Elnaz (Author) ; Sameti, Hossein (Supervisor)

Abstract

Many Text-to-Speech (TTS) systems, particularly in low-resource environments, struggle to produce natural and intelligible speech from grapheme sequences. One solution to this problem is to use Grapheme-to-Phoneme (G2P) conversion to increase the information in the input sequence and improve the TTS output. However, current G2P systems are not accurate or efficient enough for Persian texts due to the language’s complexity and the lack of short vowels in Persian grapheme sequences. In our study, we aimed to improve resources for the Persian language. To achieve this, we introduced two new G2P training datasets, one manually-labeled and the other machine-generated, containing over five million...

محتواي کتاب

3D Medical Images Segmentation by Effective Use of Unlabeled Data

, M.Sc. Thesis Sharif University of Technology Khalili, Hossein (Author) ; Soleymani Baghshah, Mahdieh (Supervisor)

Abstract

Image segmentation in medical imaging, as one of the most important branches of medical image analysis, often faces the challenge of limited labeled data for application in deep learning methods. The high cost of data collection and the need for expertise in image segmentation, particularly in three-dimensional images such as MRI and CT or sequence images like CMR, have all contributed to this problem, even for popular networks like U-Net, which struggle to achieve high accuracy. As a result, research efforts have focused on semi-supervised learning approaches, weakly supervised learning, as well as multi-instance learning in medical image segmentation. Unfortunately, each of these methods...

محتواي کتاب

Deep Zero-shot Learning

, M.Sc. Thesis Sharif University of Technology Shojaee, Mohsen (Author) ; Soleymani, Mahdieh (Supervisor)

Abstract

In some of object recognition problems, labeled data may not be available for all categories. Zero-shot learning utilizes auxiliary information (also called signatures) describing each category in order to find a classifier that can recognize samples from categories with no labeled instance. On the other hand, with recent advances made by deep neural networks in computer vision, a rich representation can be obtained from images that discriminates different categorizes and therefore obtaining a unsupervised information from images is made possible. However, in the previous works, little attention has been paid to using such unsupervised information for the task of zero-shot learning. In this...

محتواي کتاب

Deep Semi-Supervised Text Classification

, M.Sc. Thesis Sharif University of Technology Karimi, Ali (Author) ; Semati, Hossein (Supervisor)

Abstract

Large data sources labeled by experts at cost are essential for deep learning success in various domains. But, when labeling is expensive and labeled data is scarce, deep learning generally does not perform well. The goal of semi-supervised learning is to leverage abundant unlabeled data that one can easily collect. New semi-supervised algorithms based on data augmentation techniques have reached new advances in this field. In this work, by studying different textual augmentation techniques, a new approach is proposed that can obtain effective information signals from unlabeled data. The method encourages the model to generate the same representation vectors for different augmented versions...

محتواي کتاب