Loading...
Search for: semi-supervised-learning
0.013 seconds
Total 49 records

    Deep Zero-shot Learning

    , M.Sc. Thesis Sharif University of Technology Shojaee, Mohsen (Author) ; Soleymani, Mahdieh (Supervisor)
    Abstract
    In some of object recognition problems, labeled data may not be available for all categories. Zero-shot learning utilizes auxiliary information (also called signatures) describing each category in order to find a classifier that can recognize samples from categories with no labeled instance. On the other hand, with recent advances made by deep neural networks in computer vision, a rich representation can be obtained from images that discriminates different categorizes and therefore obtaining a unsupervised information from images is made possible. However, in the previous works, little attention has been paid to using such unsupervised information for the task of zero-shot learning. In this... 

    Deep Semi-Supervised Text Classification

    , M.Sc. Thesis Sharif University of Technology Karimi, Ali (Author) ; Semati, Hossein (Supervisor)
    Abstract
    Large data sources labeled by experts at cost are essential for deep learning success in various domains. But, when labeling is expensive and labeled data is scarce, deep learning generally does not perform well. The goal of semi-supervised learning is to leverage abundant unlabeled data that one can easily collect. New semi-supervised algorithms based on data augmentation techniques have reached new advances in this field. In this work, by studying different textual augmentation techniques, a new approach is proposed that can obtain effective information signals from unlabeled data. The method encourages the model to generate the same representation vectors for different augmented versions... 

    A Semi-Supervised Ensemble Learning Algorithm for Nonstationary Data Streams Classification

    , M.Sc. Thesis Sharif University of Technology Hosseini, Mohammad Javad (Author) ; Beigy, Hamid (Supervisor)
    Abstract
    Recent advances in storage and processing, have provided the ability of automatic gathering of information which in turn leads to fast and contineous flow of data. The data which are produced and stored in this way, are named data streams. data streams have many applications such as processing financial transactions, the recorded data of various sensors or the collected data by web sevices. Data streams are produced with high speed, large size and much dynamism and have some unique properties which make them applicable in precise modeling of many real data mining applications. The main challenge of data streams is the occurrence of concept drift which can be in four types: sudden, gradual,... 

    Data Labelling Using Manifold-Based Semi-Supervised Learning in Multispectral Remote Sensing

    , M.Sc. Thesis Sharif University of Technology Khajenezhad, Ahmad (Author) ; Rabiee, Hamid Reza (Supervisor) ; Safari, Mohammad Ali (Co-Advisor)
    Abstract
    Classification of hyperspectral remote sensing images is a challenging problem, because of the small number of labeled pixels, high dimensionality of the data and large number of pixels. In this context, semisupervised learning can improve the classification accuracy by extracting information form the distribution of all the labeled and unlabeled data. Among semi-supervised methods, manifold-based algorithms have been frequently used in recent years. In most of the previous works, manifolds are constructed according to spectral representation of data, while spatial dependency of pixel labels is an important property of the images in remote sensing applications. In this thesis, after... 

    Online Semi-supervised Learning and its Application in Image Classification

    , M.Sc. Thesis Sharif University of Technology Shaban, Amir Reza (Author) ; Rabiee, Hamid Reza (Supervisor)
    Abstract
    Image classification, i.e. the task of assigning an image to a class chosen from a predefined set of classes, has addressed in this thesis. At first the classifier is divided into two major sub partitions, feature extraction and classifier. Then we show that by using local feature extraction techniques such as BOW the classification accuracy will improve. In addition, using unlabeled data is argued as the fact to deal with high nonlinear structure of features. Recently, many SSL methods have been developed based on the manifold assumption in a batch mode. However, when data arrive sequentially and in large quantities, both computation and storage limitations become a bottleneck. So in large... 

    Context-based Persian Grapheme-to-Phoneme Conversion using Sequence-to-Sequence Models

    , M.Sc. Thesis Sharif University of Technology Rahmati, Elnaz (Author) ; Sameti, Hossein (Supervisor)
    Abstract
    Many Text-to-Speech (TTS) systems, particularly in low-resource environments, struggle to produce natural and intelligible speech from grapheme sequences. One solution to this problem is to use Grapheme-to-Phoneme (G2P) conversion to increase the information in the input sequence and improve the TTS output. However, current G2P systems are not accurate or efficient enough for Persian texts due to the language’s complexity and the lack of short vowels in Persian grapheme sequences. In our study, we aimed to improve resources for the Persian language. To achieve this, we introduced two new G2P training datasets, one manually-labeled and the other machine-generated, containing over five million... 

    An ensemble of cluster-based classifiers for semi-supervised classification of non-stationary data streams

    , Article Knowledge and Information Systems ; Volume 46, Issue 3 , 2016 , Pages 567-597 ; 02191377 (ISSN) Hosseini, M. J ; Gholipour, A ; Beigy, H ; Sharif University of Technology
    Springer-Verlag London Ltd 
    Abstract
    Recent advances in storage and processing have provided the possibility of automatic gathering of information, which in turn leads to fast and continuous flows of data. The data which are produced and stored in this way are called data streams. Data streams are produced in large size, and much dynamism and have some unique properties which make them applicable to model many real data mining applications. The main challenge of streaming data is the occurrence of concept drift. In addition, regarding the costs of labeling of instances, it is often assumed that only a small fraction of instances are labeled. In this paper, we propose an ensemble algorithm to classify instances of non-stationary... 

    Transductive multi-label learning from missing data using smoothed rank function

    , Article Pattern Analysis and Applications ; Volume 23, Issue 3 , 2020 , Pages 1225-1233 Esmaeili, A ; Behdin, K ; Fakharian, M. A ; Marvasti, F ; Sharif University of Technology
    Springer  2020
    Abstract
    In this paper, we propose two new algorithms for transductive multi-label learning from missing data. In transductive matrix completion (MC), the challenge is prediction while the data matrix is partially observed. The joint MC and prediction tasks are addressed simultaneously to enhance accuracy in comparison with separate tackling of each. In this setting, the labels to be predicted are modeled as missing entries inside a stacked matrix along the feature-instance data. Assuming the data matrix is of low rank, we propose a new recommendation method for transductive MC by posing the problem as a minimization of the smoothed rank function with non-affine constraints, rather than its convex... 

    Combining Supervised and Semi-Supervised Learning in the Design of a New Identifier for NPPs Transients

    , Article IEEE Transactions on Nuclear Science ; Volume 63, Issue 3 , 2016 , Pages 1882-1888 ; 00189499 (ISSN) Moshkbar Bakhshayesh, K ; Ghofrani, M. B ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2016
    Abstract
    This study introduces a new identifier for nuclear power plants (NPPs) transients. The proposed identifier performs its function in two steps. First, the transient is identified by the previously developed supervised classifier combining ARIMA model and EBP algorithm. In the second step, the patterns of unknown transients are fed to the identifier based on the semi-supervised learning (SSL). The transductive support vector machine (TSVM) as a semi-supervised algorithm is trained by the labeled data of transients to predict some unlabeled data. The labeled and newly predicted data is then used to train the TSVM for another portion of unlabeled data. Training and prediction is continued until... 

    An efficient semi-supervised multi-label classifier capable of handling missing labels

    , Article IEEE Transactions on Knowledge and Data Engineering ; 2018 ; 10414347 (ISSN) Hosseini Akbarnejad, A ; Soleymani Baghshah, M ; Sharif University of Technology
    IEEE Computer Society  2018
    Abstract
    Multi-label classification has received considerable interest in recent years. Multi-label classifiers usually need to address many issues including: handling large-scale datasets with many instances and a large set of labels, compensating missing label assignments in the training set, considering correlations between labels, as well as exploiting unlabeled data to improve prediction performance. To tackle datasets with a large set of labels, embedding-based methods represent the label assignments in a low dimensional space. Many state-of-the-art embedding-based methods use a linear dimensionality reduction to map the label assignments to a low-dimensional space. However, by doing so, these... 

    An Efficient semi-supervised multi-label classifier capable of handling missing labels

    , Article IEEE Transactions on Knowledge and Data Engineering ; Volume 31, Issue 2 , 2019 , Pages 229-242 ; 10414347 (ISSN) Hosseini Akbarnejad, A ; Soleymani Baghshah, M ; Sharif University of Technology
    IEEE Computer Society  2019
    Abstract
    Multi-label classification has received considerable interest in recent years. Multi-label classifiers usually need to address many issues including: handling large-scale datasets with many instances and a large set of labels, compensating missing label assignments in the training set, considering correlations between labels, as well as exploiting unlabeled data to improve prediction performance. To tackle datasets with a large set of labels, embedding-based methods represent the label assignments in a low-dimensional space. Many state-of-the-art embedding-based methods use a linear dimensionality reduction to map the label assignments to a low-dimensional space. However, by doing so, these... 

    Incremental evolving domain adaptation

    , Article IEEE Transactions on Knowledge and Data Engineering ; Volume 28, Issue 8 , 2016 , Pages 2128-2141 ; 10414347 (ISSN) Bitarafan, A ; Soleymani Baghshah, M ; Gheisari, M ; Sharif University of Technology
    IEEE Computer Society 
    Abstract
    Almost all of the existing domain adaptation methods assume that all test data belong to a single stationary target distribution. However, in many real world applications, data arrive sequentially and the data distribution is continuously evolving. In this paper, we tackle the problem of adaptation to a continuously evolving target domain that has been recently introduced. We assume that the available data for the source domain are labeled but the examples of the target domain can be unlabeled and arrive sequentially. Moreover, the distribution of the target domain can evolve continuously over time. We propose the Evolving Domain Adaptation (EDA) method that first finds a new feature space... 

    Automatic image annotation using semi-supervised generative modeling

    , Article Pattern Recognition ; Volume 48, Issue 1 , January , 2015 , Pages 174-188 ; 00313203 (ISSN) Amiri, S. H ; Jamzad, M ; Sharif University of Technology
    Elsevier Ltd  2015
    Abstract
    Image annotation approaches need an annotated dataset to learn a model for the relation between images and words. Unfortunately, preparing a labeled dataset is highly time consuming and expensive. In this work, we describe the development of an annotation system in semi-supervised learning framework which by incorporating unlabeled images into training phase reduces the system demand to labeled images. Our approach constructs a generative model for each semantic class in two main steps. First, based on Gamma distribution, a generative model is constructed for each semantic class using labeled images in that class. The second step incorporates the unlabeled images by using a modified EM... 

    Classification of NPPs transients using change of representation technique: A hybrid of unsupervised MSOM and supervised SVM

    , Article Progress in Nuclear Energy ; Volume 117 , 2019 ; 01491970 (ISSN) Moshkbar Bakhshayesh, K ; Mohtashami, S ; Sharif University of Technology
    Elsevier Ltd  2019
    Abstract
    This study introduces a new identifier for nuclear power plants (NPPs) transients. The proposed identifier changes the representation of input patterns. Change of representation is a semi-supervised learning algorithm which employs both of labeled and unlabeled input data. In the first step, modified self-organizing map (MSOM) carries out an unsupervised learning algorithm on labeled and unlabeled patterns and generates a new metric for input data. In the second step, support vector machine (SVM) as a supervised learning algorithm classifies the input patterns using the generated metric of the first step. In contrast to unsupervised learning algorithms, the proposed identifier does not... 

    Network-based direction of movement prediction in financial markets

    , Article Engineering Applications of Artificial Intelligence ; Volume 88 , February , 2020 Kia, A. N ; Haratizadeh, S ; Shouraki, S. B ; Sharif University of Technology
    Elsevier Ltd  2020
    Abstract
    Market prediction has been an important research problem for decades. Having better predictive models that are both more accurate and faster has been attractive for both researchers and traders. Among many approaches, semi-supervised graph-based prediction has been used as a solution in recent researches. Based on this approach, we present two prediction models. In the first model, a new network structure is introduced that can capture more information about markets’ direction of movements compared to the previous state of the art methods. Based on this novel network, a new algorithm for semi-supervised label propagation is designed that is able to prediction the direction of movement faster... 

    Semi-supervised parallel shared encoders for speech emotion recognition

    , Article Digital Signal Processing: A Review Journal ; Volume 118 , 2021 ; 10512004 (ISSN) Pourebrahim, Y ; Razzazi, F ; Sameti, H ; Sharif University of Technology
    Elsevier Inc  2021
    Abstract
    Supervised speech emotion recognition requires a large number of labeled samples that limit its use in practice. Due to easy access to unlabeled samples, a new semi-supervised method based on auto-encoders is proposed in this paper for speech emotion recognition. The proposed method performed the classification operation by extracting the information contained in unlabeled samples and combining it with the information in labeled samples. In addition, it employed maximum mean discrepancy cost function to reduce the distribution difference when the labeled and unlabeled samples were gathered from different datasets. Experimental results obtained on different emotional speech datasets... 

    ACoPE: An adaptive semi-supervised learning approach for complex-policy enforcement in high-bandwidth networks

    , Article Computer Networks ; Volume 166 , 2020 Noferesti, M ; Jalili, R ; Sharif University of Technology
    Elsevier B.V  2020
    Abstract
    Today's high-bandwidth networks require adaptive analyzing approaches to recognize the network variable behaviors. The analyzing approaches should be robust against the lack of prior knowledge and provide data to impose more complex policies. In this paper, ACoPE is proposed as an adaptive semi-supervised learning approach for complex-policy enforcement in high-bandwidth networks. ACoPE detects and maintains inter-flows relationships to impose complex-policies. It employs a statistical process control technique to monitor accuracy. Whenever the accuracy decreased, ACoPE considers it as a changed behavior and uses data from a deep packet inspection module to adapt itself with the change. The... 

    One step toward a richer model of unsupervised grammar induction

    , Article International Conference on Recent Advances in Natural Language Processing, RANLP 2005, 21 September 2005 through 23 September 2005 ; Volume 2005-January , 2005 , Pages 197-203 ; 13138502 (ISSN) ; 9549174336 (ISBN) Feili, H ; Ghassem Sani, G. R ; Angelova G ; Bontcheva K ; Mitkov R ; Nicolov N ; Nikolov N ; Sharif University of Technology
    Association for Computational Linguistics (ACL)  2005
    Abstract
    Probabilistic Context-Free Grammars (PCFGs) are useful tools for syntactic analysis of natural languages. Availability of large Treebank has encouraged many researchers to use PCFG in language modeling. Automatic learning of PCFGs is divided into three different categories, based on the needed data set for the training phase: supervised, semi-supervised and unsupervised. Most current inductive methods are supervised, which need a bracketed data set in the training phase. However, lack of this kind of data set in many languages, has encouraged us to pay more attention to unsupervised approaches. So far, unsupervised approaches have achieved little success. By considering a history-based... 

    Leveraging multi-modal fusion for graph-based image annotation

    , Article Journal of Visual Communication and Image Representation ; Volume 55 , 2018 , Pages 816-828 ; 10473203 (ISSN) Amiri, S. H ; Jamzad, M ; Sharif University of Technology
    Academic Press Inc  2018
    Abstract
    Considering each of the visual features as one modality in image annotation task, efficient fusion of different modalities is essential in graph-based learning. Traditional graph-based methods consider one node for each image and combine its visual features into a single descriptor before constructing the graph. In this paper, we propose an approach that constructs a subgraph for each modality in such a way that edges of subgraph are determined using a search-based approach that handles class-imbalance challenge in the annotation datasets. Multiple subgraphs are then connected to each other to have a supergraph. This follows by introducing a learning framework to infer the tags of... 

    Exploiting multiview properties in semi-supervised video classification

    , Article 2012 6th International Symposium on Telecommunications, IST 2012 ; 2012 , Pages 837-842 ; 9781467320733 (ISBN) Karimian, M ; Tavassolipour, M ; Kasaei, S ; Sharif University of Technology
    Abstract
    In large databases, availability of labeled training data is mostly prohibitive in classification. Semi-supervised algorithms are employed to tackle the lack of labeled training data problem. Video databases are the epitome for such a scenario; that is why semi-supervised learning has found its niche in it. Graph-based methods are a promising platform for semi-supervised video classification. Based on the multiview characteristic of video data, different features have been proposed (such as SIFT, STIP and MFCC) which can be utilized to build a graph. In this paper, we have proposed a new classification method which fuses the results of manifold regularization over different graphs. Our...