Loading...
Search for: semi-supervised-learning
0.004 seconds
Total 49 records

    Identification and Forecasting of Nuclear Power Plants Transients by Semi-Supervised Method with Change of Representation Technique

    , M.Sc. Thesis Sharif University of Technology Mirzaei Dam-Abi, Ali (Author) ; Ghofrani, Mohamad Bagher (Supervisor) ; Moshkbar Bakhshayesh, Khalil (Supervisor)
    Abstract
    In this work, we aim to find a way to identify and forecast transients in nuclear power plants with the aid of semi-supervised machine learning algorithm. Forecasting and identifying transients in nuclear power plants at the early stages of formation are essential for safety considerations and precautionary measures. The use of machine learning algorithms provides an intelligent control mechanism that, along with the main operator of the power plant, raises the transient detection and identification rate. Our algorithm of choice is to change the way data is presented, which is a semi-supervised learning approach. The algorithm consists of two methods: quantum dynamics clustering... 

    3D Medical Images Segmentation by Effective Use of Unlabeled Data

    , M.Sc. Thesis Sharif University of Technology Khalili, Hossein (Author) ; Soleymani Baghshah, Mahdieh (Supervisor)
    Abstract
    Image segmentation in medical imaging, as one of the most important branches of medical image analysis, often faces the challenge of limited labeled data for application in deep learning methods. The high cost of data collection and the need for expertise in image segmentation, particularly in three-dimensional images such as MRI and CT or sequence images like CMR, have all contributed to this problem, even for popular networks like U-Net, which struggle to achieve high accuracy. As a result, research efforts have focused on semi-supervised learning approaches, weakly supervised learning, as well as multi-instance learning in medical image segmentation. Unfortunately, each of these methods... 

    Context-based Persian Grapheme-to-Phoneme Conversion using Sequence-to-Sequence Models

    , M.Sc. Thesis Sharif University of Technology Rahmati, Elnaz (Author) ; Sameti, Hossein (Supervisor)
    Abstract
    Many Text-to-Speech (TTS) systems, particularly in low-resource environments, struggle to produce natural and intelligible speech from grapheme sequences. One solution to this problem is to use Grapheme-to-Phoneme (G2P) conversion to increase the information in the input sequence and improve the TTS output. However, current G2P systems are not accurate or efficient enough for Persian texts due to the language’s complexity and the lack of short vowels in Persian grapheme sequences. In our study, we aimed to improve resources for the Persian language. To achieve this, we introduced two new G2P training datasets, one manually-labeled and the other machine-generated, containing over five million... 

    Continual Learning Using Unsupervised Data

    , M.Sc. Thesis Sharif University of Technology Ameli Kalkhoran, Amir Hossein (Author) ; Soleymani Baghshah, Mahdieh (Supervisor)
    Abstract
    The existing continual learning methods are mainly focused on fully-supervised scenarios and are still not able to take advantage of unlabeled data available in the environment. Some recent works tried to investigate semi-supervised continual learning (SSCL) settings in which the unlabeled data are available, but it is only from the same distribution as the labeled data. This assumption is still not general enough for real-world applications and restricts the utilization of unsupervised data. In this work, we introduce Open-Set Semi-Supervised Continual Learning (OSSCL), a more realistic semi-supervised continual learning setting in which out-of-distribution (OoD) unlabeled samples in the... 

    Weakly Supervised Semantic Segmentation Using Deep Neural Networks

    , M.Sc. Thesis Sharif University of Technology Khairi Atani, Masoud (Author) ; Kasaei, Shohreh (Supervisor)
    Abstract
    Semantic segmentation which is the classification of every pixel in an input image is a fundamental task in the fields of computer vision and scene understanding. Applications of semantic segmentation include usage in autonomous vehicles and robotics. Since in this task dense annotation of images in the dataset is needed, recent methods have been proposed to utilize weakly-supervised and semi-supervised learning using data with weak labels and unlabeled data respectively. Because the amount of fully labeled data might not be sufficient in such methods, some papers have proposed to employ depth input data due to its rich geometrical and local information when available. In this research, an... 

    Deep Semi-Supervised Text Classification

    , M.Sc. Thesis Sharif University of Technology Karimi, Ali (Author) ; Semati, Hossein (Supervisor)
    Abstract
    Large data sources labeled by experts at cost are essential for deep learning success in various domains. But, when labeling is expensive and labeled data is scarce, deep learning generally does not perform well. The goal of semi-supervised learning is to leverage abundant unlabeled data that one can easily collect. New semi-supervised algorithms based on data augmentation techniques have reached new advances in this field. In this work, by studying different textual augmentation techniques, a new approach is proposed that can obtain effective information signals from unlabeled data. The method encourages the model to generate the same representation vectors for different augmented versions... 

    Behavior-Driven Security Policy Enforcement on High Bandwidth Networks

    , Ph.D. Dissertation Sharif University of Technology Noferesti, Morteza (Author) ; Jalili, Rasool (Supervisor)
    Abstract
    High-bandwidth network analysis is challenging, resource consuming, and inaccurate due to the high volume, velocity, and variety characteristics of the network traffic. Today's high-bandwidth networks require adaptive analyzing approaches to recognize the network variable behaviors. The analyzing approaches should be robust against the lack of prior knowledge and provide data to impose more complex policies.This thesis introduces complex policy relation and proposes a two-layer framework to enforce complex policies, named HB2DS. The proposed framework is equipped with the mechanism and policy layers. The mechanism layer processes network packets header and payload to generate a flow stream.... 

    Deep Zero-shot Learning

    , M.Sc. Thesis Sharif University of Technology Shojaee, Mohsen (Author) ; Soleymani, Mahdieh (Supervisor)
    Abstract
    In some of object recognition problems, labeled data may not be available for all categories. Zero-shot learning utilizes auxiliary information (also called signatures) describing each category in order to find a classifier that can recognize samples from categories with no labeled instance. On the other hand, with recent advances made by deep neural networks in computer vision, a rich representation can be obtained from images that discriminates different categorizes and therefore obtaining a unsupervised information from images is made possible. However, in the previous works, little attention has been paid to using such unsupervised information for the task of zero-shot learning. In this... 

    Adaptation for Evolving Domains

    , M.Sc. Thesis Sharif University of Technology Bitarafan, Adeleh (Author) ; Soleymani Baghshah, Mahdieh (Supervisor)
    Abstract
    Until now many domain adaptation methods have been proposed. A major limitation of almost all of these methods is their assumption that all test data belong to a single stationary target distribution and a large amount of unlabeled data is available for modeling this target distribution. In fact, in many real world applications, such as classifying scene image with gradually changing lighting and spam email identification, data arrives sequentially and the data distribution is continuously evolving. In this thesis, we tackle the problem of adaptation to a continuously evolving target domain that has been recently introduced and propose the Evolving Domain Adaptation (EDA) method to classify... 

    Image Annotation Using Semi-supervised Learning

    , Ph.D. Dissertation Sharif University of Technology Amiri, Hamid (Author) ; Jamzad, Mansour (Supervisor)
    Abstract
    Aautomatic image annotation that assigns some labels to input images and provides a textual description for the contents of images has become an active field in machine vision community. To design an annotation system, we need a dataset that contains images and labels for them. However, a large amount of manual efforts is required to annotate all images in a dataset. To reduce the demand of annotation systems on the labeled images, one solution is to exploit useful information embedded into the unlabeled images and incorporate them into learning process. In machine learning community, semi-supervised learning (SSL) has been introduced with the aim of incorporating unlabeled samples into the... 

    Unsupervised Domain Adaptation via Representation Learning

    , M.Sc. Thesis Sharif University of Technology Gheisary, Marzieh (Author) ; Soleymani, Mahdieh (Supervisor)
    Abstract
    The existing learning methods usually assume that training and test data follow the same distribution, while this is not always true. Thus, in many cases the performance of these learning methods on the test data will be severely degraded. We often have sufficient labeled training data from a source domain but wish to learn a classifier which performs well on a target domain with a different distribution and no labeled training data. In this thesis, we study the problem of unsupervised domain adaptation, where no labeled data in the target domain is available. We propose a framework which finds a new representation for both the source and the target domain in which the distance between these... 

    Online Distance Metric Learning

    , M.Sc. Thesis Sharif University of Technology Vazifedan, Afrooz (Author) ; Beigy, Hamid (Supervisor)
    Abstract
    Distance Metric Learning algorithms have been widely used in Machine Learning methods recently. In these algorithms a distance function between objecs (data points) is learned based on their labels or similarity and dissimilarity constraints. Recent works have shown that a good precision is obtained in classification or clustering methods which use these functions. Since in the current systems many of data points do not exist at the beginning and are added to the training set as the algorithm is run, online methods are needed to update learned metric due to new data.
    In this thesis, we proposed a new online distance metric learning method that has higher performance than existing... 

    Fault Detection and Smart Monitoring of Industrial Fans Based on Vibration Signals

    , M.Sc. Thesis Sharif University of Technology Moeeni, Hamed (Author) ; Manzuri Shalmani, Mohammad Taghi (Supervisor)
    Abstract
    Data Oriented Smart Monitoring for Industrial Machineries include approaches for fault detection and prognosis which only rely on non-stationary signals sampled from sensors and do not rely on physical model of machineries nor expert knowledge. Fault detection is task of determining state of machinery in present moment using past data. But in Prognosis focus is on predicting future state of machinery using past data. Most researches in this category are based on supervised algorithms, but in many applications labeling data is expensive. In this thesis some approaches for semi-superviseddiagnosis, based on markov random walk an K-NN have been implemented, also some improvements for K-NN have... 

    Improving Graph Construction for Semi-supervised Learning in Computer Vision Applications

    , M.Sc. Thesis Sharif University of Technology Mahdieh, Mostafa (Author) ; Rabiee, Hamid Reza (Supervisor)
    Abstract
    Semi-supervised Learning (SSL) is an extremely useful approach in many applications where unlabeled data can be easily obtained. Graph based methods are among the most studied branches in SSL. Since neighborhood graph is a key component in these methods, we focus on methods of graph construction in this project. Graph construction methods based on Euclidean distance have the common problem of creating shortcut edges. Shortcut edges refer to the edges which connect two nearby points that are far apart on the manifold. Specifically, we show both in theory and practice that using geodesic distance for selecting and weighting edges results in more appropriate neighborhood graphs. We propose an... 

    Online Semi-supervised Learning and its Application in Image Classification

    , M.Sc. Thesis Sharif University of Technology Shaban, Amir Reza (Author) ; Rabiee, Hamid Reza (Supervisor)
    Abstract
    Image classification, i.e. the task of assigning an image to a class chosen from a predefined set of classes, has addressed in this thesis. At first the classifier is divided into two major sub partitions, feature extraction and classifier. Then we show that by using local feature extraction techniques such as BOW the classification accuracy will improve. In addition, using unlabeled data is argued as the fact to deal with high nonlinear structure of features. Recently, many SSL methods have been developed based on the manifold assumption in a batch mode. However, when data arrive sequentially and in large quantities, both computation and storage limitations become a bottleneck. So in large... 

    Data Labelling Using Manifold-Based Semi-Supervised Learning in Multispectral Remote Sensing

    , M.Sc. Thesis Sharif University of Technology Khajenezhad, Ahmad (Author) ; Rabiee, Hamid Reza (Supervisor) ; Safari, Mohammad Ali (Co-Advisor)
    Abstract
    Classification of hyperspectral remote sensing images is a challenging problem, because of the small number of labeled pixels, high dimensionality of the data and large number of pixels. In this context, semisupervised learning can improve the classification accuracy by extracting information form the distribution of all the labeled and unlabeled data. Among semi-supervised methods, manifold-based algorithms have been frequently used in recent years. In most of the previous works, manifolds are constructed according to spectral representation of data, while spatial dependency of pixel labels is an important property of the images in remote sensing applications. In this thesis, after... 

    A Semi-Supervised Ensemble Learning Algorithm for Nonstationary Data Streams Classification

    , M.Sc. Thesis Sharif University of Technology Hosseini, Mohammad Javad (Author) ; Beigy, Hamid (Supervisor)
    Abstract
    Recent advances in storage and processing, have provided the ability of automatic gathering of information which in turn leads to fast and contineous flow of data. The data which are produced and stored in this way, are named data streams. data streams have many applications such as processing financial transactions, the recorded data of various sensors or the collected data by web sevices. Data streams are produced with high speed, large size and much dynamism and have some unique properties which make them applicable in precise modeling of many real data mining applications. The main challenge of data streams is the occurrence of concept drift which can be in four types: sudden, gradual,... 

    Semi-Supervised Kernel Learning for Pattern Classification

    , Ph.D. Dissertation Sharif University of Technology Rohban, Mohammad Hossein (Author) ; Rabiee, Hamid Reza (Supervisor)
    Abstract
    Supervised kernel learning has been the focus of research in recent years. Although these methods are developed based on rigorous frameworks, they fail to improve the classification accuracy in real world applications. In order to find the origin of this problem, it should be noted that the kernel function represents a prior knowledge on the labeling function. Similar to other learning problem, learning this prior knowledge needs another prior knowledge. In supervised kernel learning, only naive assumptions can be used as the prior knowledge. These include minimizing the ℓ1 and ℓ2 norms of the kernel parameters.
    As an alternative approach, in Semi-Supervised Learning (SSL), unlabeled... 

    Persian Statistical Natural Language Understanding Based on Partially Annotated Corpus

    , M.Sc. Thesis Sharif University of Technology Jabbari, Fattaneh (Author) ; Sameti, Hossein (Supervisor)
    Abstract
    Spoken language understanding unit is one of the most important parts of a spoken dialogue system. The input of this system is the output of speech recognition unit. The main function of this unit is to extract the semantic information from the input utterances. There are two main types of approaches to do this task: rule-based approaches, and data-driven approaches. Today data-driven approaches are of more interest because they are more flexible and robust compared to the rule-based approaches. The main drawback of these methods is that they need a large amount of fully annotated or in some cases Treebank data. Preparing such data is time consuming and expensive. The goal of this thesis is... 

    Semi-supervised Learning and its Application to Image Categorization

    , M.Sc. Thesis Sharif University of Technology Farajtabar, Mehrdad (Author) ; Rabiee, Hamid Reza (Supervisor)
    Abstract
    Traditional methods for data classification only make use of the labeled data. However, in most of the applications, labeling the unlabeled data is expensive, time consuming and requires expert knowledge. To overcome these problems, Semi-supervised Learning (SSL) methods have become an area of recent research that aim to effectively addressing the problem of limited labeled data.One of the recently introduced SSL methods is the classification based on geometric structure of the data, namely the data manifold. In this approach unlabeled data is utilized to recover the underlying structure of the data. The common assumption is that despite of being represented in a high dimensional space, data...