Semi-supervised parallel shared encoders for speech emotion recognition

, Article Digital Signal Processing: A Review Journal ; Volume 118 , 2021 ; 10512004 (ISSN) Pourebrahim, Y ; Razzazi, F ; Sameti, H ; Sharif University of Technology

Elsevier Inc 2021

Abstract

Supervised speech emotion recognition requires a large number of labeled samples, which limits its use in practice. Because unlabeled samples are easy to obtain, a new semi-supervised method based on auto-encoders is proposed in this paper for speech emotion recognition. The proposed method performs classification by extracting the information contained in unlabeled samples and combining it with the information in labeled samples. In addition, it employs a maximum mean discrepancy cost function to reduce the distribution difference when the labeled and unlabeled samples are gathered from different datasets. Experimental results obtained on different emotional speech datasets...
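The maximum mean discrepancy mentioned in this abstract can be illustrated with a minimal sketch. It computes the standard biased estimate of squared MMD between two sample sets; the RBF kernel, the `gamma` value, and the synthetic Gaussian data are assumptions for illustration and are not taken from the paper, whose actual cost function and features are not given in the truncated abstract.

```python
import numpy as np

def rbf_kernel(a, b, gamma):
    # Pairwise RBF kernel values exp(-gamma * ||a_i - b_j||^2).
    sq = (a ** 2).sum(axis=1)[:, None] + (b ** 2).sum(axis=1)[None, :] - 2.0 * a @ b.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def mmd2(x, y, gamma=0.1):
    # Biased estimate of squared MMD: mean k(x,x) + mean k(y,y) - 2 * mean k(x,y).
    return (rbf_kernel(x, x, gamma).mean()
            + rbf_kernel(y, y, gamma).mean()
            - 2.0 * rbf_kernel(x, y, gamma).mean())

# Hypothetical data: "source" and "same" share a distribution, "shifted" does not.
rng = np.random.default_rng(0)
source = rng.normal(0.0, 1.0, size=(200, 5))
same = rng.normal(0.0, 1.0, size=(200, 5))
shifted = rng.normal(2.0, 1.0, size=(200, 5))

mmd_same = mmd2(source, same)
mmd_shifted = mmd2(source, shifted)
print(mmd_same, mmd_shifted)
```

A larger value indicates a larger gap between the two sample distributions, which is why such a term can serve as a penalty when labeled and unlabeled data come from different datasets.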

Transductive multi-label learning from missing data using smoothed rank function

, Article Pattern Analysis and Applications ; Volume 23, Issue 3 , 2020 , Pages 1225-1233 Esmaeili, A ; Behdin, K ; Fakharian, M. A ; Marvasti, F ; Sharif University of Technology

Springer 2020

Abstract

In this paper, we propose two new algorithms for transductive multi-label learning from missing data. In transductive matrix completion (MC), the challenge is prediction while the data matrix is partially observed. The joint MC and prediction tasks are addressed simultaneously to enhance accuracy in comparison with tackling each separately. In this setting, the labels to be predicted are modeled as missing entries inside a stacked matrix alongside the feature-instance data. Assuming the data matrix is of low rank, we propose a new recommendation method for transductive MC by posing the problem as a minimization of the smoothed rank function with non-affine constraints, rather than its convex...

Network-based direction of movement prediction in financial markets

, Article Engineering Applications of Artificial Intelligence ; Volume 88 , February , 2020 Kia, A. N ; Haratizadeh, S ; Shouraki, S. B ; Sharif University of Technology

Elsevier Ltd 2020

Abstract

Market prediction has been an important research problem for decades. Predictive models that are both more accurate and faster have been attractive to both researchers and traders. Among many approaches, semi-supervised graph-based prediction has been used as a solution in recent research. Based on this approach, we present two prediction models. In the first model, a new network structure is introduced that can capture more information about markets’ direction of movement compared to the previous state-of-the-art methods. Based on this novel network, a new algorithm for semi-supervised label propagation is designed that is able to predict the direction of movement faster...

ACoPE: An adaptive semi-supervised learning approach for complex-policy enforcement in high-bandwidth networks

, Article Computer Networks ; Volume 166 , 2020 Noferesti, M ; Jalili, R ; Sharif University of Technology

Elsevier B.V 2020

Abstract

Today's high-bandwidth networks require adaptive analysis approaches to recognize variable network behaviors. These approaches should be robust against the lack of prior knowledge and provide data to impose more complex policies. In this paper, ACoPE is proposed as an adaptive semi-supervised learning approach for complex-policy enforcement in high-bandwidth networks. ACoPE detects and maintains inter-flow relationships to impose complex policies. It employs a statistical process control technique to monitor accuracy. Whenever the accuracy decreases, ACoPE treats it as a change in behavior and uses data from a deep packet inspection module to adapt itself to the change. The...

Classification of NPPs transients using change of representation technique: A hybrid of unsupervised MSOM and supervised SVM

, Article Progress in Nuclear Energy ; Volume 117 , 2019 ; 01491970 (ISSN) Moshkbar Bakhshayesh, K ; Mohtashami, S ; Sharif University of Technology

Elsevier Ltd 2019

Abstract

This study introduces a new identifier for nuclear power plant (NPP) transients. The proposed identifier changes the representation of input patterns. Change of representation is a semi-supervised learning algorithm which employs both labeled and unlabeled input data. In the first step, a modified self-organizing map (MSOM) carries out an unsupervised learning algorithm on labeled and unlabeled patterns and generates a new metric for input data. In the second step, a support vector machine (SVM), as a supervised learning algorithm, classifies the input patterns using the metric generated in the first step. In contrast to unsupervised learning algorithms, the proposed identifier does not...

An Efficient semi-supervised multi-label classifier capable of handling missing labels

, Article IEEE Transactions on Knowledge and Data Engineering ; Volume 31, Issue 2 , 2019 , Pages 229-242 ; 10414347 (ISSN) Hosseini Akbarnejad, A ; Soleymani Baghshah, M ; Sharif University of Technology

IEEE Computer Society 2019

Abstract

Multi-label classification has received considerable interest in recent years. Multi-label classifiers usually need to address many issues, including handling large-scale datasets with many instances and a large set of labels, compensating for missing label assignments in the training set, considering correlations between labels, and exploiting unlabeled data to improve prediction performance. To tackle datasets with a large set of labels, embedding-based methods represent the label assignments in a low-dimensional space. Many state-of-the-art embedding-based methods use a linear dimensionality reduction to map the label assignments to a low-dimensional space. However, by doing so, these...

Leveraging multi-modal fusion for graph-based image annotation

, Article Journal of Visual Communication and Image Representation ; Volume 55 , 2018 , Pages 816-828 ; 10473203 (ISSN) Amiri, S. H ; Jamzad, M ; Sharif University of Technology

Academic Press Inc 2018

Abstract

Considering each of the visual features as one modality in the image annotation task, efficient fusion of different modalities is essential in graph-based learning. Traditional graph-based methods consider one node for each image and combine its visual features into a single descriptor before constructing the graph. In this paper, we propose an approach that constructs a subgraph for each modality in such a way that the edges of the subgraph are determined using a search-based approach that handles the class-imbalance challenge in annotation datasets. Multiple subgraphs are then connected to each other to form a supergraph. This is followed by introducing a learning framework to infer the tags of...

An efficient semi-supervised multi-label classifier capable of handling missing labels

, Article IEEE Transactions on Knowledge and Data Engineering ; 2018 ; 10414347 (ISSN) Hosseini Akbarnejad, A ; Soleymani Baghshah, M ; Sharif University of Technology

IEEE Computer Society 2018

Abstract

Multi-label classification has received considerable interest in recent years. Multi-label classifiers usually need to address many issues, including handling large-scale datasets with many instances and a large set of labels, compensating for missing label assignments in the training set, considering correlations between labels, and exploiting unlabeled data to improve prediction performance. To tackle datasets with a large set of labels, embedding-based methods represent the label assignments in a low-dimensional space. Many state-of-the-art embedding-based methods use a linear dimensionality reduction to map the label assignments to a low-dimensional space. However, by doing so, these...

Combining Supervised and Semi-Supervised Learning in the Design of a New Identifier for NPPs Transients

, Article IEEE Transactions on Nuclear Science ; Volume 63, Issue 3 , 2016 , Pages 1882-1888 ; 00189499 (ISSN) Moshkbar Bakhshayesh, K ; Ghofrani, M. B ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2016

Abstract

This study introduces a new identifier for nuclear power plant (NPP) transients. The proposed identifier performs its function in two steps. First, the transient is identified by the previously developed supervised classifier combining the ARIMA model and the EBP algorithm. In the second step, the patterns of unknown transients are fed to the identifier based on semi-supervised learning (SSL). The transductive support vector machine (TSVM), as a semi-supervised algorithm, is trained on the labeled data of transients to predict some unlabeled data. The labeled and newly predicted data are then used to train the TSVM for another portion of unlabeled data. Training and prediction are continued until...
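The portion-by-portion training described in this abstract can be sketched as a generic self-training loop. This is not the authors' TSVM: a nearest-centroid classifier is swapped in as a stand-in so the example stays self-contained, and the data, the `portion` size, and the helper names are hypothetical. The abstract's stopping condition is cut off, so the sketch simply stops when no unlabeled points remain, which is an assumption.

```python
import numpy as np

def fit_centroids(x, y, n_classes):
    # Stand-in classifier: one mean vector per class.
    return np.stack([x[y == c].mean(axis=0) for c in range(n_classes)])

def predict(centroids, x):
    # Assign each point to its nearest class centroid.
    d2 = ((x[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def self_train(x_lab, y_lab, x_unlab, n_classes, portion=10):
    x_train, y_train = x_lab.copy(), y_lab.copy()
    remaining = x_unlab.copy()
    # Assumed stopping rule: continue until every unlabeled point is used.
    while len(remaining) > 0:
        centroids = fit_centroids(x_train, y_train, n_classes)
        batch, remaining = remaining[:portion], remaining[portion:]
        pseudo = predict(centroids, batch)          # label one portion
        x_train = np.vstack([x_train, batch])       # add it to the training set
        y_train = np.concatenate([y_train, pseudo])
    return fit_centroids(x_train, y_train, n_classes)

# Hypothetical data: two well-separated clusters, three labeled points each.
rng = np.random.default_rng(0)
x0 = rng.normal(0.0, 1.0, size=(23, 2))
x1 = rng.normal(8.0, 1.0, size=(23, 2))
x_lab = np.vstack([x0[:3], x1[:3]])
y_lab = np.array([0, 0, 0, 1, 1, 1])
order = rng.permutation(40)
x_unlab = np.vstack([x0[3:], x1[3:]])[order]
y_true = np.array([0] * 20 + [1] * 20)[order]

final_centroids = self_train(x_lab, y_lab, x_unlab, n_classes=2)
accuracy = (predict(final_centroids, x_unlab) == y_true).mean()
print(accuracy)
```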

Automatic image annotation using semi-supervised generative modeling

, Article Pattern Recognition ; Volume 48, Issue 1 , January , 2015 , Pages 174-188 ; 00313203 (ISSN) Amiri, S. H ; Jamzad, M ; Sharif University of Technology

Elsevier Ltd 2015

Abstract

Image annotation approaches need an annotated dataset to learn a model for the relation between images and words. Unfortunately, preparing a labeled dataset is highly time-consuming and expensive. In this work, we describe the development of an annotation system in a semi-supervised learning framework which, by incorporating unlabeled images into the training phase, reduces the system's demand for labeled images. Our approach constructs a generative model for each semantic class in two main steps. First, based on the Gamma distribution, a generative model is constructed for each semantic class using labeled images in that class. The second step incorporates the unlabeled images by using a modified EM...

Unilateral semi-supervised learning of extended hidden vector state for Persian language understanding

, Article NLP-KE 2011 - Proceedings of the 7th International Conference on Natural Language Processing and Knowledge Engineering, 27 November 2011 through 29 November 2011, Tokushima ; 2011 , Pages 165-168 ; 9781612847283 (ISBN) Jabbari, F ; Sameti, H ; Bokaei, M. H ; Chinese Association for Artificial Intelligence; IEEE Signal Processing Society ; Sharif University of Technology

2011

Abstract

The key element of a spoken dialogue system is the Spoken Language Understanding (SLU) part. HVS and EHVS are the two most popular statistical methods employed to implement the SLU part, and both need lightly annotated data. Since annotation is time-consuming, we present a novel semi-supervised learning method for EHVS to reduce the human labeling effort using two different statistical classifiers, SVM and KNN. Experiments are done on a Persian corpus, the University Information Kiosk corpus. The experimental results show improvements in the performance of semi-supervised EHVS, trained on both labeled and unlabeled data, compared to EHVS trained on just the initially labeled data. The performance of EHVS improves...

Isograph: Neighbourhood graph construction based on geodesic distance for semi-supervised learning

, Article Proceedings - IEEE International Conference on Data Mining, ICDM, 11 December 2011 through 14 December 2011 ; December , 2011 , Pages 191-200 ; 15504786 (ISSN) ; 9780769544083 (ISBN) Ghazvininejad, M ; Mahdieh, M ; Rabiee, H. R ; Roshan, P. K ; Rohban, M. H ; Sharif University of Technology

2011

Abstract

Semi-supervised learning based on manifolds has been the focus of extensive research in recent years. Convenient neighbourhood graph construction is a key component of a successful semi-supervised classification method. Previous graph construction methods fail when there are pairs of data points that have small Euclidean distance, but are far apart over the manifold. To overcome this problem, we start with an arbitrary neighbourhood graph and iteratively update the edge weights by using the estimates of the geodesic distances between points. Moreover, we provide theoretical bounds on the values of estimated geodesic distances. Experimental results on real-world data show significant...

Active learning from positive and unlabeled data

, Article Proceedings - IEEE International Conference on Data Mining, ICDM, 11 December 2011 through 11 December 2011 ; December , 2011 , Pages 244-250 ; 15504786 (ISSN) ; 9780769544090 (ISBN) Ghasemi, A ; Rabiee, H. R ; Fadaee, M ; Manzuri, M. T ; Rohban, M. H ; Sharif University of Technology

2011

Abstract

In recent years, active learning has evolved into a popular paradigm for utilizing user feedback to improve the accuracy of learning algorithms. Active learning works by selecting the most informative sample among the unlabeled data and querying the user for the label of that point. Many different methods, such as uncertainty sampling and minimum risk sampling, have been utilized to select the most informative sample in active learning. Although many active learning algorithms have been proposed so far, most of them work with binary or multi-class classification problems and therefore cannot be applied to problems in which only samples from one class as well as a set of unlabeled data are...

HMM based semi-supervised learning for activity recognition

, Article SAGAware'11 - Proceedings of the 2011 International Workshop on Situation Activity and Goal Awareness, 18 September 2011 through 18 September 2011, Beijing ; September , 2011 , Pages 95-99 ; 9781450309264 (ISBN) Ghazvininejad, M ; Rabiee, H. R ; Pourdamghani, N ; Khanipour, P ; Sharif University of Technology

2011

Abstract

In this paper, we introduce a novel method for human activity recognition that benefits from the structure and sequential properties of the test data as well as the training data. In the training phase, we obtain a fraction of data labels at constant time intervals and use them in a semi-supervised graph-based method for recognizing the user's activities. We use label propagation on a k-nearest neighbor graph to calculate the probability of association of the unlabeled data to each class in this phase. Then we use these probabilities to train an HMM in a way that each of its hidden states corresponds to one class of activity. These probabilities are used to learn the transition probabilities...
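The label propagation step on a k-nearest neighbor graph that this abstract describes can be sketched as follows. This is a minimal, generic clamped label propagation, not the authors' implementation; the toy one-dimensional data, `k`, the iteration count, and the function names are assumptions, and the subsequent HMM training is not covered.

```python
import numpy as np

def knn_affinity(x, k):
    # Symmetric 0/1 adjacency: i and j are linked if either is in the other's k nearest.
    n = len(x)
    d2 = ((x[:, None, :] - x[None, :, :]) ** 2).sum(axis=2)
    np.fill_diagonal(d2, np.inf)                 # exclude self-neighbors
    nbrs = np.argsort(d2, axis=1)[:, :k]
    w = np.zeros((n, n))
    w[np.repeat(np.arange(n), k), nbrs.ravel()] = 1.0
    return np.maximum(w, w.T)

def label_propagation(x, labels, n_classes, k=2, n_iter=500):
    # labels: class index for labeled points, -1 for unlabeled points.
    w = knn_affinity(x, k)
    p = w / w.sum(axis=1, keepdims=True)         # row-normalized transition matrix
    labeled = labels >= 0
    onehot = np.eye(n_classes)[labels[labeled]]
    f = np.full((len(x), n_classes), 1.0 / n_classes)
    f[labeled] = onehot
    for _ in range(n_iter):
        f = p @ f                                # spread class probabilities to neighbors
        f[labeled] = onehot                      # clamp the known labels
    return f

# Hypothetical data: two groups on a line, one labeled point in each group.
x = np.concatenate([np.arange(10.0), 100.0 + np.arange(10.0)])[:, None]
labels = np.full(20, -1)
labels[0], labels[19] = 0, 1

probs = label_propagation(x, labels, n_classes=2)
pred = probs.argmax(axis=1)
print(pred)
```

Each row of `probs` holds per-class probabilities for one sample; quantities of this kind are what the abstract says are then used to train the HMM.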

Manifold coarse graining for online semi-supervised learning

, Article Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 5 September 2011 through 9 September 2011 ; Volume 6911 LNAI, Issue PART 1 , September , 2011 , Pages 391-406 ; 03029743 (ISSN) ; 9783642237799 (ISBN) Farajtabar, M ; Shaban, A ; Rabiee, H. R ; Rohban, M. H ; Sharif University of Technology

2011

Abstract

When the amount of labeled data is not sufficient, Semi-Supervised Learning (SSL) methods utilize unlabeled data to enhance classification. Recently, many SSL methods have been developed based on the manifold assumption in a batch mode. However, when data arrive sequentially and in large quantities, both computation and storage limitations become a bottleneck. In this paper, we present a new semi-supervised coarse graining (CG) algorithm to reduce the required number of data points for preserving the manifold structure. First, an equivalent formulation of Label Propagation (LP) is derived. Then, a novel spectral view of the Harmonic Solution (HS) is proposed. Finally, an algorithm to reduce...

One step toward a richer model of unsupervised grammar induction

, Article International Conference on Recent Advances in Natural Language Processing, RANLP 2005, 21 September 2005 through 23 September 2005 ; Volume 2005-January , 2005 , Pages 197-203 ; 13138502 (ISSN) ; 9549174336 (ISBN) Feili, H ; Ghassem Sani, G. R ; Angelova G ; Bontcheva K ; Mitkov R ; Nicolov N ; Nikolov N ; Sharif University of Technology

Association for Computational Linguistics (ACL) 2005

Abstract

Probabilistic Context-Free Grammars (PCFGs) are useful tools for syntactic analysis of natural languages. The availability of large treebanks has encouraged many researchers to use PCFGs in language modeling. Automatic learning of PCFGs is divided into three different categories, based on the data set needed for the training phase: supervised, semi-supervised, and unsupervised. Most current inductive methods are supervised and need a bracketed data set in the training phase. However, the lack of this kind of data set in many languages has encouraged us to pay more attention to unsupervised approaches. So far, unsupervised approaches have achieved little success. By considering a history-based...

Context-based Persian Grapheme-to-Phoneme Conversion using Sequence-to-Sequence Models

, M.Sc. Thesis Sharif University of Technology Rahmati, Elnaz (Author) ; Sameti, Hossein (Supervisor)

Abstract

Many Text-to-Speech (TTS) systems, particularly in low-resource environments, struggle to produce natural and intelligible speech from grapheme sequences. One solution to this problem is to use Grapheme-to-Phoneme (G2P) conversion to increase the information in the input sequence and improve the TTS output. However, current G2P systems are not accurate or efficient enough for Persian texts due to the language’s complexity and the lack of short vowels in Persian grapheme sequences. In our study, we aimed to improve resources for the Persian language. To achieve this, we introduced two new G2P training datasets, one manually-labeled and the other machine-generated, containing over five million...


3D Medical Images Segmentation by Effective Use of Unlabeled Data

, M.Sc. Thesis Sharif University of Technology Khalili, Hossein (Author) ; Soleymani Baghshah, Mahdieh (Supervisor)

Abstract

Image segmentation in medical imaging, as one of the most important branches of medical image analysis, often faces the challenge of limited labeled data for application in deep learning methods. The high cost of data collection and the need for expertise in image segmentation, particularly in three-dimensional images such as MRI and CT or sequence images like CMR, have all contributed to this problem, even for popular networks like U-Net, which struggle to achieve high accuracy. As a result, research efforts have focused on semi-supervised learning approaches, weakly supervised learning, as well as multi-instance learning in medical image segmentation. Unfortunately, each of these methods...


Continual Learning Using Unsupervised Data

, M.Sc. Thesis Sharif University of Technology Ameli Kalkhoran, Amir Hossein (Author) ; Soleymani Baghshah, Mahdieh (Supervisor)

Abstract

The existing continual learning methods are mainly focused on fully-supervised scenarios and are still not able to take advantage of unlabeled data available in the environment. Some recent works tried to investigate semi-supervised continual learning (SSCL) settings in which the unlabeled data are available, but it is only from the same distribution as the labeled data. This assumption is still not general enough for real-world applications and restricts the utilization of unsupervised data. In this work, we introduce Open-Set Semi-Supervised Continual Learning (OSSCL), a more realistic semi-supervised continual learning setting in which out-of-distribution (OoD) unlabeled samples in the...


Weakly Supervised Semantic Segmentation Using Deep Neural Networks

, M.Sc. Thesis Sharif University of Technology Khairi Atani, Masoud (Author) ; Kasaei, Shohreh (Supervisor)

Abstract

Semantic segmentation, which is the classification of every pixel in an input image, is a fundamental task in the fields of computer vision and scene understanding. Applications of semantic segmentation include autonomous vehicles and robotics. Since this task requires dense annotation of the images in the dataset, recent methods have been proposed to utilize weakly-supervised and semi-supervised learning using data with weak labels and unlabeled data, respectively. Because the amount of fully labeled data might not be sufficient in such methods, some papers have proposed to employ depth input data, when available, due to its rich geometrical and local information. In this research, an...
