Loading...
Search for: semi-supervised-learning
0.02 seconds
Total 49 records

    Supervised neighborhood graph construction for semi-supervised classification

    , Article Pattern Recognition ; Volume 45, Issue 4 , April , 2012 , Pages 1363-1372 ; 00313203 (ISSN) Rohban, M. H ; Rabiee, H. R ; Sharif University of Technology
    Abstract
    Graph based methods are among the most active and applicable approaches studied in semi-supervised learning. The problem of neighborhood graph construction for these methods is addressed in this paper. Neighborhood graph construction plays a key role in the quality of the classification in graph based methods. Several unsupervised graph construction methods have been proposed that have addressed issues such as data noise, geometrical properties of the underlying manifold and graph hyper-parameters selection. In contrast, in order to adapt the graph construction to the given classification task, many of the recent graph construction methods take advantage of the data labels. However, these... 

    Semi-supervised ensemble learning of data streams in the presence of concept drift

    , Article Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) ; Volume 7209 LNAI, Issue PART 2 , 2012 , Pages 526-537 ; 03029743 (ISSN) ; 9783642289309 (ISBN) Ahmadi, Z ; Beigy, H ; Sharif University of Technology
    Abstract
    Increasing access to very large and non-stationary datasets in many real problems has made the classical data mining algorithms impractical and made it necessary to design new online classification algorithms. Online learning of data streams has some important features, such as sequential access to the data, limitation on time and space complexity and the occurrence of concept drift. The infinite nature of data streams makes it hard to label all observed instances. It seems that using the semi-supervised approaches have much more compatibility with the problem. So in this paper we present a new semi-supervised ensemble learning algorithm for data streams. This algorithm uses the majority... 

    Unilateral semi-supervised learning of extended hidden vector state for Persian language understanding

    , Article NLP-KE 2011 - Proceedings of the 7th International Conference on Natural Language Processing and Knowledge Engineering, 27 November 2011 through 29 November 2011, Tokushima ; 2011 , Pages 165-168 ; 9781612847283 (ISBN) Jabbari, F ; Sameti, H ; Bokaei, M. H ; Chinese Association for Artificial Intelligence; IEEE Signal Processing Society ; Sharif University of Technology
    2011
    Abstract
    The key element of a spoken dialogue system is Spoken Language Understanding (SLU) part. HVS and EHVS are two most popular statistical methods employed to implement the SLU part which need lightly annotated data. Since annotation is a time consuming, we present a novel semi-supervised learning for EHVS to reduce the human labeling effort using two different statistical classifiers, SVM and KNN. Experiments are done on a Persian corpus, the University Information Kiosk corpus. The experimental results show improvements in performance of semi-supervised EHVS, trained by both labeled and unlabeled data, compared to EHVS trained by just initially labeled data. The performance of EHVS improves... 

    Efficient iterative Semi-Supervised Classification on manifold

    , Article Proceedings - IEEE International Conference on Data Mining, ICDM ; 2011 , Pages 228-235 ; 15504786 (ISSN); 9780769544090 (ISBN) Farajtabar, M ; Rabiee, H. R ; Shaban, A ; Soltani Farani, A ; National Science Foundation (NSF) - Where Discoveries Begin; University of Technology Sydney; Google; Alberta Ingenuity Centre for Machine Learning; IBM Research ; Sharif University of Technology
    Abstract
    Semi-Supervised Learning (SSL) has become a topic of recent research that effectively addresses the problem of limited labeled data. Many SSL methods have been developed based on the manifold assumption, among them, the Local and Global Consistency (LGC) is a popular method. The problem with most of these algorithms, and in particular with LGC, is the fact that their naive implementations do not scale well to the size of data. Time and memory limitations are the major problems faced in large-scale problems. In this paper, we provide theoretical bounds on gradient descent, and to overcome the aforementioned problems, a new approximate Newton's method is proposed. Moreover, convergence... 

    Isograph: Neighbourhood graph construction based on geodesic distance for semi-supervised learning

    , Article Proceedings - IEEE International Conference on Data Mining, ICDM, 11 December 2011 through 14 December 2011 ; December , 2011 , Pages 191-200 ; 15504786 (ISSN) ; 9780769544083 (ISBN) Ghazvininejad, M ; Mahdieh, M ; Rabiee, H. R ; Roshan, P. K ; Rohban, M. H ; Sharif University of Technology
    2011
    Abstract
    Semi-supervised learning based on manifolds has been the focus of extensive research in recent years. Convenient neighbourhood graph construction is a key component of a successful semi-supervised classification method. Previous graph construction methods fail when there are pairs of data points that have small Euclidean distance, but are far apart over the manifold. To overcome this problem, we start with an arbitrary neighbourhood graph and iteratively update the edge weights by using the estimates of the geodesic distances between points. Moreover, we provide theoretical bounds on the values of estimated geodesic distances. Experimental results on real-world data show significant... 

    Active learning from positive and unlabeled data

    , Article Proceedings - IEEE International Conference on Data Mining, ICDM, 11 December 2011 through 11 December 2011 ; December , 2011 , Pages 244-250 ; 15504786 (ISSN) ; 9780769544090 (ISBN) Ghasemi, A ; Rabiee, H. R ; Fadaee, M ; Manzuri, M. T ; Rohban, M. H ; Sharif University of Technology
    2011
    Abstract
    During recent years, active learning has evolved into a popular paradigm for utilizing user's feedback to improve accuracy of learning algorithms. Active learning works by selecting the most informative sample among unlabeled data and querying the label of that point from user. Many different methods such as uncertainty sampling and minimum risk sampling have been utilized to select the most informative sample in active learning. Although many active learning algorithms have been proposed so far, most of them work with binary or multi-class classification problems and therefore can not be applied to problems in which only samples from one class as well as a set of unlabeled data are... 

    HMM based semi-supervised learning for activity recognition

    , Article SAGAware'11 - Proceedings of the 2011 International Workshop on Situation Activity and Goal Awareness, 18 September 2011 through 18 September 2011, Beijing ; September , 2011 , Pages 95-99 ; 9781450309264 (ISBN) Ghazvininejad, M ; Rabiee, H. R ; Pourdamghani, N ; Khanipour, P ; Sharif University of Technology
    2011
    Abstract
    In this paper, we introduce a novel method for human activity recognition that benefits from the structure and sequential properties of the test data as well as the training data. In the training phase, we obtain a fraction of data labels at constant time intervals and use them in a semi-supervised graph-based method for recognizing the user's activities. We use label propagation on a k-nearest neighbor graph to calculate the probability of association of the unlabeled data to each class in this phase. Then we use these probabilities to train an HMM in a way that each of its hidden states corresponds to one class of activity. These probabilities are used to learn the transition probabilities... 

    Manifold coarse graining for online semi-supervised learning

    , Article Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 5 September 2011 through 9 September 2011 ; Volume 6911 LNAI, Issue PART 1 , September , 2011 , Pages 391-406 ; 03029743 (ISSN) ; 9783642237799 (ISBN) Farajtabar, M ; Shaban, A ; Rabiee, H. R ; Rohban, M. H ; Sharif University of Technology
    2011
    Abstract
    When the number of labeled data is not sufficient, Semi-Supervised Learning (SSL) methods utilize unlabeled data to enhance classification. Recently, many SSL methods have been developed based on the manifold assumption in a batch mode. However, when data arrive sequentially and in large quantities, both computation and storage limitations become a bottleneck. In this paper, we present a new semi-supervised coarse graining (CG) algorithm to reduce the required number of data points for preserving the manifold structure. First, an equivalent formulation of Label Propagation (LP) is derived. Then a novel spectral view of the Harmonic Solution (HS) is proposed. Finally an algorithm to reduce... 

    A hybrid supervised semi-supervised graph-based model to predict one-day ahead movement of global stock markets and commodity prices

    , Article Expert Systems with Applications ; Volume 105 , 2018 , Pages 159-173 ; 09574174 (ISSN) Negahdari Kia, A ; Haratizadeh, S ; Bagheri Shouraki, S ; Sharif University of Technology
    Abstract
    Market prediction has been an important machine learning research topic in recent decades. A neglected issue in prediction is having a model that can simultaneously pay attention to the interaction of global markets along historical data of the target markets being predicted. As a solution, we present a hybrid supervised semi-supervised model called HyS3 for direction of movement prediction. The graph-based semi-supervised part of HyS3 models the markets global interactions through a network designed with a novel continuous Kruskal-based graph construction algorithm called ConKruG. The supervised part of the model injects results extracted from each market's historical data to the network...