Kernel-based metric learning for semi-supervised clustering

Soleymani Baghshah, M ; Sharif University of Technology | 2010

  1. Type of Document: Article
  2. DOI: 10.1016/j.neucom.2009.12.009
  3. Publisher: 2010
  4. Abstract:
  5. The distance metric plays an important role in many machine learning algorithms. Recently, there has been growing interest in distance metric learning for the semi-supervised setting. In the last few years, many methods have been proposed for metric learning when pairwise similarity (must-link) and/or dissimilarity (cannot-link) constraints are available along with unlabeled data. Most of these methods learn a global Mahalanobis metric (or, equivalently, a linear transformation). Although some recently introduced methods have devised nonlinear extensions of linear metric learning methods, they usually allow only limited forms of distance metrics and can use only similarity constraints. In this paper, we propose a nonlinear metric learning method that learns a completely flexible distance metric by learning a nonparametric kernel matrix. The proposed method uses both similarity and dissimilarity constraints, as well as the topological structure of the data, to learn an appropriate distance metric. Our method is formulated as a convex optimization problem for learning a kernel matrix. Because the problem is convex, the resulting metric learning method is free of local optima. Experimental results on synthetic and real-world data sets show that the proposed method outperforms recently introduced metric learning methods for semi-supervised clustering.
  6. Keywords:
  7. Kernel learning ; Metric learning ; Non-parametric kernel matrix ; Optimization ; Pairwise constraint ; Semi-supervised clustering ; Kernel matrices ; Non-parametric ; Pairwise constraints ; Convex optimization ; Linear transformations ; Matrix algebra ; Structural optimization ; Learning algorithms ; Analytic method ; Article ; Cluster analysis ; Intermethod comparison ; Kernel method ; Mathematical analysis ; Priority journal ; Process optimization
  8. Source: Neurocomputing ; Volume 73, Issue 7-9 , 2010 , Pages 1352-1361 ; 09252312 (ISSN)
  9. URL: http://www.sciencedirect.com/science/article/pii/S0925231209004275
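
The abstract relies on two standard identities: a Mahalanobis metric is equivalent to a Euclidean distance after a linear transformation, and any kernel matrix K induces squared distances d(i, j)^2 = K_ii + K_jj - 2 K_ij. The sketch below only checks these identities numerically with NumPy. It is not the paper's optimization method, and the function names, the random data, and the matrix L are illustrative assumptions that do not come from the article.

```python
import numpy as np

def mahalanobis_sq(x, y, A):
    """Squared Mahalanobis distance (x - y)^T A (x - y) for a PSD matrix A."""
    d = x - y
    return float(d @ A @ d)

def kernel_distance_sq(K):
    """Squared distances induced by a kernel matrix: K_ii + K_jj - 2 K_ij."""
    diag = np.diag(K)
    return diag[:, None] + diag[None, :] - 2.0 * K

# Hypothetical toy data: 5 points in 3 dimensions.
rng = np.random.default_rng(0)
X = rng.normal(size=(5, 3))

# Any matrix L defines a PSD Mahalanobis matrix A = L^T L.
L = rng.normal(size=(3, 3))
A = L.T @ L

# Mahalanobis distance under A equals Euclidean distance after mapping x -> L x.
d_mahal = mahalanobis_sq(X[0], X[1], A)
d_linear = float(np.sum((L @ X[0] - L @ X[1]) ** 2))

# The same metric expressed through a kernel matrix K_ij = x_i^T A x_j.
K = X @ A @ X.T
D = kernel_distance_sq(K)
```

Here `d_mahal`, `d_linear`, and `D[0, 1]` agree up to floating-point error. A method that learns the entries of K directly, as the abstract describes, is no longer tied to a single linear map, which is what makes the resulting metric nonlinear in the input space.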