Probabilistic non-linear distance metric learning for constrained clustering

Babagholami Mohamadabadi, B ; Sharif University of Technology | 2013

  1. Type of Document: Article
  2. DOI: 10.1145/2501006.2501011
  3. Publisher: ACM, 2013
  4. Abstract:
  5. Distance metric learning is a powerful approach to clustering with side information. In semi-supervised clustering, the supervisory information is usually a set of pairwise similarity and dissimilarity constraints. Although some existing methods can use both equivalence (similarity) and inequivalence (dissimilarity) constraints, they are usually limited to learning a global Mahalanobis metric (i.e., finding a linear transformation). Moreover, they learn the metric only from the data points that appear in constraints and cannot use information from the other data points. In this paper, we propose a probabilistic metric learning algorithm that uses information from unconstrained data points (data points that appear in neither positive nor negative constraints) along with both positive and negative constraints. We also kernelize our metric learning method using the kernel trick, which provides a non-linear version of the learned metric. Experimental results on synthetic and real-world data sets demonstrate the effectiveness of the proposed metric learning algorithm.
  6. Keywords:
  7. Deterministic annealing EM ; Kernel trick ; Logistic regression ; Constrained clustering ; Deterministic annealing ; Distance metric learning ; Logistic regressions ; Metric learning ; Negative constraints ; Semi-supervised clustering ; Clustering algorithms ; Learning algorithms ; Virtual reality ; Learning systems
  8. Source: MultiClust 2013 - 4th Workshop on Multiple Clusterings, Multi-View Data, and Multi-Source Knowledge-Driven Clustering, in Conj. with the 19th ACM SIGKDD Int. Conf. on KDD 2013 ; 2013 ; 9781450323345 (ISBN)
  9. URL: http://dl.acm.org/citation.cfm?doid=2501006.2501011
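
The abstract's remark that learning a global Mahalanobis metric amounts to finding a linear transformation can be illustrated with a minimal sketch. It is not the paper's algorithm: the matrix L and the points x and y are hypothetical values chosen only for illustration, and the code merely checks the standard identity that a Mahalanobis distance with M = L^T L equals the Euclidean distance between the linearly mapped points.

```python
import math

def linear_map(L, x):
    """Apply the matrix L (given as a list of rows) to the vector x."""
    return [sum(l_ij * x_j for l_ij, x_j in zip(row, x)) for row in L]

def mahalanobis(x, y, L):
    """Mahalanobis distance with M = L^T L, computed as ||L(x - y)||."""
    diff = [a - b for a, b in zip(x, y)]
    return math.sqrt(sum(v * v for v in linear_map(L, diff)))

def euclidean(x, y):
    """Ordinary Euclidean distance between two vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

# Hypothetical transformation and points (assumptions, not from the paper).
L = [[2.0, 0.0],
     [1.0, 1.0]]
x = [1.0, 2.0]
y = [3.0, 5.0]

d_metric = mahalanobis(x, y, L)                            # distance under M = L^T L
d_mapped = euclidean(linear_map(L, x), linear_map(L, y))   # distance after mapping

print(d_metric, d_mapped)
```

Both values agree, which is why a single global Mahalanobis metric can only stretch, rotate, or shear the input space; the kernelized variant described in the abstract is what allows a non-linear metric.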