Loading...

Clustering and outlier detection using isoperimetric number of trees

Daneshgar, A ; Sharif University of Technology | 2013

767 Viewed
  1. Type of Document: Article
  2. DOI: 10.1016/j.patcog.2013.05.015
  3. Publisher: 2013
  4. Abstract:
  5. We propose a graph-based data clustering algorithm which is based on exact clustering of a minimum spanning tree in terms of a minimum isoperimetry criteria. We show that our basic clustering algorithm runs in O(nlogn) and with post-processing in almost O(nlogn) (average case) and O(n2) (worst case) time where n is the size of the data-set. It is also shown that our generalized graph model, which also allows the use of potentials at vertices, can be used to extract an extra piece of information related to anomalous data patterns and outliers. In this regard, we propose an algorithm that extracts outliers in parallel to data clustering. We also provide a comparative performance analysis of our algorithms with other related ones and we show that they behave quite effectively on hard synthetic data-sets as well as real-world benchmarks
  6. Keywords:
  7. Isoperimetric constant ; Normalized cut ; Cheeger constant ; Data clustering ; Graph Partitioning ; Isoperimetric constants ; Normalized cuts ; Outlier Detection ; Perceptual grouping ; Benchmarking ; Cluster analysis ; Data mining ; Graph theory ; Statistics ; Clustering algorithms
  8. Source: Pattern Recognition ; Volume 46, Issue 12 , December , 2013 , Pages 3371-3382 ; 00313203 (ISSN)
  9. URL: http://www.sciencedirect.com/science/article/pii/S0031320313002215