ConiVAT: Cluster Tendency Assessment and Clustering with Partial Background Knowledge

  • 2020-09-28 17:21:09
  • Punit Rathore, James C. Bezdek, Paolo Santi, Carlo Ratti
  • 0

Abstract

The VAT method is a visual technique for determining the potential clusterstructure and the possible number of clusters in numerical data. Its improvedversion, iVAT, uses a path-based distance transform to improve theeffectiveness of VAT for "tough" cases. Both VAT and iVAT have also been usedin conjunction with a single-linkage(SL) hierarchical clustering algorithm.However, they are sensitive to noise and bridge points between clusters in thedataset, and consequently, the corresponding VAT/iVAT images are oftenin-conclusive for such cases. In this paper, we propose a constraint-basedversion of iVAT, which we call ConiVAT, that makes use of background knowledgein the form of constraints, to improve VAT/iVAT for challenging and complexdatasets. ConiVAT uses the input constraints to learn the underlying similaritymetric and builds a minimum transitive dissimilarity matrix, before applyingVAT to it. We demonstrate ConiVAT approach to visual assessment and singlelinkage clustering on nine datasets to show that, it improves the quality ofiVAT images for complex datasets, and it also overcomes the limitation of SLclustering with VAT/iVAT due to "noisy" bridges between clusters. Extensiveexperiment results on nine datasets suggest that ConiVAT outperforms the otherthree semi-supervised clustering algorithms in terms of improved clusteringaccuracy.

 

Quick Read (beta)

loading the full paper ...