TopoMap: A 0-dimensional Homology Preserving Projection of High-Dimensional Data

  • 2020-09-03 08:30:02
  • Harish Doraiswamy, Julien Tierny, Paulo J. S. Silva, Luis Gustavo Nonato, Claudio Silva
  • 17

Abstract

Multidimensional Projection is a fundamental tool for high-dimensional dataanalytics and visualization. With very few exceptions, projection techniquesare designed to map data from a high-dimensional space to a visual space so asto preserve some dissimilarity (similarity) measure, such as the Euclideandistance for example. In fact, although adopting distinct mathematicalformulations designed to favor different aspects of the data, mostmultidimensional projection methods strive to preserve dissimilarity measuresthat encapsulate geometric properties such as distances or the proximityrelation between data objects. However, geometric relations are not the onlyinteresting property to be preserved in a projection. For instance, theanalysis of particular structures such as clusters and outliers could be morereliably performed if the mapping process gives some guarantee as totopological invariants such as connected components and loops. This paperintroduces TopoMap, a novel projection technique which provides topologicalguarantees during the mapping process. In particular, the proposed methodperforms the mapping from a high-dimensional space to a visual space, whilepreserving the 0-dimensional persistence diagram of the Rips filtration of thehigh-dimensional data, ensuring that the filtrations generate the sameconnected components when applied to the original as well as projected data.The presented case studies show that the topological guarantee provided byTopoMap not only brings confidence to the visual analytic process but also canbe used to assist in the assessment of other projection methods.

 

Quick Read (beta)

loading the full paper ...