III:Small: Partitioning Big Data for High Performance Computation of Persistent Homology

Wilsey, Philip

doi:10.6084/m9.figshare.11778285.v1

lightningTalk.pdf (581.7 kB)

presentation

Posted on 2020-01-31, 13:53; authored by Philip Wilsey

Persistent Homology (PH) is computationally expensive and cannot be applied directly to more than a few thousand data points. This project aims to develop mechanisms that allow the computation of PH on large, high-dimensional data sets. The proposed method will significantly reduce the run time and memory requirements of the PH computation without significantly compromising the accuracy of the results.

This project explores techniques to map a large point cloud P to another point cloud P' with fewer total points such that the topological spaces characterized by P and P' are nearly equivalent. The mapping from P to P' will potentially hide some of the smaller topological features during the PH computation on P'. Restoration of accurate PH results is achieved by (i) upscaling the data for the identified large topological features, and (ii) partitioning the data to run concurrent PH computations that locate the smaller topological features.
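
The reduce-then-partition idea above can be illustrated with a minimal sketch. The abstract does not say how P is mapped to P' or how the partitions are formed, so everything here is an assumption made for illustration: greedy farthest-point (maxmin) sampling stands in for the reduction, and nearest-representative assignment stands in for the partitioning. The function names and the noisy-circle test data are hypothetical, and the PH computations themselves are left out.

```python
import math
import random


def farthest_point_sample(points, k, seed=0):
    """Pick up to k representative indices by greedy maxmin selection.

    Assumed stand-in for the P -> P' mapping; not the project's method.
    """
    if not 0 < k <= len(points):
        raise ValueError("k must be between 1 and len(points)")
    rng = random.Random(seed)
    first = rng.randrange(len(points))
    chosen = [first]
    # nearest[i] = distance from points[i] to its closest chosen representative
    nearest = [math.dist(p, points[first]) for p in points]
    while len(chosen) < k:
        nxt = max(range(len(points)), key=nearest.__getitem__)
        if nearest[nxt] == 0.0:
            break  # every remaining point duplicates a representative
        chosen.append(nxt)
        for i, p in enumerate(points):
            d = math.dist(p, points[nxt])
            if d < nearest[i]:
                nearest[i] = d
    return chosen


def partition_by_nearest(points, rep_indices):
    """Assign every point index to the partition of its nearest representative."""
    parts = {r: [] for r in rep_indices}
    for i, p in enumerate(points):
        owner = min(rep_indices, key=lambda r: math.dist(p, points[r]))
        parts[owner].append(i)
    return parts


# Hypothetical input: 400 points sampled near the unit circle.
_rng = random.Random(42)
P = []
for _ in range(400):
    angle = _rng.uniform(0.0, 2.0 * math.pi)
    radius = 1.0 + _rng.gauss(0.0, 0.05)
    P.append((radius * math.cos(angle), radius * math.sin(angle)))

reps = farthest_point_sample(P, k=20)      # indices of the points forming P'
P_reduced = [P[i] for i in reps]           # the smaller cloud P'
parts = partition_by_nearest(P, reps)      # partitions of the original cloud P

# PH would be computed once on P_reduced (large features) and then,
# independently and possibly concurrently, on the points of each partition
# (small features). Those computations are omitted from this sketch.
```

Because each partition holds only the points near one representative, the per-partition computations are independent of one another, which is what would make running them concurrently possible.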

Funding

IIS-1909096

Keywords

NSF-CSSI-2020-Talk, Topological data analysis, Persistent Homology, Data Reduction and Partitioning, Parallel and Distributed Computing, Computer Engineering

Licence

CC BY 4.0
