figshare
Browse

Structural Classification of PFAS using Molecular Fingerprints and Graph Networks

Version 6 2025-06-09, 23:00
Version 5 2025-06-09, 22:46
Version 4 2025-05-29, 19:48
Version 3 2025-05-29, 18:35
Version 2 2025-05-27, 19:44
Version 1 2025-05-22, 14:08
dataset
posted on 2025-06-09, 23:00 authored by Arpan MukherjeeArpan Mukherjee

This project provides a network-based structural classification resource for 13,028 per- and polyfluoroalkyl substances (PFAS) from the U.S. EPA’s 2024 PFAS8a7v3 list. Using eight molecular fingerprinting methods and K-Nearest Neighbor Graphs (K-NNGs), we assign Proximity Classes to 288 previously unclassified PFAS compounds. The dataset includes Proximity Class outputs, UMAP coordinates, edge lists, and interactive 3D network visualizations across multiple neighborhood sizes. All files are designed for reuse in regulatory prioritization, chemical substitution, and machine learning applications.

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC