FlowER - Mechanistic datasets and model checkpoint

Version 2 2025-02-16, 04:32
Version 1 2025-02-16, 03:42
dataset
posted on 2025-02-16, 04:32 authored by Joonyoung Francis Joung, Mun Hong Fong, Nicholas Casetti, Jordan P. Liles, Ne S. Dassanayake, Connor W. Coley

This repository contains the training, validation, and test datasets used to train the FlowER model, along with the model checkpoint files. With these checkpoint files, all data used in the FlowER paper can be fully reproduced.

The train, validation, and test datasets are recorded at the elementary step level. Each line in the text files follows the format:


reactants>>products|Number



  • Reactants and products are represented with atom-mapped SMILES.
  • The Number identifies the overall reaction: elementary steps with the same Number originate from the same overall reaction.
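
The line format described above can be read with a few lines of Python. The following is a minimal sketch, not part of the FlowER code base: the function names `parse_step` and `group_by_reaction` are hypothetical, the example lines are made up for illustration rather than taken from the dataset, and the code assumes that Number is an integer and is the last `|`-separated field on each line.

```python
from collections import defaultdict


def parse_step(line):
    """Split one 'reactants>>products|Number' line into its three parts.

    Assumption: Number is an integer placed after the last '|' on the line.
    """
    reaction, _, number = line.strip().rpartition("|")
    reactants, _, products = reaction.partition(">>")
    return reactants, products, int(number)


def group_by_reaction(lines):
    """Collect elementary steps that share the same Number."""
    groups = defaultdict(list)
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        reactants, products, number = parse_step(line)
        groups[number].append((reactants, products))
    return dict(groups)


# Made-up lines in the documented format (not taken from the dataset).
example_lines = [
    "[CH3:1][Br:2].[OH-:3]>>[CH3:1][OH:3].[Br-:2]|7",
    "[CH3:1][OH:3].[H+:4]>>[CH3:1][OH+:3][H:4]|7",
    "[NH3:1].[H+:2]>>[NH3+:1][H:2]|8",
]

steps_by_reaction = group_by_reaction(example_lines)
for number, steps in steps_by_reaction.items():
    print(number, len(steps))
```

Grouping by Number recovers the set of elementary steps that belong to one overall reaction. To read a real file, the same function can be given an open file handle in place of `example_lines`.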

The FlowER model can be found at: https://github.com/FongMunHong/FlowER

Funding

Machine Learning for Pharmaceutical Discovery and Synthesis consortium

National Science Foundation under Grant No. CHE-2144153