This project contains the dataset used in the manuscript "Solving an inverse problem with generative models". The code and documentation for using this data set are available in the manuscript. See the preprint at https://doi.org/10.26434/chemrxiv-2025-nl9gl.