Collection of reaction SMILES (reactants, reagents, solvents, products) from USPTO as published in 2023. 137K lines total. Data scraping by custom design. Data extraction by OSCAR (semantic) and ChatGPT (LLM), molecule identification by OPSIN and custom synonym list. All SMILES are RDKit-safe. Please note that the data have been collected in an automated process, the dataset is certainly not without errors.