posted on 2016-10-17, 00:00authored byJames
L. McDonagh, David S. Palmer, Tanja van Mourik, John B. O. Mitchell
We
compare a range of computational methods for the prediction
of sublimation thermodynamics (enthalpy, entropy, and free energy
of sublimation). These include a model from theoretical chemistry
that utilizes crystal lattice energy minimization (with the DMACRYS
program) and quantitative structure property relationship (QSPR) models
generated by both machine learning (random forest and support vector
machines) and regression (partial least squares) methods. Using these
methods we investigate the predictability of the enthalpy, entropy
and free energy of sublimation, with consideration of whether such
a method may be able to improve solubility prediction schemes. Previous
work has suggested that the major source of error in solubility prediction
schemes involving a thermodynamic cycle via the solid state is in
the modeling of the free energy change away from the solid state.
Yet contrary to this conclusion other work has found that the inclusion
of terms such as the enthalpy of sublimation in QSPR methods does
not improve the predictions of solubility. We suggest the use of theoretical
chemistry terms, detailed explicitly in the Methods section, as descriptors
for the prediction of the enthalpy and free energy of sublimation.
A data set of 158 molecules with experimental sublimation thermodynamics
values and some CSD refcodes has been collected from the literature
and is provided with their original source references.