figshare
Browse
itxm_a_1422579_sm0076.pdf (63.11 kB)

Mutagenicity, anticancer activity and blood brain barrier: similarity and dissimilarity of molecular alerts

Download (63.11 kB)
journal contribution
posted on 2018-01-16, 10:09 authored by Andrey A. Toropov, Alla P. Toropova, Emilio Benfenati, Mario Salmona

The aim of the present work is an attempt to define computable measure of similarity between different endpoints. The similarity of structural alerts of different biochemical endpoints can be used to solve tasks of medicinal chemistry. Optimal descriptors are a tool to build up models for different endpoints. The optimal descriptor is calculated with simplified molecular input-line entry system (SMILES). A group of elements (single symbol or pair of symbols) can represent any SMILES. Each element of SMILES can be represented by so-called correlation weight i.e. coefficient that should be used to calculate descriptor. Numerical data on the correlation weights are calculated by the Monte Carlo method, i.e. by optimization procedure, which gives maximal correlation coefficient between the optimal descriptor and endpoint for the training set. Statistically stable correlation weights observed in several runs of the optimization can be examined as structural alerts, which are promoters of the increase or the decrease of a biochemical activity of a substance. Having data on several runs of the optimization correlation weights, one can extract list of promoters of increase and list of promoters of decrease for an endpoint. The study of similarity and dissimilarity of the above lists has been carried out for the following pairs of endpoints: (i) mutagenicity and anticancer activity; (ii) mutagenicity and blood brain barrier; and (iii) blood brain barrier and anticancer activity. The computational experiment confirms that similarity and dissimilarity for pairs of endpoints can be measured.

Funding

Authors thank the project LIFE-COMBASE contract [LIFE15 ENV/ES/000416] for financial support.

History