The Experimental Uncertainty of Heterogeneous Public <i>K</i><sub>i</sub> Data 论文

2012Journal of Medicinal Chemistry引用 226

Computational Drug Discovery MethodsMetabolomics and Mass Spectrometry StudiesAnalytical Chemistry and Chromatography

Computational Drug Discovery Methods Analytical Chemistry and Chromatography Metabolomics and Mass Spectrometry Studies

作者

摘要

The maximum achievable accuracy of in silico models depends on the quality of the experimental data. Consequently, experimental uncertainty defines a natural upper limit to the predictive performance possible. Models that yield errors smaller than the experimental uncertainty are necessarily overtrained. A reliable estimate of the experimental uncertainty is therefore of high importance to all originators and users of in silico models. The data deposited in ChEMBL was analyzed for reproducibility, i.e., the experimental uncertainty of independent measurements. Careful filtering of the data was required because ChEMBL contains unit-transcription errors, undifferentiated stereoisomers, and repeated citations of single measurements (90% of all pairs). The experimental uncertainty is estimated to yield a mean error of 0.44 pK(i) units, a standard deviation of 0.54 pK(i) units, and a median error of 0.34 pK(i) units. The maximum possible squared Pearson correlation coefficient (R(2)) on large data sets is estimated to be 0.81.

作者查看全部 (4)

Anna Vulpetti

Peter Gedeck

Tuomo Kalliokoski

Christian Krämer

The Experimental Uncertainty of Heterogeneous Public <i>K</i><sub>i</sub> Data 论文

摘要

作者查看全部 (4)

相关技术查看全部 (1)

相关事件

相关文章