Benchmark Data Set for in Silico Prediction of Ames Mutagenicity 论文

2009Journal of Chemical Information and Modeling引用 360

Computational Drug Discovery MethodsAnalytical Chemistry and ChromatographyMachine Learning in Bioinformatics

人工智能 Computational Drug Discovery Methods Analytical Chemistry and Chromatography Machine Learning in Bioinformatics

作者

摘要

Up to now, publicly available data sets to build and evaluate Ames mutagenicity prediction tools have been very limited in terms of size and chemical space covered. In this report we describe a new unique public Ames mutagenicity data set comprising about 6500 nonconfidential compounds (available as SMILES strings and SDF) together with their biological activity. Three commercial tools (DEREK, MultiCASE, and an off-the-shelf Bayesian machine learner in Pipeline Pilot) are compared with four noncommercial machine learning implementations (Support Vector Machines, Random Forests, k-Nearest Neighbors, and Gaussian Processes) on the new benchmark data set.

作者查看全部 (7)

Klaus‐Robert Müller

Thomas Steger‐Hartmann

Antonius ter Laak

Andreas Sutter

Benchmark Data Set for in Silico Prediction of Ames Mutagenicity 论文

摘要

作者查看全部 (7)

相关技术查看全部 (2)

相关事件

相关文章