Trust, but Verify II: A Practical Guide to Chemogenomics Data Curation 论文

2016Journal of Chemical Information and Modeling引用 289
Metabolomics and Mass Spectrometry StudiesComputational Drug Discovery MethodsBiomedical Text Mining and Ontologies

摘要

There is a growing public concern about the lack of reproducibility of experimental data published in peer-reviewed scientific literature. Herein, we review the most recent alerts regarding experimental data quality and discuss initiatives taken thus far to address this problem, especially in the area of chemical genomics. Going beyond just acknowledging the issue, we propose a chemical and biological data curation workflow that relies on existing cheminformatics approaches to flag, and when appropriate, correct possibly erroneous entries in large chemogenomics data sets. We posit that the adherence to the best practices for data curation is important for both experimental scientists who generate primary data and deposit them in chemical genomics databases and computational researchers who rely on these data for model development.