Preserving Statistical Validity in Adaptive Data Analysis 论文

2015引用 262

Machine Learning and Data ClassificationExplainable Artificial Intelligence (XAI)Statistical Methods and Inference

人工智能 Machine Learning and Data Classification Statistical Methods and Inference Explainable Artificial Intelligence (XAI)

关系图谱

作者

摘要

A great deal of effort has been devoted to reducing the risk of spurious scientific discoveries, from the use of sophisticated validation techniques, to deep statistical methods for controlling the false discovery rate in multiple hypothesis testing. However, there is a fundamental disconnect between the theoretical results and the practice of data analysis: the theory of statistical inference assumes a fixed collection of hypotheses to be tested, or learning algorithms to be applied, selected non-adaptively before the data are gathered, whereas in practice data is shared and reused with hypotheses and new analyses being generated on the basis of data exploration and the outcomes of previous analyses.

作者查看全部 (6)

Aaron Roth

Omer Reingold

Toniann Pitassi

Moritz Hardt

Preserving Statistical Validity in Adaptive Data Analysis 论文

详细信息

摘要

作者查看全部 (6)

相关技术查看全部 (2)

相关事件

相关文章