Do Real-World Datasets Contain Natural Experiments? An Empirical Study Using Causal Feature Selection 文章

ArXiv CS.CV2026-06-03NEWSen作者: Gautam Gare, John Galeotti, Michael Mozer, Deva Ramanan, Nan Rosemary Ke

摘要

arXiv:2606.03251v1 Announce Type: cross Abstract: In nature, events that affect some individuals or groups but not others constitute an implicit intervention and are known as natural experiments. For example, the COVID-19 pandemic was an intervention by the coronavirus on the sub-population infected with COVID. We ask, do natural experiments occur in existing real-world datasets? If yes, how should we treat them? To detect natural experiments in data, we use causal discovery to recover the underlying causal graph and perform feature selection based on causal links. If downstream performance improves by treating the data as interventional rather than observational, we argue that this suggests the dataset contains natural experiments. We first validate this hypothesis by simulating datasets with and without natural experiments using synthetic graphs. We then perform a systematic empirical evaluation on a large suite of real-world datasets.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据