Causal inference and the data-fusion problem 论文

2016Proceedings of the National Academy of Sciences引用 616

Bayesian Modeling and Causal InferenceAdvanced Causal Inference TechniquesStatistical Methods and Inference

Statistical Methods and Inference Bayesian Modeling and Causal Inference Advanced Causal Inference Techniques

作者

摘要

We review concepts, principles, and tools that unify current approaches to causal analysis and attend to new challenges presented by big data. In particular, we address the problem of data fusion-piecing together multiple datasets collected under heterogeneous conditions (i.e., different populations, regimes, and sampling methods) to obtain valid answers to queries of interest. The availability of multiple heterogeneous datasets presents new opportunities to big data analysts, because the knowledge that can be acquired from combined data would not be possible from any individual source alone. However, the biases that emerge in heterogeneous environments require new analytical tools. Some of these biases, including confounding, sampling selection, and cross-population biases, have been addressed in isolation, largely in restricted parametric models. We here present a general, nonparametric framework for handling these biases and, ultimately, a theoretical solution to the problem of data fusion in causal inference tasks.

作者查看全部 (2)

Judea Pearl

Elias Bareinboim

Causal inference and the data-fusion problem 论文

摘要

作者查看全部 (2)

相关技术查看全部 (3)

相关事件

相关文章