Deep Neural Networks for Estimation and Inference 论文

2021Econometrica引用 297
Gaussian Processes and Bayesian InferenceAdvanced Causal Inference TechniquesStatistical Methods and Inference

详细信息

发表期刊/会议
Econometrica
发表日期
2021-01-01
发表年份
2021

关键词

Gaussian Processes and Bayesian InferenceAdvanced Causal Inference TechniquesStatistical Methods and Inference

摘要

We study deep neural networks and their use in semiparametric inference. We establish novel nonasymptotic high probability bounds for deep feedforward neural nets. These deliver rates of convergence that are sufficiently fast (in some cases minimax optimal) to allow us to establish valid second‐step inference after first‐step estimation with deep learning, a result also new to the literature. Our nonasymptotic high probability bounds, and the subsequent semiparametric inference, treat the current standard architecture: fully connected feedforward neural networks (multilayer perceptrons), with the now‐common rectified linear unit activation function, unbounded weights, and a depth explicitly diverging with the sample size. We discuss other architectures as well, including fixed‐width, very deep networks. We establish the nonasymptotic bounds for these deep nets for a general class of nonparametric regression‐type loss functions, which includes as special cases least squares, logistic regression, and other generalized linear models. We then apply our theory to develop semiparametric inference, focusing on causal parameters for concreteness, and demonstrate the effectiveness of deep learning with an empirical application to direct mail marketing.