Learning Interactions via Hierarchical Group-Lasso Regularization 论文

2014Journal of Computational and Graphical Statistics引用 276
Statistical Methods and InferenceBayesian Methods and Mixture ModelsBayesian Modeling and Causal Inference

摘要

We introduce a method for learning pairwise interactions in a linear regression or logistic regression model in a manner that satisfies strong hierarchy: whenever an interaction is estimated to be nonzero, both its associated main effects are also included in the model. We motivate our approach by modeling pairwise interactions for categorical variables with arbitrary numbers of levels, and then show how we can accommodate continuous variables as well. Our approach allows us to dispense with explicitly applying constraints on the main effects and interactions for identifiability, which results in interpretable interaction models. We compare our method with existing approaches on both simulated and real data, including a genome-wide association study, all using our R package glinternet.