Regression Models for Count Data in R 论文

2008DOAJ (DOAJ: Directory of Open Access Journals)引用 318
Statistical Methods and Bayesian InferenceBayesian Modeling and Causal Inference

详细信息

发表期刊/会议
DOAJ (DOAJ: Directory of Open Access Journals)
发表日期
2008-06-01
发表年份
2008

关键词

Statistical Methods and Bayesian InferenceBayesian Modeling and Causal Inference

摘要

The classical Poisson, geometric and negative binomial regression models for count data belong to the family of generalized linear models and are available at the core of the statistics toolbox in the R system for statistical computing. After reviewing the conceptual and computational features of these methods, a new implementation of hurdle and zero-inflated regression models in the functions hurdle() and zeroinfl() from the package pscl is introduced. It re-uses design and functionality of the basic R functions just as the underlying conceptual tools extend the classical models. Both hurdle and zero-inflated model, are able to incorporate over-dispersion and excess zeros-two problems that typically occur in count data sets in economics and the social sciences—better than their classical counterparts. Using cross-section data on the demand for medical care, it is illustrated how the classical as well as the zero-augmented models can be fitted, inspected and tested in practice.