Model-Based Clustering for Expression Data via a Dirichlet Process Mixture Model (Paper)

2006 · Cambridge University Press eBooks · Cited by 293

Gene expression and cancer classification; Bayesian Methods and Mixture Models; Bioinformatics and Genomic Networks

Biotechnology

Abstract

This chapter describes a clustering procedure for microarray expression data based on a well-defined statistical model, specifically, a conjugate Dirichlet process mixture model. The clustering algorithm groups genes whose latent variables governing expression are equal, that is, genes belonging to the same mixture component. The model is fit with Markov chain Monte Carlo, and the computational burden is eased by exploiting conjugacy. This chapter introduces a method to obtain a point estimate of the true clustering based on least-squares distances from the posterior probability that two genes are clustered together. Unlike ad hoc clustering methods, the model provides measures of uncertainty about the clustering. Further, the model automatically estimates the number of clusters and quantifies uncertainty about this important parameter. The method is compared to other clustering methods in a simulation study. Finally, the method is demonstrated with actual microarray data.
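
The least-squares point estimate mentioned in the abstract can be illustrated with a minimal sketch. It assumes the usual reading of that idea: average the 0/1 co-clustering indicators over the MCMC draws to estimate the pairwise posterior probabilities, then pick the sampled clustering whose indicator matrix is closest to that average in squared error. The chapter's own formulation is not reproduced in this listing, so the function names (`association_matrix`, `pairwise_probabilities`, `least_squares_clustering`) and the toy draws below are hypothetical and only illustrative.

```python
def association_matrix(labels):
    """n x n matrix with 1.0 where items i and j share a cluster label."""
    n = len(labels)
    return [[1.0 if labels[i] == labels[j] else 0.0 for j in range(n)]
            for i in range(n)]


def pairwise_probabilities(samples):
    """Average the association matrices over all sampled clusterings."""
    n = len(samples[0])
    total = [[0.0] * n for _ in range(n)]
    for labels in samples:
        assoc = association_matrix(labels)
        for i in range(n):
            for j in range(n):
                total[i][j] += assoc[i][j]
    return [[value / len(samples) for value in row] for row in total]


def least_squares_clustering(samples):
    """Return the sampled clustering closest (in squared error) to the
    estimated pairwise probabilities, together with those probabilities."""
    probs = pairwise_probabilities(samples)
    n = len(probs)

    def squared_error(labels):
        assoc = association_matrix(labels)
        return sum((assoc[i][j] - probs[i][j]) ** 2
                   for i in range(n) for j in range(n))

    return min(samples, key=squared_error), probs


# Hypothetical MCMC output: four sampled clusterings of four genes
# (toy values chosen for illustration, not data from the chapter).
samples = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 0, 1],
    [0, 1, 2, 2],
]
best, probs = least_squares_clustering(samples)
print(best)         # clustering selected as the point estimate
print(probs[0][1])  # estimated probability that genes 0 and 1 co-cluster
```

Restricting the search to clusterings actually visited by the sampler keeps the estimate a valid partition and avoids searching the full space of partitions; the pairwise probability matrix itself also serves as a summary of clustering uncertainty.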

Author

David B. Dahl
