Distributed Inference for Latent Dirichlet Allocation 论文
2007引用 220
Topic ModelingNatural Language Processing TechniquesBayesian Methods and Mixture Models
摘要
1 Introduction Very large data sets, such as collections of images, text, and related data, are becoming increasinglycommon, with examples ranging from digitized collections of books by companies such as Google and Amazon, to large collections of images at Web sites such as Flickr, to the recent Netflix customerrecommendation data set. These data sets present major opportunities for machine learning, such as the ability to explore much richer and more expressive models, as well as providing new andinteresting domains for the application of learning algorithms.