Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

University of Massachusetts Amherst

Andrew McCallum

2009

Inference

Articles 1 - 1 of 1

Full-Text Articles in Physical Sciences and Mathematics

Efficient Methods For Topic Model Inference On Streaming Document Collections, Limin Yao, David Mimno, Andrew Mccallum Jan 2009

Efficient Methods For Topic Model Inference On Streaming Document Collections, Limin Yao, David Mimno, Andrew Mccallum

Andrew McCallum

Topic models provide a powerful tool for analyzing large text collections by representing high dimensional data in a low dimensional subspace. Fitting a topic model given a set of training documents requires approximate inference techniques that are computationally expensive. With today's large-scale, constantly expanding document collections, it is useful to be able to infer topic distributions for new documents without retraining the model. In this paper, we empirically evaluate the performance of several methods for topic inference in previously unseen documents, including methods based on Gibbs sampling, variational inference, and a new method inspired by text classification. The classification-based inference …