Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Mathematics

PDF

Theses and Dissertations

2013

Latent Dirichlet Allocation

Articles 1 - 2 of 2

Full-Text Articles in Entire DC Network

A Topics Analysis Model For Health Insurance Claims, Jared Anthony Webb Oct 2013

A Topics Analysis Model For Health Insurance Claims, Jared Anthony Webb

Theses and Dissertations

Mathematical probability has a rich theory and powerful applications. Of particular note is the Markov chain Monte Carlo (MCMC) method for sampling from high dimensional distributions that may not admit a naive analysis. We develop the theory of the MCMC method from first principles and prove its relevance. We also define a Bayesian hierarchical model for generating data. By understanding how data are generated we may infer hidden structure about these models. We use a specific MCMC method called a Gibbs' sampler to discover topic distributions in a hierarchical Bayesian model called Topics Over Time. We propose an innovative use …


A Classification Tool For Predictive Data Analysis In Healthcare, Mason Lemoyne Victors Mar 2013

A Classification Tool For Predictive Data Analysis In Healthcare, Mason Lemoyne Victors

Theses and Dissertations

Hidden Markov Models (HMMs) have seen widespread use in a variety of applications ranging from speech recognition to gene prediction. While developed over forty years ago, they remain a standard tool for sequential data analysis. More recently, Latent Dirichlet Allocation (LDA) was developed and soon gained widespread popularity as a powerful topic analysis tool for text corpora. We thoroughly develop LDA and a generalization of HMMs and demonstrate the conjunctive use of both methods in predictive data analysis for health care problems. While these two tools (LDA and HMM) have been used in conjunction previously, we use LDA in a …