Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability

Brigham Young University

2012

Bayesian mixture model

Articles 1 - 1 of 1

Full-Text Articles in Physical Sciences and Mathematics

Species Identification And Strain Attribution With Unassembled Sequencing Data, Owen Eric Francis Apr 2012

Species Identification And Strain Attribution With Unassembled Sequencing Data, Owen Eric Francis

Theses and Dissertations

Emerging sequencing approaches have revolutionized the way we can collect DNA sequence data for applications in bioforensics and biosurveillance. In this research, we present an approach to construct a database of known biological agents and use this database to develop a statistical framework to analyze raw reads from next-generation sequence data for species identification and strain attribution. Our method capitalizes on a Bayesian statistical framework that accommodates information on sequence quality, mapping quality and provides posterior probabilities of matches to a known database of target genomes. Importantly, our approach also incorporates the possibility that multiple species can be present in …