Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 11 of 11

Full-Text Articles in Physical Sciences and Mathematics

An Integrated Screening And Optimization Strategy, Nathaniel Jackson Rohbock Jul 2012

An Integrated Screening And Optimization Strategy, Nathaniel Jackson Rohbock

Theses and Dissertations

Within statistical methods, design of experiments (DOE) is well suited to make good inference from a minimal amount of data. Two types of designs within DOE are screening designs and optimization designs. Traditionally, these approaches have been necessarily separated by a gap between the objectives of each design and the methods available. Despite being so separated, in practice these designs are frequently connected by sequential experimentation. In fact, from the genesis of a project, the experimentor often knows that both designs will be necessary to accomplish his objectives. Due to advances in the understanding of experimental designs with complex aliasing …


An Applied Investigation Of Gaussian Markov Random Fields, Jessica Lyn Olsen Jun 2012

An Applied Investigation Of Gaussian Markov Random Fields, Jessica Lyn Olsen

Theses and Dissertations

Recently, Bayesian methods have become the essence of modern statistics, specifically, the ability to incorporate hierarchical models. In particular, correlated data, such as the data found in spatial and temporal applications, have benefited greatly from the development and application of Bayesian statistics. One particular application of Bayesian modeling is Gaussian Markov Random Fields. These methods have proven to be very useful in providing a framework for correlated data. I will demonstrate the power of GMRFs by applying this method to two sets of data; a set of temporal data involving car accidents in the UK and a set of spatial …


Xprime-Em: Eliciting Expert Prior Information For Motif Exploration Using The Expectation-Maximization Algorithm, Wei Zhou Jun 2012

Xprime-Em: Eliciting Expert Prior Information For Motif Exploration Using The Expectation-Maximization Algorithm, Wei Zhou

Theses and Dissertations

Understanding the possible mechanisms of gene transcription regulation is a primary challenge for current molecular biologists. Identifying transcription factor binding sites (TFBSs), also called DNA motifs, is an important step in understanding these mechanisms. Furthermore, many human diseases are attributed to mutations in TFBSs, which makes identifying those DNA motifs significant for disease treatment. Uncertainty and variations in specific nucleotides of TFBSs present difficulties for DNA motif searching. In this project, we present an algorithm, XPRIME-EM (Eliciting EXpert PRior Information for Motif Exploration using the Expectation-Maximization Algorithm), which can discover known and de novo (unknown) DNA motifs simultaneously from a …


Estimation Of The Effects Of Parental Measures On Child Aggression Using Structural Equation Modeling, Jordan Daniel Pyper Jun 2012

Estimation Of The Effects Of Parental Measures On Child Aggression Using Structural Equation Modeling, Jordan Daniel Pyper

Theses and Dissertations

A child's parents are the primary source of knowledge and learned behaviors for developing children, and the benefits or repercussions of certain parental practices can be long lasting. Although parenting practices affect behavioral outcomes for children, families tend to be diverse in their circumstances and needs. Research attempting to ascertain cause and effect relationships between parental influences and child behavior can be difficult due to the complex nature of family dynamics and the intricacies of real life. Structural equation modeling (SEM) is an appropriate method for this research as it is able to account for the complicated nature of child-parent …


Support Vector Machines For Classification And Imputation, Spencer David Rogers May 2012

Support Vector Machines For Classification And Imputation, Spencer David Rogers

Theses and Dissertations

Support vector machines (SVMs) are a powerful tool for classification problems. SVMs have only been developed in the last 20 years with the availability of cheap and abundant computing power. SVMs are a non-statistical approach and make no assumptions about the distribution of the data. Here support vector machines are applied to a classic data set from the machine learning literature and the out-of-sample misclassification rates are compared to other classification methods. Finally, an algorithm for using support vector machines to address the difficulty in imputing missing categorical data is proposed and its performance is demonstrated under three different scenarios …


Species Identification And Strain Attribution With Unassembled Sequencing Data, Owen Eric Francis Apr 2012

Species Identification And Strain Attribution With Unassembled Sequencing Data, Owen Eric Francis

Theses and Dissertations

Emerging sequencing approaches have revolutionized the way we can collect DNA sequence data for applications in bioforensics and biosurveillance. In this research, we present an approach to construct a database of known biological agents and use this database to develop a statistical framework to analyze raw reads from next-generation sequence data for species identification and strain attribution. Our method capitalizes on a Bayesian statistical framework that accommodates information on sequence quality, mapping quality and provides posterior probabilities of matches to a known database of target genomes. Importantly, our approach also incorporates the possibility that multiple species can be present in …


Hitters Vs. Pitchers: A Comparison Of Fantasy Baseball Player Performances Using Hierarchical Bayesian Models, Scott D. Huddleston Apr 2012

Hitters Vs. Pitchers: A Comparison Of Fantasy Baseball Player Performances Using Hierarchical Bayesian Models, Scott D. Huddleston

Theses and Dissertations

In recent years, fantasy baseball has seen an explosion in popularity. Major League Baseball, with its long, storied history and the enormous quantity of data available, naturally lends itself to the modern-day recreational activity known as fantasy baseball. Fantasy baseball is a game in which participants manage an imaginary roster of real players and compete against one another using those players' real-life statistics to score points. Early forms of fantasy baseball began in the early 1960s, but beginning in the 1990s, the sport was revolutionized due to the advent of powerful computers and the Internet. The data used in this …


Using An Experimental Mixture Design To Identify Experimental Regions With High Probability Of Creating A Homogeneous Monolithic Column Capable Of Flow, Charles C. Willden Apr 2012

Using An Experimental Mixture Design To Identify Experimental Regions With High Probability Of Creating A Homogeneous Monolithic Column Capable Of Flow, Charles C. Willden

Theses and Dissertations

Graduate students in the Brigham Young University Chemistry Department are working to develop a filtering device that can be used to separate substances into their constituent parts. The device consists of a monomer and water mixture that is polymerized into a monolith inside of a capillary. The ideal monolith is completely solid with interconnected pores that are small enough to cause the constituent parts to pass through the capillary at different rates, effectively separating the substance. Although the end objective is to minimize pore sizes, it is necessary to first identify an experimental region where any combination of input variables …


Bayesian Pollution Source Apportionment Incorporating Multiple Simultaneous Measurements, Jonathan Casey Christensen Mar 2012

Bayesian Pollution Source Apportionment Incorporating Multiple Simultaneous Measurements, Jonathan Casey Christensen

Theses and Dissertations

We describe a method to estimate pollution profiles and contribution levels for distinct prominent pollution sources in a region based on daily pollutant concentration measurements from multiple measurement stations over a period of time. In an extension of existing work, we will estimate common source profiles but distinct contribution levels based on measurements from each station. In addition, we will explore the possibility of extending existing work to allow adjustments for synoptic regimes—large scale weather patterns which may effect the amount of pollution measured from individual sources as well as for particular pollutants. For both extensions we propose Bayesian methods …


Predicting Maximal Oxygen Consumption (Vo2max) Levels In Adolescents, Brent A. Shepherd Mar 2012

Predicting Maximal Oxygen Consumption (Vo2max) Levels In Adolescents, Brent A. Shepherd

Theses and Dissertations

Maximal oxygen consumption (VO2max) is considered by many to be the best overall measure of an individual's cardiovascular health. Collecting the measurement, however, requires subjecting an individual to prolonged periods of intense exercise until their maximal level, the point at which their body uses no additional oxygen from the air despite increased exercise intensity, is reached. Collecting VO2max data also requires expensive equipment and great subject discomfort to get accurate results. Because of this inherent difficulty, it is often avoided despite its usefulness. In this research, we propose a set of Bayesian hierarchical models to predict VO2max levels in adolescents, …


The Effect Of Smoking On Tuberculosis Incidence In Burdened Countries, Natalie Noel Ellison Mar 2012

The Effect Of Smoking On Tuberculosis Incidence In Burdened Countries, Natalie Noel Ellison

Theses and Dissertations

It is estimated that one third of the world's population is infected with tuberculosis. Though once thought a "dead" disease, tuberculosis is very much alive. The rise of drug resistant strains of tuberculosis, and TB-HIV coinfection have made tuberculosis an even greater worldwide threat. While HIV, poverty, and public health infrastructure are historically assumed to affect the burden of tuberculosis, recent research has been done to implicate smoking in this list. This analysis involves combining data from multiple sources in order determine if smoking is a statistically significant factor in predicting the number of incident tuberculosis cases in a country. …