Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Applied Statistics

Theses/Dissertations

2016

Institution
Keyword
Publication

Articles 1 - 30 of 42

Full-Text Articles in Physical Sciences and Mathematics

What Affects Parents’ Choice Of Milk? An Application Of Bayesian Model Averaging, Yingzhe Cheng Dec 2016

What Affects Parents’ Choice Of Milk? An Application Of Bayesian Model Averaging, Yingzhe Cheng

Mathematics & Statistics ETDs

This study identifies the factors that influence parents’ choice of milk for their children, using data from a unique survey administered in 2013 in Hunan province, China. In this survey, we identified two brands of milk, which differ in their prices and safety claims by the producer. Data were collected on parents’ choice of milk between the two brands, demographics, attitude towards food safety and behaviors related to food. Stepwise model selection and Bayesian model averaging (BMA) are used to search for influential factors. The two approaches consistently select the same factors suggested by an economic theoretical model, including price …


Economic Opportunity And Young Adult Mortality: Variations By Race/Ethnicity And Gender, Jocelyn Mineo Dec 2016

Economic Opportunity And Young Adult Mortality: Variations By Race/Ethnicity And Gender, Jocelyn Mineo

Honors Projects

This study examines the relationship between economic opportunity and adolescent and young adult mortality in the United States. In addition, this study explores other variables, such as social support and rurality, and their link to young adult mortality rates. First, we examined the link between economic opportunity and all-cause mortality rates for youth ages 15 to 34 in the United States. Given the increasing racial and ethnic diversity of America’s youth, we pay particular attention to race/ethnic differences. We also examine the differences in mortality by gender.


Development And Validation Of The Statistics Assessment Of Graduate Students, Dammika Lakmal Walpitage Dec 2016

Development And Validation Of The Statistics Assessment Of Graduate Students, Dammika Lakmal Walpitage

Doctoral Dissertations

This study developed the Statistics Assessment of Graduate Students (SAGS) instrument, and established its preliminary item characteristics, reliability, and validity evidence. Even though there are limited number of assessments available for measuring different aspects of statistical cognition, these previously available assessments have numerous limitations. The SAGS instrument was developed using Rasch modeling approach to create a new measure of statistical research methodology knowledge of graduate students in education and other behavioral and social sciences. Thirty-five multiple-choice questions were written with stems representing applied research situations and response options distinguishing between appropriate use of various statistical tests or procedures. A focus …


A Traders Guide To The Predictive Universe- A Model For Predicting Oil Price Targets And Trading On Them, Jimmie Harold Lenz Dec 2016

A Traders Guide To The Predictive Universe- A Model For Predicting Oil Price Targets And Trading On Them, Jimmie Harold Lenz

Doctor of Business Administration Dissertations

At heart every trader loves volatility; this is where return on investment comes from, this is what drives the proverbial “positive alpha.” As a trader, understanding the probabilities related to the volatility of prices is key, however if you could also predict future prices with reliability the world would be your oyster. To this end, I have achieved three goals with this dissertation, to develop a model to predict future short term prices (direction and magnitude), to effectively test this by generating consistent profits utilizing a trading model developed for this purpose, and to write a paper that anyone with …


Monte Carlo Methods In Bayesian Inference: Theory, Methods And Applications, Huarui Zhang Dec 2016

Monte Carlo Methods In Bayesian Inference: Theory, Methods And Applications, Huarui Zhang

Graduate Theses and Dissertations

Monte Carlo methods are becoming more and more popular in statistics due to the fast development of efficient computing technologies. One of the major beneficiaries of this advent is the field of Bayesian inference. The aim of this thesis is two-fold: (i) to explain the theory justifying the validity of the simulation-based schemes in a Bayesian setting (why they should work) and (ii) to apply them in several different types of data analysis that a statistician has to routinely encounter. In Chapter 1, I introduce key concepts in Bayesian statistics. Then we discuss Monte Carlo Simulation methods in detail. Our …


A Multi-Indexed Logistic Model For Time Series, Xiang Liu Dec 2016

A Multi-Indexed Logistic Model For Time Series, Xiang Liu

Electronic Theses and Dissertations

In this thesis, we explore a multi-indexed logistic regression (MILR) model, with particular emphasis given to its application to time series. MILR includes simple logistic regression (SLR) as a special case, and the hope is that it will in some instances also produce significantly better results. To motivate the development of MILR, we consider its application to the analysis of both simulated sine wave data and stock data. We looked at well-studied SLR and its application in the analysis of time series data. Using a more sophisticated representation of sequential data, we then detail the implementation of MILR. We compare …


Multiscale Wind Modelling For Sustainability And Resilience, Djordje Romanic Oct 2016

Multiscale Wind Modelling For Sustainability And Resilience, Djordje Romanic

Electronic Thesis and Dissertation Repository

The research presented herein is a mix of meteorological and wind engineering disciplines. In many cases, there is a gap between these two fields and this thesis is an attempt to bridge that gap through multiscale wind modelling approaches. Data and methods used in this study cover a multitude of spatial and temporal scales. Applications are in the fields of sustainability and resilience. This relationship between multiscale wind modelling and sustainability and resilience is investigated examining several case studies of three different developments: urban, rural and coastal.

An urban wind modelling methodology is proposed and applied for a specific development …


Advances In Portmanteau Diagnostic Tests, Jinkun Xiao Sep 2016

Advances In Portmanteau Diagnostic Tests, Jinkun Xiao

Electronic Thesis and Dissertation Repository

Portmanteau test serves an important role in model diagnostics for Box-Jenkins Modelling procedures. A large number of Portmanteau test based on the autocorrelation function are proposed for a general purpose goodness-of-fit test. Since the asymptotic distributions for the statistics has a complicated form which makes it hard to obtain the p-value directly, the gamma approximation is introduced to obtain the p-value. But the approximation will inevitably introduce approximation errors and needs a large number of observations to yield a good approximation. To avoid some pitfalls in the approximation, the Lin-Mcleod Test is further proposed to obtain a numeric solution to …


Some Nonparametric Ordered Restricted Inference Problems In The Context Of A Statistical Education Study, Bradford M. Dykes Aug 2016

Some Nonparametric Ordered Restricted Inference Problems In The Context Of A Statistical Education Study, Bradford M. Dykes

Dissertations

Over the past 10 years, the Department of Statistics at Western Michigan University has developed a question generating system that can be used for creating multiple forms of exams, quizzes and homework for online and face-to-face use. This system can also be used to provide students with a form of instantaneous feedback. With the goal of analyzing how different levels of feedback in an online learning environment impacts students' performance on assignments, this study presents data collected on two semesters of students enrolled in three different meeting types (strictly online, typical face-to-face, and honors face-to-face) of an introductory Statistics course. …


Variable Selection Via Penalized Regression And The Genetic Algorithm Using Information Complexity, With Applications For High-Dimensional -Omics Data, Tyler J. Massaro Aug 2016

Variable Selection Via Penalized Regression And The Genetic Algorithm Using Information Complexity, With Applications For High-Dimensional -Omics Data, Tyler J. Massaro

Doctoral Dissertations

This dissertation is a collection of examples, algorithms, and techniques for researchers interested in selecting influential variables from statistical regression models. Chapters 1, 2, and 3 provide background information that will be used throughout the remaining chapters, on topics including but not limited to information complexity, model selection, covariance estimation, stepwise variable selection, penalized regression, and especially the genetic algorithm (GA) approach to variable subsetting.

In chapter 4, we fully develop the framework for performing GA subset selection in logistic regression models. We present advantages of this approach against stepwise and elastic net regularized regression in selecting variables from a …


The Influence Of The Electric Supply Industry On Economic Growth In Less Developed Countries, Edward Richard Bee Aug 2016

The Influence Of The Electric Supply Industry On Economic Growth In Less Developed Countries, Edward Richard Bee

Dissertations

This study measures the impact that electrical outages have on manufacturing production in 135 less developed countries using stochastic frontier analysis and data from World Bank’s Investment Climate surveys. Outages of electricity, for firms with and without backup power sources, are the most frequently cited constraint on manufacturing growth in these surveys.

Outages are shown to reduce output below the production frontier by almost five percent in Africa and by a lower percentage in South Asia, Southeast Asia and the Middle East and North Africa. Production response to outages is quadratic in form. Outages also increase labor cost, reduce exports …


Utilizing Computed Tomography Image Features To Advance Prediction Of Radiation Pneumonitis, Shane P. Krafft Aug 2016

Utilizing Computed Tomography Image Features To Advance Prediction Of Radiation Pneumonitis, Shane P. Krafft

Dissertations & Theses (Open Access)

Improving outcomes for non-small-cell lung cancer patients treated with radiation therapy (RT) requires optimizing the balance between local tumor control and risk of normal tissue toxicity. In approximately 20% of patients, severe acute symptomatic lung toxicity, termed radiation pneumonitis (RP), still occurs. Identifying the individuals at risk of RP prior to or early during treatment offers tremendous potential to improve RT by providing the physician with information to assist in making clinical decisions that enhance therapy. Our central goal for this work was to demonstrate the potential gain in predictive accuracy of normal tissue complication probability models for RP by …


Advanced Sequential Monte Carlo Methods And Their Applications To Sparse Sensor Network For Detection And Estimation, Kai Kang Aug 2016

Advanced Sequential Monte Carlo Methods And Their Applications To Sparse Sensor Network For Detection And Estimation, Kai Kang

Doctoral Dissertations

The general state space models present a flexible framework for modeling dynamic systems and therefore have vast applications in many disciplines such as engineering, economics, biology, etc. However, optimal estimation problems of non-linear non-Gaussian state space models are analytically intractable in general. Sequential Monte Carlo (SMC) methods become a very popular class of simulation-based methods for the solution of optimal estimation problems. The advantages of SMC methods in comparison with classical filtering methods such as Kalman Filter and Extended Kalman Filter are that they are able to handle non-linear non-Gaussian scenarios without relying on any local linearization techniques. In this …


Multilevel Models For Longitudinal Data, Aastha Khatiwada Aug 2016

Multilevel Models For Longitudinal Data, Aastha Khatiwada

Electronic Theses and Dissertations

Longitudinal data arise when individuals are measured several times during an ob- servation period and thus the data for each individual are not independent. There are several ways of analyzing longitudinal data when different treatments are com- pared. Multilevel models are used to analyze data that are clustered in some way. In this work, multilevel models are used to analyze longitudinal data from a case study. Results from other more commonly used methods are compared to multilevel models. Also, comparison in output between two software, SAS and R, is done. Finally a method consisting of fitting individual models for each …


Spatio-Temporal Analysis Of Point Patterns, Abdul-Nasah Soale Aug 2016

Spatio-Temporal Analysis Of Point Patterns, Abdul-Nasah Soale

Electronic Theses and Dissertations

In this thesis, the basic tools of spatial statistics and time series analysis are applied to the case study of the earthquakes in a certain geographical region and time frame. Then some of the existing methods for joint analysis of time and space are described and applied. Finally, additional research questions about the spatial-temporal distribution of the earthquakes are posed and explored using statistical plots and models. The focus in the last section is in the relationship between number of events per year and maximum magnitude and its effect on how clustered the spatial distribution is and the relationship between …


Newsvendor Models With Monte Carlo Sampling, Ijeoma W. Ekwegh Aug 2016

Newsvendor Models With Monte Carlo Sampling, Ijeoma W. Ekwegh

Electronic Theses and Dissertations

Newsvendor Models with Monte Carlo Sampling by Ijeoma Winifred Ekwegh The newsvendor model is used in solving inventory problems in which demand is random. In this thesis, we will focus on a method of using Monte Carlo sampling to estimate the order quantity that will either maximizes revenue or minimizes cost given that demand is uncertain. Given data, the Monte Carlo approach will be used in sampling data over scenarios and also estimating the probability density function. A bootstrapping process yields an empirical distribution for the order quantity that will maximize the expected profit. Finally, this method will be used …


Joint Analysis Of Zero-Heavy Longitudinal Outcomes: Models And Comparison Of Study Designs, Erin R. Lundy Jul 2016

Joint Analysis Of Zero-Heavy Longitudinal Outcomes: Models And Comparison Of Study Designs, Erin R. Lundy

Electronic Thesis and Dissertation Repository

Understanding the patterns and mechanisms of the process of desistance from criminal activity is imperative for the development of effective sanctions and legal policy. Methodological challenges in the analysis of longitudinal criminal behaviour data include the need to develop methods for multivariate longitudinal discrete data, incorporating modulating exposure variables and several possible sources of zero-inflation. We develop new tools for zero-heavy joint outcome analysis which address these challenges and provide novel insights on processes related to offending patterns. Comparisons with existing approaches demonstrate the benefits of utilizing modeling frameworks which incorporate distinct sources of zeros. An additional concern in this …


Stochastic Processes And Their Applications To Change Point Detection Problems, Heng Yang Jun 2016

Stochastic Processes And Their Applications To Change Point Detection Problems, Heng Yang

Dissertations, Theses, and Capstone Projects

This dissertation addresses the change point detection problem when either the post-change distribution has uncertainty or the post-change distribution is time inhomogeneous. In the case of post-change distribution uncertainty, attention is drawn to the construction of a family of composite stopping times. It is shown that the proposed composite stopping time has third order optimality in the detection problem with Wiener observations and also provides information to distinguish the different values of post-change drift. In the case of post-change distribution uncertainty, a computationally efficient decision rule with low-complexity based on Cumulative Sum (CUSUM) algorithm is also introduced. In the time …


Metals Additive Manufacturing Powder Aging Characterization, Thomas Russell Lovejoy, Nicholas Karl Muetterties, David Takeo Otsu Jun 2016

Metals Additive Manufacturing Powder Aging Characterization, Thomas Russell Lovejoy, Nicholas Karl Muetterties, David Takeo Otsu

Mechanical Engineering

The metallic additive manufacturing process known as selective laser melting requires highly spherical, normally distributed powder with diameters in the range of 10 to 50 microns. Previous observations have shown a degradation in powder quality over time, resulting in unwanted characteristics in the final printed parts. 21-6-9 stainless steel powder was used to fabricate test parts, with leftover powder recycled back into the machine. Powder samples and test specimens were characterized to observe changes across build cycles. Few changes were observed in the physical and mechanical properties of the specimens, however, there were indications of chemical changes across cycles. Potential …


Classification Trees And Rule-Based Modeling Using The C5.0 Algorithm For Self-Image Across Sex And Race In St. Louis, Rohan Shirali May 2016

Classification Trees And Rule-Based Modeling Using The C5.0 Algorithm For Self-Image Across Sex And Race In St. Louis, Rohan Shirali

Arts & Sciences Electronic Theses and Dissertations

The study population comprised children, adolescents, and adults who were residents of the city of St. Louis at the time of data collection in 2015. The data collected includes sex, age, race, measured height and weight, self-reported height and weight, zip code, educational background, exercise and diet habits, and descriptions and strategies of participants' weight (i.e. overweight and trying to lose weight, respectively). I use the C5.0 algorithm to create classification trees and rule-based models to analyze this population. Specifically, I model a binary self-image variable as a function of sex, age, race, zip code, and a ratio of reported …


Spot Volatility Estimation Of Ito Semimartingales Using Delta Sequences, Weixuan Gao May 2016

Spot Volatility Estimation Of Ito Semimartingales Using Delta Sequences, Weixuan Gao

Arts & Sciences Electronic Theses and Dissertations

This thesis studies a unifying class of nonparametric spot volatility estimators proposed by Mancini et. al.(2013). This method is based on delta sequences and is conceived to include many of the existing estimators in the field as special cases. The thesis first surveys the asymptotic theory of the proposed estimators under an infill asymptotic scheme and fixed time horizon, when the state variable follows a Brownian semimartingale. Then, some extensions to include jumps and financial microstructure noise in the observed price process are also presented. The main goal of the thesis is to assess the suitability of the proposed methods …


Automated Sea State Classification From Parameterization Of Survey Observations And Wave-Generated Displacement Data, Jason A. Teichman May 2016

Automated Sea State Classification From Parameterization Of Survey Observations And Wave-Generated Displacement Data, Jason A. Teichman

University of New Orleans Theses and Dissertations

Sea state is a subjective quantity whose accuracy depends on an observer’s ability to translate local wind waves into numerical scales. It provides an analytical tool for estimating the impact of the sea on data quality and operational safety. Tasks dependent on the characteristics of local sea surface conditions often require accurate and immediate assessment. An attempt to automate sea state classification using eleven years of ship motion and sea state observation data is made using parametric modeling of distribution-based confidence and tolerance intervals and a probabilistic model using sea state frequencies. Models utilizing distribution intervals are not able to …


Population Projection And Habitat Preference Modeling Of The Endangered James Spinymussel (Pleurobema Collina), Marisa Draper May 2016

Population Projection And Habitat Preference Modeling Of The Endangered James Spinymussel (Pleurobema Collina), Marisa Draper

Senior Honors Projects, 2010-2019

The James Spinymussel (Pleurobema collina) is an endangered mussel species at the top of Virginia’s conservation list. The James Spinymussel plays a critical role in the environment by filtering and cleaning stream water while providing shelter and food for macroinvertebrates; however, conservation efforts are complicated by the mussels’ burrowing behavior, camouflage, and complex life cycle. The goals of the research conducted were to estimate detection probabilities that could be used to predict species presence and facilitate field work, and to track individually marked mussels to test for habitat preferences. Using existing literature and mark-recapture field data, these goals were accomplished …


Risk Estimation Toward A Natural History Model For Low Grade Glioma Patients, Anh Thi Hoang Pham May 2016

Risk Estimation Toward A Natural History Model For Low Grade Glioma Patients, Anh Thi Hoang Pham

Graduate Theses and Dissertations

Glioma is a common type of primary brain tumor that represents 28% of all brain tumors and 80% of malignant tumors. According to a recent study by the Centers for Disease Control and Prevention (CDC), gliomas account for 53%, 35% and 29% of all brain tumors (68%, 74% and 81% of malignant brain tumors) among children (aged 0-14), teenagers (aged 15-19) and young adults, respectively. Gliomas are often diagnosed through radiological imaging and histopathology. There are two main groups of gliomas following World Health Organization’s classification: Low grade gliomas (LGG), or grade I and II gliomas; and high grade gliomas …


Spread Trading In Corn Futures Market, Ryan D. Napier May 2016

Spread Trading In Corn Futures Market, Ryan D. Napier

Graduate Theses and Dissertations

The non-linear relationship between old crop – new crop year spreads in corn futures market and stock-to-use (S-U) ratios published by the United States Department of Agriculture is analyzed. Using a non-linear logarithmic smooth transition regression (LSTR) model, we capture asymmetric market behaviors in high and low S-U regimes. Capturing this relationship and understanding the non-linear aspects of the relationship is of interest of grain merchandizers and speculators in the market. A spread trading strategy is simulated for the sample period, January 1985 through April 2015, to determine if the non-linear relationship is a profitable arbitrage opportunity in the market.


Statistical Modeling Of The Temporal Dynamics In A Large Scale-Citation Network, Luis Javier Ek Jr. May 2016

Statistical Modeling Of The Temporal Dynamics In A Large Scale-Citation Network, Luis Javier Ek Jr.

Graduate Theses and Dissertations

Citation Networks of papers are vast networks that grow over time. The manner or the form a citation network grows is not entirely a random process, but a preferential attachment relationship; highly cited papers are more likely to be cited by newly published papers. The result is a network whose degree distribution follows a power law. This growth of citation network of papers will be modeled with a negative binomial regression coupled with logistic growth and/or Cauchy distribution curve. Then a Barabasi-Albert model, based on the negative binomial models, and a combination of the Dirichlet distribution and multinomial will be …


Identification Of Biomarkers For The Overall Survival Of Ovarian Cancer Patients, Kristi Mai May 2016

Identification Of Biomarkers For The Overall Survival Of Ovarian Cancer Patients, Kristi Mai

Graduate Theses and Dissertations

Rapid advance in sequencing technology has led to genome-wide analysis of genetic and epigenetic features simultaneously, making it possible to understand the biological mechanisms underlying cancer initiation and progression. However, how to identify important prognostic features poses a great challenge for both statistical modeling and computing. In this thesis, a network-based approach is applied to the Cancer Genome Atlas (TCGA) ovarian cancer data to identify important genes related to the overall survival of ovarian cancer patients. In the first step, a stepwise correlation-based selector is used to reduce the dimensionality of TCGA data, by filtering out a large number of …


Effects Of Bullying And Victimization On Friendship Selection, Reciprocation, And Maintenance In Elementary School Children, Marisa Lynn Whitley May 2016

Effects Of Bullying And Victimization On Friendship Selection, Reciprocation, And Maintenance In Elementary School Children, Marisa Lynn Whitley

Masters Theses

This study examined the effects of elementary school children’s bullying and victimization experiences on their friendships over time. The majority of children experience acts of aggression or bullying before the end of elementary school, and bullying and peer victimization is associated with academic, social, behavioral, and psychological difficulties. This study used social networks analysis (R SIENA 4.0) to examine whether peer reports of forms of bullying and victimization (i.e., overt and relational) affect the likelihood of friendship selection, reciprocation, and maintenance in 2nd-4th grade children. Children (N = 143) from the Midwestern region of the United …


Propensity Score Methods : A Simulation And Case Study Involving Breast Cancer Patients., John Craycroft May 2016

Propensity Score Methods : A Simulation And Case Study Involving Breast Cancer Patients., John Craycroft

Electronic Theses and Dissertations

Observational data presents unique challenges for analysis that are not encountered with experimental data resulting from carefully designed randomized controlled trials. Selection bias and unbalanced treatment assignments can obscure estimations of treatment effects, making the process of causal inference from observational data highly problematic. In 1983, Paul Rosenbaum and Donald Rubin formalized an approach for analyzing observational data that adjusts treatment effect estimates for the set of non-treatment variables that are measured at baseline. The propensity score is the conditional probability of assignment to a treatment group given the covariates. Using this score, one may balance the covariates across treatment …


Empirical Evaluation Of Different Features Of Design In Confirmatory Factor Analysis, Deyab Almaleki Apr 2016

Empirical Evaluation Of Different Features Of Design In Confirmatory Factor Analysis, Deyab Almaleki

Dissertations

Factor analysis (FA) is the study of variance within a group. Within-subject variance (WSV) is affected by multiple features in a study context, such as: the study experimental design (ED) and sampling design (SD), thus anything that influences or changes variance may affect the conclusions related to FA.

The aim of this study was to provide empirical evaluation of the influence of different aspects of ED and SD on WSV in the context of FA in terms of model precision and model estimate stability. Four Monte Carlo population correlation matrices were hypothesized based on different communality magnitudes (high, moderate, low, …