Open Access. Powered by Scholars. Published by Universities.®

Applied Statistics Commons

Open Access. Powered by Scholars. Published by Universities.®

Bayesian

Discipline
Institution
Publication Year
Publication
Publication Type

Articles 1 - 29 of 29

Full-Text Articles in Applied Statistics

Comparative Analysis Of Teacher Effects Parameters In Models Used For Assessing School Effectiveness: Value-Added Models & Persistence, Merlin J. Kamgue Dec 2023

Comparative Analysis Of Teacher Effects Parameters In Models Used For Assessing School Effectiveness: Value-Added Models & Persistence, Merlin J. Kamgue

Graduate Theses and Dissertations

Longitudinal measures for students have become increasingly popular to estimate the effects of individual teachers and schools. Value-added models are one of the approaches using longitudinal data to evaluate teachers and schools. In the value-added model (VAM) literature, many statistical approaches have been developed and used to estimate teacher or school effects on student learning. This study opted to use a Bayesian multivariate model for evaluating teacher effects. The generalized persistence models can handle longitudinal data, not vertically scaled, allowing for a below-par teacher’s effects correlation across test administrations. This study first generated longitudinal students’ test score data and used …


A New Method To Determine The Posterior Distribution Of Coefficient Alpha, John Mart V. Delosreyes Oct 2023

A New Method To Determine The Posterior Distribution Of Coefficient Alpha, John Mart V. Delosreyes

Psychology Theses & Dissertations

There is a focus within the behavioral/social sciences on non-physical, psychological constructs (i.e., constructs). These constructs are indirectly measured using measurement instruments that consist of questions that capture the manifestations of these constructs. The indirect nature of measuring constructs results in a need of ensuring that measurement instruments are reliable. The most popular statistic used to estimate reliability is coefficient alpha as it is easy to compute and has properties that make it desirable to use. Coefficient alpha’s popularity has resulted in a wide breadth of research into its qualities. Notably, research about coefficient alpha’s distribution has led to developments …


Self-Learning Algorithms For Intrusion Detection And Prevention Systems (Idps), Juan E. Nunez, Roger W. Tchegui Donfack, Rohit Rohit, Hayley Horn Mar 2023

Self-Learning Algorithms For Intrusion Detection And Prevention Systems (Idps), Juan E. Nunez, Roger W. Tchegui Donfack, Rohit Rohit, Hayley Horn

SMU Data Science Review

Today, there is an increased risk to data privacy and information security due to cyberattacks that compromise data reliability and accessibility. New machine learning models are needed to detect and prevent these cyberattacks. One application of these models is cybersecurity threat detection and prevention systems that can create a baseline of a network's traffic patterns to detect anomalies without needing pre-labeled data; thus, enabling the identification of abnormal network events as threats. This research explored algorithms that can help automate anomaly detection on an enterprise network using Canadian Institute for Cybersecurity data. This study demonstrates that Neural Networks with Bayesian …


Bayesian Methods For Graphical Models With Neighborhood Selection., Sagnik Bhadury Dec 2022

Bayesian Methods For Graphical Models With Neighborhood Selection., Sagnik Bhadury

Electronic Theses and Dissertations

Graphical models determine associations between variables through the notion of conditional independence. Gaussian graphical models are a widely used class of such models, where the relationships are formalized by non-null entries of the precision matrix. However, in high-dimensional cases, covariance estimates are typically unstable. Moreover, it is natural to expect only a few significant associations to be present in many realistic applications. This necessitates the injection of sparsity techniques into the estimation method. Classical frequentist methods, like GLASSO, use penalization techniques for this purpose. Fully Bayesian methods, on the contrary, are slow because they require iteratively sampling over a quadratic …


A Copula Model Approach To Identify The Differential Gene Expression, Prasansha Liyanaarachchi Dec 2021

A Copula Model Approach To Identify The Differential Gene Expression, Prasansha Liyanaarachchi

Mathematics & Statistics Theses & Dissertations

Deoxyribonucleic acid, more commonly known as DNA, is a complex double helix-shaped molecule present in all living organisms and hosts thousands of genes. However, only a few genes exhibit differential expression and play a vital role in a particular disease such as breast cancer. Microarray technology is one of the modern technologies developed to study these gene expressions. There are two major microarray technologies available for expression analysis: Spotted cDNA array and oligonucleotide array. The focus of our research is the statistical analysis of data that arises from the spotted cDNA microarray. Numerous models have been proposed in the literature …


Parametric, Nonparametric, And Semiparametric Linear Regression In Classical And Bayesian Statistical Quality Control, Chelsea L. Jones Jan 2021

Parametric, Nonparametric, And Semiparametric Linear Regression In Classical And Bayesian Statistical Quality Control, Chelsea L. Jones

Theses and Dissertations

Statistical process control (SPC) is used in many fields to understand and monitor desired processes, such as manufacturing, public health, and network traffic. SPC is categorized into two phases; in Phase I historical data is used to inform parameter estimates for a statistical model and Phase II implements this statistical model to monitor a live ongoing process. Within both phases, profile monitoring is a method to understand the functional relationship between response and explanatory variables by estimating and tracking its parameters. In profile monitoring, control charts are often used as graphical tools to visually observe process behaviors. We construct a …


Bayesian Topological Machine Learning, Christopher A. Oballe Aug 2020

Bayesian Topological Machine Learning, Christopher A. Oballe

Doctoral Dissertations

Topological data analysis encompasses a broad set of ideas and techniques that address 1) how to rigorously define and summarize the shape of data, and 2) use these constructs for inference. This dissertation addresses the second problem by developing new inferential tools for topological data analysis and applying them to solve real-world data problems. First, a Bayesian framework to approximate probability distributions of persistence diagrams is established. The key insight underpinning this framework is that persistence diagrams may be viewed as Poisson point processes with prior intensities. With this assumption in hand, one may compute posterior intensities by adopting techniques …


Bayesian Analysis Of Extended Cox Model With Time-Varying Covariates Using Bootstrap Prior, Oyebayo R. Olaniran, Mohd Asrul A. Abdullah Jul 2020

Bayesian Analysis Of Extended Cox Model With Time-Varying Covariates Using Bootstrap Prior, Oyebayo R. Olaniran, Mohd Asrul A. Abdullah

Journal of Modern Applied Statistical Methods

A new Bayesian estimation procedure for extended cox model with time varying covariate was presented. The prior was determined using bootstrapping technique within the framework of parametric empirical Bayes. The efficiency of the proposed method was observed using Monte Carlo simulation of extended Cox model with time varying covariates under varying scenarios. Validity of the proposed method was also ascertained using real life data set of Stanford heart transplant. Comparison of the proposed method with its competitor established appreciable supremacy of the method.


Small Area Estimation On Zero-Inflated Data Using Frequentist And Bayesian Approach, Kusman Sadik, Rahma Anisa, Euis Aqmaliyah Feb 2020

Small Area Estimation On Zero-Inflated Data Using Frequentist And Bayesian Approach, Kusman Sadik, Rahma Anisa, Euis Aqmaliyah

Journal of Modern Applied Statistical Methods

The most commonly used method of small area estimation (SAE) is the empirical best linear unbiased prediction method based on a linear mixed model. However, it is not appropriate in the case of the zero-inflated target variable with a mixture of zeros and continuously distributed positive values. Therefore, various model-based SAE methods for zero-inflated data are developed, such as the Frequentist approach and the Bayesian approach. Both approaches are compared with the survey regression (SR) method which ignores the presence of zero-inflation in the data. The results show that the two SAE approaches for zero-inflated data are capable to yield …


Habitat Associations And Reproduction Of Fishes On The Northwestern Gulf Of Mexico Shelf Edge, Elizabeth Marie Keller Nov 2019

Habitat Associations And Reproduction Of Fishes On The Northwestern Gulf Of Mexico Shelf Edge, Elizabeth Marie Keller

LSU Doctoral Dissertations

Several of the northwestern Gulf of Mexico (GOM) shelf-edge banks provide critical hard bottom habitat for coral and fish communities, supporting a wide diversity of ecologically and economically important species. These sites may be fish aggregation and spawning sites and provide important habitat for fish growth and reproduction. Already designated as habitat areas of particular concern, many of these banks are also under consideration for inclusion in the expansion of the Flower Garden Banks National Marine Sanctuary. This project aimed to gain a more comprehensive understanding of the communities and fish species on shelf-edge banks by way of gonad histology, …


Allocative Poisson Factorization For Computational Social Science, Aaron Schein Jul 2019

Allocative Poisson Factorization For Computational Social Science, Aaron Schein

Doctoral Dissertations

Social science data often comes in the form of high-dimensional discrete data such as categorical survey responses, social interaction records, or text. These data sets exhibit high degrees of sparsity, missingness, overdispersion, and burstiness, all of which present challenges to traditional statistical modeling techniques. The framework of Poisson factorization (PF) has emerged in recent years as a natural way to model high-dimensional discrete data sets. This framework assumes that each observed count in a data set is a Poisson random variable $y ~ Pois(\mu)$ whose rate parameter $\mu$ is a function of shared model parameters. This thesis examines a specific …


A Bayesian Framework For Estimating Seismic Wave Arrival Time, Hua Zhong May 2019

A Bayesian Framework For Estimating Seismic Wave Arrival Time, Hua Zhong

Graduate Theses and Dissertations

Because earthquakes have a large impact on human society, statistical methods for better studying earthquakes are required. One characteristic of earthquakes is the arrival time of seismic waves at a seismic signal sensor. Once we can estimate the earthquake arrival time accurately, the earthquake location can be triangulated, and assistance can be sent to that area correctly. This study presents a Bayesian framework to predict the arrival time of seismic waves with associated uncertainty. We use a change point framework to model the different conditions before and after the seismic wave arrives. To evaluate the performance of the model, we …


Bayesian Hierarchical Meta-Analysis Of Asymptomatic Ebola Seroprevalence, Peter Brody-Moore Jan 2019

Bayesian Hierarchical Meta-Analysis Of Asymptomatic Ebola Seroprevalence, Peter Brody-Moore

CMC Senior Theses

The continued study of asymptomatic Ebolavirus infection is necessary to develop a more complete understanding of Ebola transmission dynamics. This paper conducts a meta-analysis of eight studies that measure seroprevalence (the number of subjects that test positive for anti-Ebolavirus antibodies in their blood) in subjects with household exposure or known case-contact with Ebola, but that have shown no symptoms. In our two random effects Bayesian hierarchical models, we find estimated seroprevalences of 8.76% and 9.72%, significantly higher than the 3.3% found by a previous meta-analysis of these eight studies. We also produce a variation of this meta-analysis where we exclude …


Bayesian And Semi-Bayesian Estimation Of The Parameters Of Generalized Inverse Weibull Distribution, Kamaljit Kaur, Kalpana K. Mahajan, Sangeeta Arora Sep 2018

Bayesian And Semi-Bayesian Estimation Of The Parameters Of Generalized Inverse Weibull Distribution, Kamaljit Kaur, Kalpana K. Mahajan, Sangeeta Arora

Journal of Modern Applied Statistical Methods

Bayesian and semi-Bayesian estimators of parameters of the generalized inverse Weibull distribution are obtained using Jeffreys’ prior and informative prior under specific assumptions of loss function. Using simulation, the relative efficiency of the proposed estimators is obtained under different set-ups. A real life example is also given.


A Bayesian Variable Selection Method With Applications To Spatial Data, Xiahan Tang May 2017

A Bayesian Variable Selection Method With Applications To Spatial Data, Xiahan Tang

Graduate Theses and Dissertations

This thesis first describes the general idea behind Bayes Inference, various sampling methods based on Bayes theorem and many examples. Then a Bayes approach to model selection, called Stochastic Search Variable Selection (SSVS) is discussed. It was originally proposed by George and McCulloch (1993). In a normal regression model where the number of covariates is large, only a small subset tend to be significant most of the times. This Bayes procedure specifies a mixture prior for each of the unknown regression coefficient, the mixture prior was originally proposed by Geweke (1996). This mixture prior will be updated as data becomes …


Modelling Cash Crop Growth In Tn, Spencer Weston May 2017

Modelling Cash Crop Growth In Tn, Spencer Weston

Chancellor’s Honors Program Projects

No abstract provided.


Regularized Neural Network To Identify Potential Breast Cancer: A Bayesian Approach, Hansapani S. Rodrigo, Chris P. Tsokos, Taysseer Sharaf Nov 2016

Regularized Neural Network To Identify Potential Breast Cancer: A Bayesian Approach, Hansapani S. Rodrigo, Chris P. Tsokos, Taysseer Sharaf

Journal of Modern Applied Statistical Methods

In the current study, we have exemplified the use of Bayesian neural networks for breast cancer classification using the evidence procedure. The optimal Bayesian network has 81% overall accuracy in correctly classifying the true status of breast cancer patients, 59% sensitivity in correctly detecting the malignancy and 83% specificity in correctly detecting the non-malignancy. The area under the receiver operating characteristic curve (0.7940) shows that this is a moderate classification model.


Methods To Account For Breed Composition In A Bayesian Gwas Method Which Utilizes Haplotype Clusters, Danielle F. Wilson-Wells Aug 2016

Methods To Account For Breed Composition In A Bayesian Gwas Method Which Utilizes Haplotype Clusters, Danielle F. Wilson-Wells

Department of Statistics: Dissertations, Theses, and Student Work

In livestock, prediction of an animal’s genetic merit using genomic information is becoming increasingly common. The models used to make these predictions typically assume that we are sampling from a homogeneous population. However, in both commercial and experimental populations the sire and dam of an individual may be a mixture of different breeds. Haplotype models can capture this population structure.

Two models based on breed specific haplotype clusters where developed to account for differences across multiple breeds. The first model utilizes the breed composition of the individual, while the second utilizes the breed composition from the sire and dam. Haplotype …


Teaching The Quandary Of Statistical Jurisprudence: A Review-Essay On Math On Trial By Schneps And Colmez, Noah Giansiracusa Jul 2016

Teaching The Quandary Of Statistical Jurisprudence: A Review-Essay On Math On Trial By Schneps And Colmez, Noah Giansiracusa

Journal of Humanistic Mathematics

This review-essay on the mother-and-daughter collaboration Math on Trial stems from my recent experience using this book as the basis for a college freshman seminar on the interactions between math and law. I discuss the strengths and weaknesses of this book as an accessible introduction to this enigmatic yet deeply important topic. For those considering teaching from this text (a highly recommended endeavor) I offer some curricular suggestions.


A Bayesian Gwas Method Utilizing Haplotype Clusters For A Composite Breed Population, Danielle F. Wilson-Wells, Stephen D. Kachman May 2016

A Bayesian Gwas Method Utilizing Haplotype Clusters For A Composite Breed Population, Danielle F. Wilson-Wells, Stephen D. Kachman

Conference on Applied Statistics in Agriculture

Commercial beef cattle are often composites of multiple breeds. Current methods used to produce genomic predictors are based on the underlying assumption of animals being sampled from a homogeneous population. As a result, the predictors can perform poorly when used to predict the relative genetic merit of animals whose breed composition are different. In part, this is due to the changes in linkage disequilibrium between the markers and the quantitative trait loci as we move from one breed to the next. An alternative model based on breed specific haplotype clusters was developed to allow for differences in linkage disequilibrium across …


A Comparison Of Estimation Methods For Nonlinear Mixed-Effects Models Under Model Misspecification And Data Sparseness: A Simulation Study, Jeffrey R. Harring, Junhui Liu May 2016

A Comparison Of Estimation Methods For Nonlinear Mixed-Effects Models Under Model Misspecification And Data Sparseness: A Simulation Study, Jeffrey R. Harring, Junhui Liu

Journal of Modern Applied Statistical Methods

A Monte Carlo simulation is employed to investigate the performance of five estimation methods of nonlinear mixed effects models in terms of parameter recovery and efficiency of both regression coefficients and variance/covariance parameters under varying levels of data sparseness and model misspecification.


Objective Priors For Estimation Of Extended Exponential Geometric Distribution, Pedro L. Ramos, Fernando A. Moala, Jorge A. Achcar Nov 2014

Objective Priors For Estimation Of Extended Exponential Geometric Distribution, Pedro L. Ramos, Fernando A. Moala, Jorge A. Achcar

Journal of Modern Applied Statistical Methods

A Bayesian analysis was developed with different noninformative prior distributions such as Jeffreys, Maximal Data Information, and Reference. The aim was to investigate the effects of each prior distribution on the posterior estimates of the parameters of the extended exponential geometric distribution, based on simulated data and a real application.


Exonest: Bayesian Model Selection Applied To The Detection And Characterization Of Exoplanets Via Photometric Variations, Ben Placek, Kevin H. Knuth, Daniel Angerhausen Oct 2014

Exonest: Bayesian Model Selection Applied To The Detection And Characterization Of Exoplanets Via Photometric Variations, Ben Placek, Kevin H. Knuth, Daniel Angerhausen

Physics Faculty Scholarship

EXONEST is an algorithm dedicated to detecting and characterizing the photometric signatures of exoplanets, which include reflection and thermal emission, Doppler boosting, and ellipsoidal variations. Using Bayesian inference, we can test between competing models that describe the data as well as estimate model parameters. We demonstrate this approach by testing circular versus eccentric planetary orbital models, as well as testing for the presence or absence of four photometric effects. In addition to using Bayesian model selection, a unique aspect of EXONEST is the potential capability to distinguish between reflective and thermal contributions to the light curve. A case study is …


Dynamic Bayesian Approaches To The Statistical Calibration Problem, Derick Lorenzo Rivers Jan 2014

Dynamic Bayesian Approaches To The Statistical Calibration Problem, Derick Lorenzo Rivers

Theses and Dissertations

The problem of statistical calibration of a measuring instrument can be framed both in a statistical context as well as in an engineering context. In the first, the problem is dealt with by distinguishing between the "classical" approach and the "inverse" regression approach. Both of these models are static models and are used to estimate "exact" measurements from measurements that are affected by error. In the engineering context, the variables of interest are considered to be taken at the time at which you observe the measurement. The Bayesian time series analysis method of Dynamic Linear Models (DLM) can be used …


Robust Predictive Inference For Multivariate Linear Models With Elliptically Contoured Distribution Using Bayesian, Classical And Structural Approaches, B. M. Golam Kibria Nov 2008

Robust Predictive Inference For Multivariate Linear Models With Elliptically Contoured Distribution Using Bayesian, Classical And Structural Approaches, B. M. Golam Kibria

Journal of Modern Applied Statistical Methods

Predictive distributions of future response and future regression matrices under multivariate elliptically contoured distributions are discussed. Under the elliptically contoured response assumptions, these are identical to those obtained under matric normal or matric-t errors using structural, Bayesian with improper prior, or classical approaches. This gives inference robustness with respect to departure from the reference case of independent sampling from the matric normal or matric t to multivariate elliptically contoured distributions. The importance of the predictive distribution for skewed elliptical models is indicated; the elliptically contoured distribution, as well as matric t distribution, have significant applications in statistical practices.


On The Power Function Of Bayesian Tests With Application To Design Of Clinical Trials: The Fixed-Sample Case, Lyle Broemeling, Dongfeng Wu May 2005

On The Power Function Of Bayesian Tests With Application To Design Of Clinical Trials: The Fixed-Sample Case, Lyle Broemeling, Dongfeng Wu

Journal of Modern Applied Statistical Methods

Using a Bayesian approach to clinical trial design is becoming more common. For example, at the MD Anderson Cancer Center, Bayesian techniques are routinely employed in the design and analysis of Phase I and II trials. It is important that the operating characteristics of these procedures be determined as part of the process when establishing a stopping rule for a clinical trial. This study determines the power function for some common fixed-sample procedures in hypothesis testing, namely the one and two-sample tests involving the binomial and normal distributions. Also considered is a Bayesian test for multi-response (response and toxicity) in …


An Overview Of The Respondent-Generated Intervals (Rgi) Approach To Sample Surveys, S. James Press, Judith M. Tanur Nov 2004

An Overview Of The Respondent-Generated Intervals (Rgi) Approach To Sample Surveys, S. James Press, Judith M. Tanur

Journal of Modern Applied Statistical Methods

This article brings together many years of research on the Respondent-Generated Intervals (RGI) approach to recall in factual sample surveys. Additionally presented is new research on the use of RGI in opinion surveys and the use of RGI with gamma-distributed data. The research combines Bayesian hierarchical modeling with various cognitive aspects of sample surveys.


Vertical Integration In The Chicken Broiler Industry, Juana Sanchez Apr 1994

Vertical Integration In The Chicken Broiler Industry, Juana Sanchez

Conference on Applied Statistics in Agriculture

This paper analyzes three hypotheses concerning supply in the U.S. chicken broiler industry: (a) there has been a cycle in the industry of approximately 27-36 months length; (b) the seasonal and other periodic components, as well as relations between variables, have changed as a result of vertical integration in the industry; (c) the effects of vertical integration in the industry were counteracted in the early seventies by such forces external to the industry as domestic and international economic conditions .

The hypotheses are analyzed using new monthly, non-seasonally adjusted time series data for chick placement, wholesale broiler prices, chicks hatched …


The Prior Distribution In Bayesian Statistics, Kai-Tang Chen May 1979

The Prior Distribution In Bayesian Statistics, Kai-Tang Chen

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

A major problem associated with Bayesian estimation is selecting the prior distribution. The more recent literature on the selection of the prior is reviewed. Very little of a general nature on the selection of the prior is formed in the literature except for non-informative priors. This class of priors is seen to have limited usefulness. A method of selecting an informative prior is generalized in this thesis to include estimation of several parameters using a multivariate prior distribution. The concepts required for quantifying prior information is based on intuitive principles. In this way, it can be understood and controlled by …