Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Theses/Dissertations

Statistics and Probability

Bayesian

Articles 1 - 30 of 65

Full-Text Articles in Physical Sciences and Mathematics

Comparative Analysis Of Teacher Effects Parameters In Models Used For Assessing School Effectiveness: Value-Added Models & Persistence, Merlin J. Kamgue Dec 2023

Graduate Theses and Dissertations

Longitudinal measures of student achievement have become increasingly popular for estimating the effects of individual teachers and schools. Value-added models (VAMs) are one approach that uses longitudinal data to evaluate teachers and schools, and the VAM literature offers many statistical approaches for estimating teacher or school effects on student learning. This study uses a Bayesian multivariate model to evaluate teacher effects. Generalized persistence models can handle longitudinal data that are not vertically scaled and allow a teacher’s effects to be less than perfectly correlated across test administrations. This study first generated longitudinal students’ test score data and used …


A New Method To Determine The Posterior Distribution Of Coefficient Alpha, John Mart V. Delosreyes Oct 2023

Psychology Theses & Dissertations

The behavioral and social sciences focus on non-physical, psychological constructs (hereafter, constructs). These constructs are measured indirectly, using instruments whose questions capture the manifestations of the constructs. Because the measurement is indirect, researchers need to ensure that measurement instruments are reliable. The most popular statistic used to estimate reliability is coefficient alpha, as it is easy to compute and has properties that make it desirable to use. Coefficient alpha’s popularity has resulted in a wide breadth of research into its qualities. Notably, research about coefficient alpha’s distribution has led to developments …
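
As a minimal illustration of why coefficient alpha is considered easy to compute, the sketch below evaluates the standard formula, alpha = k/(k-1) × (1 − sum of item variances / variance of total scores), in Python. The function name and the toy data are assumptions made for illustration; they are not taken from the thesis, which concerns the posterior distribution of the statistic rather than this point estimate.

```python
from statistics import variance


def coefficient_alpha(scores):
    """Coefficient (Cronbach's) alpha for a respondents-by-items score table.

    `scores` is a list of rows, one per respondent, each holding k item scores.
    Sample variances (n - 1 denominator) are used throughout.
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Variance of each item (column) across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical toy data: three respondents, two items.
toy_scores = [[1, 2], [2, 1], [3, 3]]
alpha_hat = coefficient_alpha(toy_scores)
```

With the toy table, both item variances are 1 and the total-score variance is 3, so the estimate works out to 2 × (1 − 2/3) = 2/3.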


Penalized Bayesian Exponential Random Graph Models., Vicki Modisette Aug 2023

Electronic Theses and Dissertations

Networks have the critical ability to represent the complex interconnectedness of social relationships, biological processes, and the spread of diseases and information. Exponential random graph models (ERGMs) are a popular class of statistical methods for analyzing network data. ERGMs, however, struggle with computational challenges and degeneracy issues, further exacerbated by their inability to handle high-dimensional network data. Bayesian techniques provide a promising avenue to overcome these two problems. This paper considers penalized Bayesian exponential random graph models with adaptive lasso and adaptive ridge penalties to perform variable selection and reduce multicollinearity on a variety of networks. The experimental results demonstrate …


Spatially Adaptive Estimation Of Spectrum, Yi Xie May 2023

Open Access Theses & Dissertations

A time series may be analyzed either in the time or in the frequency domain. When working in the frequency domain, the main objective is to estimate the underlying spectrum. Various approaches have been proposed to this end, but most are based on smoothing the periodogram using a single smoothing parameter across all Fourier frequencies. Such a global smoothing parameter may result in a biased estimate. To improve the estimation, in this paper, we smooth the log periodogram by placing a dynamic shrinkage prior, such that varying degrees of smoothing may be applied to different regions of the Fourier frequencies, …


Model-Based Imputation Of Below Detection Limit Missing Data And Group Selection In Bayesian Group Index Regression, Matthew Carli Jan 2023

Theses and Dissertations

Investigations into the association between chemical exposure and health outcomes are increasingly focused on the role of chemical mixtures, as opposed to individual chemicals. The analysis of chemical mixture data required the development of novel statistical methods, one of these being Bayesian group index regression. A statistical challenge common to all chemical mixture analyses is the ubiquitous presence of below detection limit (BDL) data. We propose an extension of Bayesian group index regression that treats both regression effects and missing BDL observations as parameters in a model estimated through a Markov Chain Monte Carlo algorithm that we refer to as …


Power Approximations For Generalized Linear Mixed Models In R Using Steep Priors On Variance Components, Sydney Geisler Dec 2022

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

When designing an experiment, researchers often want to know how likely they are to detect statistically significant effects in the resulting data, i.e., they want to estimate their statistical power. The probability distribution method is a flexible way to do this, and it is currently implemented in the statistical software package SAS. This method requires a hypothetical data set (showing the magnitude of hypothesized effects) and constant values of variance components, which are critical elements of the statistical models used. The statistical software package R is increasingly popular, but the probability distribution method has not yet been implemented in R, …


Bayesian Methods For Graphical Models With Neighborhood Selection., Sagnik Bhadury Dec 2022

Electronic Theses and Dissertations

Graphical models determine associations between variables through the notion of conditional independence. Gaussian graphical models are a widely used class of such models, where the relationships are formalized by non-null entries of the precision matrix. However, in high-dimensional cases, covariance estimates are typically unstable. Moreover, it is natural to expect only a few significant associations to be present in many realistic applications. This necessitates the injection of sparsity techniques into the estimation method. Classical frequentist methods, like GLASSO, use penalization techniques for this purpose. Fully Bayesian methods, on the contrary, are slow because they require iteratively sampling over a quadratic …


Dataset Evaluation For Data Trading Using Expected Loss And Homomorphic Encryption, Minsung Joo May 2022

Senior Honors Papers / Undergraduate Theses

Supervised machine learning suffers from the "garbage in, garbage out" phenomenon, where the performance of a model is limited by the quality of the data. While a myriad of data is collected every second, there is no general rigorous method of evaluating the quality of a given dataset. This hinders fair pricing of data in scenarios where a buyer may look to buy data for use with machine learning. In this work, I propose using the expected loss corresponding to a dataset as a measure of its quality, relying on Bayesian methods for uncertainty quantification. Furthermore, I present a secure multi-party computation …


A Copula Model Approach To Identify The Differential Gene Expression, Prasansha Liyanaarachchi Dec 2021

Mathematics & Statistics Theses & Dissertations

Deoxyribonucleic acid, more commonly known as DNA, is a complex double helix-shaped molecule present in all living organisms and hosts thousands of genes. However, only a few genes exhibit differential expression and play a vital role in a particular disease such as breast cancer. Microarray technology is one of the modern technologies developed to study these gene expressions. There are two major microarray technologies available for expression analysis: Spotted cDNA array and oligonucleotide array. The focus of our research is the statistical analysis of data that arises from the spotted cDNA microarray. Numerous models have been proposed in the literature …


Bayesian Calibration Of The ICRP Zirconium Biokinetic Model And Use Of Canned Priors For The Evaluation Of Bioassay, Thomas Raymond Labone Oct 2021

Theses and Dissertations

The International Commission on Radiological Protection (ICRP) publishes biokinetic models that relate measurements of radioactive material in the body and excreta (bioassay) to the amount of the material taken into the body (intake). Given the intake and the biokinetic model, radiation dose to organs and tissues can be calculated. The ICRP approximates the biokinetics of radioactive materials in the body with compartmental models expressed mathematically as a system of ordinary differential equations, for which they provide point estimates of the rate constants. Inaccurate estimates of intake and radiation dose can result in cases where the biokinetics of an individual differ …


Optimal Transport Driven Bayesian Inversion With Application To Signal Processing, Elijah F. Perez Jul 2021

Mathematics & Statistics ETDs

This paper will outline a Debiased Sinkhorn Divergence driven Bayesian inversion framework. Conventionally, a Gaussian Driven Bayesian framework is used when performing Bayesian inversion. A major issue with this Gaussian framework is that the Gaussian likelihood, driven by the L2 norm, is not affected by phase shift in a given signal. This issue has been addressed in [1] using a Wasserstein framework. However, the Wasserstein framework still has an issue because it assumes statistical independence when multidimensional signals are analyzed. This assumption of statistical independence cannot always be made when analyzing signals where multiple detectors are recording one event, say …


Bayesian Nonparametric Model For Functional Data Analysis, Tahmidul Islam Apr 2021

Theses and Dissertations

Functional data analysis (FDA) experienced a burst of growth after Ramsay and Silverman published their textbook in 1997. Functional data analysis interests researchers because of the challenges it adds to well-established multivariate analysis. Instead of finite-dimensional random vectors, the data are viewed as infinite-dimensional random functions, such as curves, images, and brain scans. A vast literature has been dedicated to developing models for functional data. The ideas are mostly based on basis function representations and kernel-based nonparametric methods. In this dissertation, we propose a Bayesian treatment of nonparametric functional data analysis by introducing a Gaussian process (GP) over the space …


Parametric, Nonparametric, And Semiparametric Linear Regression In Classical And Bayesian Statistical Quality Control, Chelsea L. Jones Jan 2021

Theses and Dissertations

Statistical process control (SPC) is used in many fields, such as manufacturing, public health, and network traffic, to understand and monitor processes of interest. SPC is divided into two phases: in Phase I, historical data are used to inform parameter estimates for a statistical model, and in Phase II, this model is used to monitor a live, ongoing process. Within both phases, profile monitoring is a method to understand the functional relationship between response and explanatory variables by estimating and tracking its parameters. In profile monitoring, control charts are often used as graphical tools to visually observe process behaviors. We construct a …


Modified-Half-Normal Distribution And Different Methods To Estimate Average Treatment Effect., Jingchao Sun Dec 2020

Electronic Theses and Dissertations

This dissertation consists of three projects related to the Modified-Half-Normal distribution and causal inference. In my first project, a new distribution called the Modified-Half-Normal distribution was introduced. I explored a few of its distributional properties, the procedures for generating random samples based on Bayesian approaches, and parameter estimation based on the method of moments. The second project deals with selection bias in estimating the average treatment effect (ATE) from observational data. I combined the propensity-score-based inverse probability of treatment weighting (IPTW) method and the directed acyclic graph (DAG) to solve this problem. The third project …


Bayesian Topological Machine Learning, Christopher A. Oballe Aug 2020

Doctoral Dissertations

Topological data analysis encompasses a broad set of ideas and techniques that address 1) how to rigorously define and summarize the shape of data, and 2) how to use these constructs for inference. This dissertation addresses the second problem by developing new inferential tools for topological data analysis and applying them to solve real-world data problems. First, a Bayesian framework to approximate probability distributions of persistence diagrams is established. The key insight underpinning this framework is that persistence diagrams may be viewed as Poisson point processes with prior intensities. With this assumption in hand, one may compute posterior intensities by adopting techniques …


Methods Of Uncertainty Quantification For Physical Parameters, Kellin Rumsey Jul 2020

Mathematics & Statistics ETDs

Uncertainty Quantification (UQ) is an umbrella term referring to a broad class of methods which typically involve the combination of computational modeling, experimental data and expert knowledge to study a physical system. A parameter, in the usual statistical sense, is said to be physical if it has a meaningful interpretation with respect to the physical system. Physical parameters can be viewed as inherent properties of a physical process and have a corresponding true value. Statistical inference for physical parameters is a challenging problem in UQ due to the inadequacy of the computer model. In this thesis, we provide a comprehensive …


Models For Data Analysis In Accelerated Reliability Growth, Cesar Alexander Ruiz Torres Jul 2020

Graduate Theses and Dissertations

This work develops new methodologies for analyzing accelerated testing data in the context of a reliability growth program for a complex multi-component system. Each component has multiple failure modes and the growth program consists of multiple test-fix stages with corrective actions applied at the end of each stage. The first group of methods considers time-to-failure data and test covariates for predicting the final reliability of the system. The time-to-failure of each failure mode is assumed to follow a Weibull distribution with rate parameter proportional to an acceleration factor. Acceleration factors are specific to each failure mode and test covariates. We …


Bayesian Zero-Inflated Model For Ordinal Data, Huizhong Yang Jul 2020

Theses and Dissertations

Datasets with a relatively large number of zeros are commonly seen in medical applications. Although models such as the zero-inflated Poisson (ZIP) model have been proposed for count data, there are still some issues with ordinal data that have excess zeros. In this paper, we developed a Bayesian approach to accommodate the excess zeros in ordinal data. Intellectual disability (ID), also known as mental retardation (MR), is a disability characterized by below-average intelligence or mental ability and a lack of the skills necessary for daily life. A person with intellectual disability has limitations in intellectual functioning and adaptive behavior. Intellectual disability is a …


Bayesian Analysis Of Binary Diagnostic Tests And Panel Count Data, Chunling Wang Apr 2020

Theses and Dissertations

This dissertation mainly explores several challenging topics that arise in diagnostic tests and panel count data in the Bayesian framework. Binary diagnostic tests, particularly multiple diagnostic tests with repeated measures and diagnostic procedures with a large number of raters, are studied. For panel count data, most traditional methods only handle panel count data for a single type of recurrent event. In this dissertation, we primarily focus on the case with multiple types of recurrent events.

In Chapter 1, binary diagnostic test data and panel count data are introduced, and the related literature is briefly reviewed. To …


Bayesian Approach To Finding The Most Likely Circuit Structure, Shannon Harms Jan 2020

Graduate Research Theses & Dissertations

Systems, and their reliabilities, depend on the reliabilities of the components that they are composed of, and in this paper we want to find the system structure that is the most likely given observed data. Bayesian methods were utilized in order to discover the posterior means, or observed reliabilities, of both the components and the systems. Assuming the serial and parallel system structures have independent components, we calculated system reliabilities based on observed component reliabilities by using the multiplication and addition probability rules. We are then able to expand upon the numerical comparison method through a maximum likelihood analysis that …
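
The multiplication and addition rules mentioned in this abstract can be sketched in a few lines of Python. The sketch assumes a conjugate Beta(a, b) prior on each component's reliability and pass/fail test data; the prior, the function names, and the numbers are illustrative assumptions, not details confirmed by the truncated abstract.

```python
from math import prod


def posterior_mean_reliability(successes, trials, a=1.0, b=1.0):
    """Posterior mean of a component reliability under an assumed Beta(a, b) prior."""
    return (a + successes) / (a + b + trials)


def series_reliability(component_reliabilities):
    """A series system works only if every independent component works."""
    return prod(component_reliabilities)


def parallel_reliability(component_reliabilities):
    """A parallel system fails only if every independent component fails."""
    return 1.0 - prod(1.0 - p for p in component_reliabilities)


# Hypothetical test data: (successes, trials) for two components.
components = [posterior_mean_reliability(8, 10), posterior_mean_reliability(17, 18)]
series = series_reliability(components)
parallel = parallel_reliability(components)
```

With a uniform Beta(1, 1) prior, 8 successes in 10 trials give a posterior mean of 9/12 = 0.75, and 17 in 18 give 18/20 = 0.9. Comparing the resulting series and parallel values with an observed system reliability is one simple way to judge which structure is more plausible.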


Applications Of Dynamic Linear Models To Random Allocation Models, Albert H. Lee III Jan 2020

Theses and Dissertations

Although advances in modern computational algorithms have given researchers the ability to work on problems that were once too computationally complex to solve, problems with high computational cost or large parameter spaces still remain. Time series problems can be among them. Chapter 1 looks at the use of Exponentially Weighted Moving Averages (EWMA) (Holt, 2004; Winters, 1960), which were thought to provide sufficient solutions to these time series problems. A discussion is provided which illustrates the shortcomings of the EWMA and how its infinite number of possible starting values provides the modeler with an endless number of possible solutions …


On Improving Performance Of The Binary Logistic Regression Classifier, Michael Chang Dec 2019

UNLV Theses, Dissertations, Professional Papers, and Capstones

Logistic regression, being both a predictive and an explanatory method, is one of the most commonly used statistical and machine learning methods in almost all disciplines. There are many situations, however, when the accuracy of the fitted model is low for predicting either the success event or the failure event. Several statistical and machine learning approaches exist in the literature to handle these situations. This thesis presents several new approaches to improve the performance of the fitted model, and the proposed methods have been applied to real datasets.

Transformation of predictors is a common approach in fitting multiple linear and …


Habitat Associations And Reproduction Of Fishes On The Northwestern Gulf Of Mexico Shelf Edge, Elizabeth Marie Keller Nov 2019

LSU Doctoral Dissertations

Several of the northwestern Gulf of Mexico (GOM) shelf-edge banks provide critical hard bottom habitat for coral and fish communities, supporting a wide diversity of ecologically and economically important species. These sites may be fish aggregation and spawning sites and provide important habitat for fish growth and reproduction. Already designated as habitat areas of particular concern, many of these banks are also under consideration for inclusion in the expansion of the Flower Garden Banks National Marine Sanctuary. This project aimed to gain a more comprehensive understanding of the communities and fish species on shelf-edge banks by way of gonad histology, …


Allocative Poisson Factorization For Computational Social Science, Aaron Schein Jul 2019

Doctoral Dissertations

Social science data often comes in the form of high-dimensional discrete data such as categorical survey responses, social interaction records, or text. These data sets exhibit high degrees of sparsity, missingness, overdispersion, and burstiness, all of which present challenges to traditional statistical modeling techniques. The framework of Poisson factorization (PF) has emerged in recent years as a natural way to model high-dimensional discrete data sets. This framework assumes that each observed count in a data set is a Poisson random variable $y \sim \mathrm{Pois}(\mu)$ whose rate parameter $\mu$ is a function of shared model parameters. This thesis examines a specific …
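
To make the Poisson factorization assumption concrete, the sketch below builds each rate as an inner product of non-negative row and column factors and evaluates the Poisson log-likelihood of a count matrix. The low-rank form of the rate is one common choice and is an assumption here, since the truncated abstract says only that the rate is a function of shared parameters; the names `theta`, `phi`, and the toy numbers are hypothetical.

```python
from math import lgamma, log


def poisson_rates(theta, phi):
    """Rate matrix mu[i][j] = sum_k theta[i][k] * phi[k][j] (assumed low-rank form)."""
    n_cols = len(phi[0])
    return [
        [sum(t * phi[k][j] for k, t in enumerate(row)) for j in range(n_cols)]
        for row in theta
    ]


def poisson_log_likelihood(counts, mu):
    """Sum of log Pois(y | mu) over all cells; every rate must be positive."""
    return sum(
        y * log(m) - m - lgamma(y + 1)
        for row_y, row_m in zip(counts, mu)
        for y, m in zip(row_y, row_m)
    )


# Hypothetical factors: one row entity, two latent components, two columns.
theta = [[1.0, 2.0]]
phi = [[0.5, 1.0], [1.5, 0.0]]
mu = poisson_rates(theta, phi)               # rates 3.5 and 1.0
loglik = poisson_log_likelihood([[0, 1]], mu)
```

Because the log-likelihood depends on the factors only through the rates, sparse count matrices are cheap to score: a zero count contributes only the negative of its rate.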


A Bayesian Framework For Estimating Seismic Wave Arrival Time, Hua Zhong May 2019

Graduate Theses and Dissertations

Because earthquakes have a large impact on human society, statistical methods for better studying earthquakes are required. One characteristic of earthquakes is the arrival time of seismic waves at a seismic signal sensor. Once we can estimate the earthquake arrival time accurately, the earthquake location can be triangulated, and assistance can be sent to that area correctly. This study presents a Bayesian framework to predict the arrival time of seismic waves with associated uncertainty. We use a change point framework to model the different conditions before and after the seismic wave arrives. To evaluate the performance of the model, we …


MLE And Bayesian Methods To Analyze Data With Missing Values Below The Limit Of Detection, Xinxin Hu Apr 2019

Theses and Dissertations

As pesticides are widely used in agriculture, more and more people who work at places like farms are exposed to them. According to environmental research [Villarejo, 2003; Reigart and Roberts, 1999], exposure to some kinds of pesticides, such as organophosphorus (OP) insecticides, has significantly affected the health of farmworkers and their families. The actual level of pesticides can currently be detected only with some limitations: it is hard to detect when the level is below the limit of detection (LOD). Therefore, the goal of our research is to propose several different methods to analyze data …


Exploring The Behavior Of Model Fit Criteria In The Bayesian Approximate Measurement Invariance: A Simulation Study, Abeer Atallah S. Alamri Feb 2019

USF Tampa Graduate Theses and Dissertations

Measurement invariance (MI) testing is conducted to ensure that differences found in the results of group comparisons are due to true substantive differences and not methodological artifacts. Previous cross-cultural and cross-national studies with a large number of groups showed that the advanced level of measurement invariance rarely held when utilizing the traditional (frequentist) MI approach. Bayesian approximate measurement invariance (BAMI) was introduced to relax the strict assumption of traditional MI by allowing trivial non-invariance in parameters across groups. Although the concept of the BAMI, which has been utilized since 2013, was incorporated into the context of structural equation modeling, there is …


Site- And Location-Adjusted Approaches To Adaptive Allocation Clinical Trial Designs, Brian S. Di Pace Jan 2019

Theses and Dissertations

Response-Adaptive (RA) designs are used to adaptively allocate patients in clinical trials. These methods have been generalized to include Covariate-Adjusted Response-Adaptive (CARA) designs, which adjust treatment assignments for a set of covariates while maintaining features of the RA designs. Challenges may arise in multi-center trials if differential treatment responses and/or effects among sites exist. We propose Site-Adjusted Response-Adaptive (SARA) approaches to account for inter-center variability in treatment response and/or effectiveness, including either a fixed site effect or both random site and treatment-by-site interaction effects to calculate conditional probabilities. These success probabilities are used to update assignment probabilities for allocating patients …


Bayesian Hierarchical Meta-Analysis Of Asymptomatic Ebola Seroprevalence, Peter Brody-Moore Jan 2019

CMC Senior Theses

The continued study of asymptomatic Ebolavirus infection is necessary to develop a more complete understanding of Ebola transmission dynamics. This paper conducts a meta-analysis of eight studies that measure seroprevalence (the number of subjects that test positive for anti-Ebolavirus antibodies in their blood) in subjects with household exposure or known case-contact with Ebola, but that have shown no symptoms. In our two random effects Bayesian hierarchical models, we find estimated seroprevalences of 8.76% and 9.72%, significantly higher than the 3.3% found by a previous meta-analysis of these eight studies. We also produce a variation of this meta-analysis where we exclude …


Exploring A Bayesian Analysis Of Opinion Dynamics Using The Approximate Bayesian Computation Method, Jessica L. Bishop Jan 2019

Graduate Research Theses & Dissertations

Social media has created a whole new framework for understanding how one expresses an opinion and how one's opinion can influence others. Models of opinion dynamics, such as a probabilistic modeling framework of opinion dynamics over time, are given by Abir De, Isabel Valera, Niloy Ganguly, Sourangshu Bhattacharya, and Manuel Gomez Rodriguez in "Learning and Forecasting Opinion Dynamics in Social Networks." In this paper, we will continue to explore their models, now from a Bayesian statistical standpoint, specifically looking at the Approximate Bayesian Computation (ABC) method for computing better estimates from the data. We will …