Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability

Theses/Dissertations

2024

Institution
Keyword
Publication

Articles 1 - 30 of 60

Full-Text Articles in Physical Sciences and Mathematics

Limit Theorems For L-Functions In Analytic Number Theory, Asher Roberts Sep 2024

Limit Theorems For L-Functions In Analytic Number Theory, Asher Roberts

Dissertations, Theses, and Capstone Projects

We use the method of Radziwill and Soundararajan to prove Selberg’s central limit theorem for the real part of the logarithm of the Riemann zeta function on the critical line in the multivariate case. This gives an alternate proof of a result of Bourgade. An upshot of the method is to determine a rate of convergence in the sense of the Dudley distance. This is the same rate Selberg claims using the Kolmogorov distance. We also achieve the same rate of convergence in the case of Dirichlet L-functions. Assuming the Riemann hypothesis, we improve the rate of convergence by using …


An Application Of An In-Depth Advanced Statistical Analysis In Exploring The Dynamics Of Depression, Sleep Deprivation, And Self-Esteem, Muslihat Gaffari Aug 2024

An Application Of An In-Depth Advanced Statistical Analysis In Exploring The Dynamics Of Depression, Sleep Deprivation, And Self-Esteem, Muslihat Gaffari

Electronic Theses and Dissertations

Depression, intertwined with sleep deprivation and self-esteem, presents a significant challenge to mental health worldwide. The research shown in this paper employs advanced statistical methodologies to unravel the complex interactions among these factors. Through log-linear homogeneous association, multinomial logistic regression, and generalized linear models, the study scrutinizes large datasets to uncover nuanced patterns and relationships. By elucidating how depression, sleep disturbances, and self-esteem intersect, the research aims to deepen understanding of mental health phenomena. The study clarifies the relationship between these variables and explores reasons for prioritizing depression research. It evaluates how statistical models, such as log-linear, multinomial logistic regression, …


Examining The Interaction Between Calcium Supplement Use, Demographics, And Lifestyle Factors On Bone Health Of Women, Vix J. Talbot Jun 2024

Examining The Interaction Between Calcium Supplement Use, Demographics, And Lifestyle Factors On Bone Health Of Women, Vix J. Talbot

University Honors Theses

Osteoporosis is a condition which poses a significant health threat, particularly among women during the menopause transition, where accelerated bone loss increases fracture risk. Calcium supplementation has been shown to be an important intervention to mitigate bone mineral density (BMD) decline during this and other periods of life. However, the efficacy of calcium supplementation is influenced by various individual factors, including demographics and lifestyle habits. This study investigates the interaction between calcium supplement use, and several interaction terms on bone health in women. Multiple linear regression analysis is employed to assess the impact of these factors on BMD. Data from …


On Limiting Distributions For Eigenvalue Spectra Of Sample Correlation Matrices From Heavy-Tailed Populations: Literature Review, Sachini Sandhareka Wijesundara Jun 2024

On Limiting Distributions For Eigenvalue Spectra Of Sample Correlation Matrices From Heavy-Tailed Populations: Literature Review, Sachini Sandhareka Wijesundara

Major Papers

This major paper offers an extensive review of literature concerning the limiting distributions for the eigenvalue spectrum of sample correlation matrices from a p-dimensional population, where both the dimension p and the sample size n grow to infinity. The study systematically categorizes the reviewed literature based on underlying assumptions regarding the data characteristics. Specifically, it examines several distinct cases: the independent and identically distributed (i.i.d) case with finite fourth moments, the i.i.d case with infinite fourth moments, the i.i.d case with infinite second moments, and scenarios where rows and columns of the data are linearly dependent. Additionally, the major paper …


Qwixx Strategies Using Simulation And Mcmc Methods, Joshua W. Blank Jun 2024

Qwixx Strategies Using Simulation And Mcmc Methods, Joshua W. Blank

Master's Theses

This study explores optimal strategies for maximizing scores and winning in the popular dice game Qwixx, analyzing both single and multiplayer gameplay scenarios. Through extensive simulations, various strategies were tested and compared, including a scorebased approach that uses a formula tuned by MCMC random walks, and race-to-lock approaches which use absorbing Markov chain qualities of individual score sheet rows to find ways to lock rows as quickly as possible. Results indicate that employing a scorebased strategy, considering gap, count, position, skip, and likelihood scores, significantly improves performance in single player games, while move restrictions based on specific dice roll sums …


Reports Of Autosomal Recessive Disease And Consanguineous Mating Within The Human Population, Johnathon L. Schluter May 2024

Reports Of Autosomal Recessive Disease And Consanguineous Mating Within The Human Population, Johnathon L. Schluter

Master's Theses

It is anecdotally evident when investigating published reports of autosomal recessive disease that a substantial number of cases are the result of related (consanguineous) mating. This research seeks to quantify the percent of manuscripts describing autosomal recessive diseases published between 2000 and 2020 in which consanguineous mating is indicated. We analyzed 602 peer-reviewed manuscripts to identify the percentage of cases presented in which consanguineous mating was indicated, the underlying genes (novel gene or new mutation) and geographical region. These papers were accessed through a specific set of parameters on the free access PubMed Central (PMC) database. A total of 552 …


Towards A New Role Of Mitochondrial Hydrogen Peroxide In Synaptic Function, Cliyahnelle Z. Alexander May 2024

Towards A New Role Of Mitochondrial Hydrogen Peroxide In Synaptic Function, Cliyahnelle Z. Alexander

Student Theses and Dissertations

Aerobic metabolism is known to generate damaging ROS, particularly hydrogen peroxide. Reactive oxygen species (ROS) are highly reactive molecules containing oxygen that have the potential to cause damage to cells and tissues in the body. ROS are highly reactive atoms or molecules that rapidly interact with other molecules within a cell. Intracellular accumulation can result in oxidative damage, dysfunction, and cell death. Due to the limitations of H2O2 (hydrogen peroxide) detectors, other impacts of ROS exposure may have been missed. HyPer7, a genetically encoded sensor, measures hydrogen peroxide emissions precisely and sensitively, even at sublethal levels, during …


Statistical Approaches For The Early Detection Of Colorectal Cancer Using Longitudinal Biomarkers, Emily Berry May 2024

Statistical Approaches For The Early Detection Of Colorectal Cancer Using Longitudinal Biomarkers, Emily Berry

Statistical Science Theses and Dissertations

Colorectal cancer (CRC) is the third leading cause of cancer-related death in the United States [45]. CRC is believed to advance from adenomatous polyps creating a unique opportunity for both early detection and cancer prevention [4, 23]. Like other diseases, CRC screening reduces mortality by detecting cancer at earlier, more treatable stages; however, it can also reduce incidence through the removal of precancerous lesions [4]. As a result, screening is recommended for average-risk adults ≥ 45 years of age and includes a variety of tests [4, 12]. Despite alternate screening options, colonoscopy capacity is often cited as a barrier to …


Markov Chain Model Of Three-Dimensional Daphnia Magna Movement, Helen L. Kafka May 2024

Markov Chain Model Of Three-Dimensional Daphnia Magna Movement, Helen L. Kafka

Theses and Dissertations

Daphnia magna make turns through an antennae-whipping action. This action occursevery few seconds, hence, during the intervening time, the animal either remains in place or continues movement roughly along its current course. We view their movement in three dimensions. We divide the movement in the three dimensions into the movement on a two-dimensional lattice and the movement between the different planes. For the movement on the lattice, we construct a second-order Markov chain model to make predictions about which region of the lattice the animal moves to based on where it was at the last two time points. The movement …


Utilizing Arma Models For Non-Independent Replications Of Point Processes, Lucas M. Fellmeth May 2024

Utilizing Arma Models For Non-Independent Replications Of Point Processes, Lucas M. Fellmeth

Theses and Dissertations

The use of a functional principal component analysis (FPCA) approach for estimatingintensity functions from prior work allows us to obtain component scores of replicated point processes under the assumption of independent replications. We show these component scores can be modeled using classical autoregressive moving average (ARMA) models, thus allowing us to also apply the FPCA model to non-independent replications. The Divvy bike-sharing system in the city of Chicago is showcased as an application.


Bayesian Change Point Detection In Segmented Multi-Group Autoregressive Moving-Average Data For The Study Of Covid-19 In Wisconsin, Russell Latterman May 2024

Bayesian Change Point Detection In Segmented Multi-Group Autoregressive Moving-Average Data For The Study Of Covid-19 In Wisconsin, Russell Latterman

Theses and Dissertations

Changepoint detection involves the discovery of abrupt fluctuations in population dynamics over time. We take a Bayesian approach to estimating points in time at which the parameters of an autoregressive moving average (ARMA) change, applying a Markov chain Monte Carlo method. We specifically assume that data may originate from one of two groups. We provide estimates of all multi-group parameters of a model of this form for both simulated and real-world data sets. We include a provision to resolve the problem of confounding ARMA parameter estimates and variance of segment data. We apply our model to identify points in time …


High-Dimensional Mediation Analysis Of Multi-Omics Data, Sunyi Chi May 2024

High-Dimensional Mediation Analysis Of Multi-Omics Data, Sunyi Chi

Dissertations & Theses (Open Access)

Environmental exposures such as cigarette smoking influence health outcomes through intermediate molecular phenotypes, such as the methylome, transcriptome, and metabolome. Mediation analysis is a useful tool for investigating the role of potentially high-dimensional intermediate phenotypes in the relationship between environmental exposures and health outcomes. Rapid development of high-throughput technologies have made mediation analysis of multi-omics data critical to gain groundbreaking insights into the biological mechanisms underlying the disease etiology. This dissertation aims to develop mediation analysis methods that utilize the enormous amount of multi-omics data in assessing mechanisms of disease etiology. It contains three projects where I propose advanced mediation …


Code For Care: Hypertension Prediction In Women Aged 18-39 Years, Kruti Sheth May 2024

Code For Care: Hypertension Prediction In Women Aged 18-39 Years, Kruti Sheth

Electronic Theses, Projects, and Dissertations

The longstanding prevalence of hypertension, often undiagnosed, poses significant risks of severe chronic and cardiovascular complications if left untreated. This study investigated the causes and underlying risks of hypertension in females aged between 18-39 years. The research questions were: (Q1.) What factors affect the occurrence of hypertension in females aged 18-39 years? (Q2.) What machine learning algorithms are suited for effectively predicting hypertension? (Q3.) How can SHAP values be leveraged to analyze the factors from model outputs? The findings are: (Q1.) Performing Feature selection using binary classification Logistic regression algorithm reveals an array of 30 most influential factors at an …


Factors Predictive Of The Development Of Surgical Site Infection In Thyroidectomy, A Replication Study Of Myssiorek (2018), Kaitlyn M. Kenig May 2024

Factors Predictive Of The Development Of Surgical Site Infection In Thyroidectomy, A Replication Study Of Myssiorek (2018), Kaitlyn M. Kenig

Capstone Experience

The original study aimed to show that thyroidectomy does not result in surgical site infection (SSI) in most cases, and thus routine prescription of antibiotics is not necessary. The study looked to see what risk factors could predict the incidence of SSI. This would highlight those individuals who were at most risk of developing SSI, and then antibiotics would only be prescribed to these individuals instead of all or most individuals who undergo thyroidectomy.

This study used NSQIP data to look at incidence of SSI and look for risk factors that may be predictive of SSI. Only surgeries that were …


Using The History Of Statistics To Teach Introductory Statistics, Melissa Hansen May 2024

Using The History Of Statistics To Teach Introductory Statistics, Melissa Hansen

All Graduate Reports and Creative Projects, Fall 2023 to Present

While often taught in high school and required as part of a college degree, statistics classes are sometimes viewed by students as an obstacle rather than a support for their overall goals. One way to increase student engagement in a statistics course is to use the history of statistics. Within the literature review, the advantages to using the history of statistics are discussed as well as the more extensive research on using the history of mathematics in mathematics courses. Included are instructional strategies for using the context around the development of mathematical ideas in math classrooms which can be extended …


The Quantitative Analysis And Visualization Of Nfl Passing Routes, Sandeep Chitturi May 2024

The Quantitative Analysis And Visualization Of Nfl Passing Routes, Sandeep Chitturi

Computer Science and Computer Engineering Undergraduate Honors Theses

The strategic planning of offensive passing plays in the NFL incorporates numerous variables, including defensive coverages, player positioning, historical data, etc. This project develops an application using an analytical framework and an interactive model to simulate and visualize an NFL offense's passing strategy under varying conditions. Using R-programming and data management, the model dynamically represents potential passing routes in response to different defensive schemes. The system architecture integrates data from historical NFL league years to generate quantified route scores through designed mathematical equations. This allows for the prediction of potential passing routes for offensive skill players in response to the …


Exploring Application Of The Coordinate Exchange To Generate Optimal Designs Robust To Data Loss, Asher Hanson May 2024

Exploring Application Of The Coordinate Exchange To Generate Optimal Designs Robust To Data Loss, Asher Hanson

All Graduate Theses and Dissertations, Fall 2023 to Present

The primary objective of this study is to evaluate the efficacy of the coordinate exchange (CEXCH) algorithm in the generation of robust optimal designs. The assessment involves a comparative analysis, wherein designs produced by the Point Exchange (PEXCH) Algorithm are employed as benchmarks for evaluating the efficiency of CEXCH designs. Three modified criteria, selected from the traditional alphabet criteria pool, are utilized to score each algorithm. To enhance the reliability of the comparative analysis, multiple rounds of validation are conducted, focusing on visual assessments, design scores, and criteria efficiencies. The findings from each round of validation contribute to a comprehensive …


Exploring Optimal Design Of Experiments For Random Effects Models, Ryan C. Bushman May 2024

Exploring Optimal Design Of Experiments For Random Effects Models, Ryan C. Bushman

All Graduate Theses and Dissertations, Fall 2023 to Present

The majority of research in the field of optimal design of experiments has focused on producing designs for fixed effects models. The purpose of this thesis is to explore how the optimal design framework applies to nested random effects models. The object that is being optimized is the model information matrix. We explore the full derivation of the random effects information matrix to highlight the complexity of the problem and show how the optimization is a function of the model's parameters. In conjunction with this research, the ODVC (Optimal Design for Variance Components) package was built to provide tools that …


On The Existence Of Periodic Traveling-Wave Solutions To Certain Systems Of Nonlinear, Dispersive Wave Equations, Jacob Daniels May 2024

On The Existence Of Periodic Traveling-Wave Solutions To Certain Systems Of Nonlinear, Dispersive Wave Equations, Jacob Daniels

All Graduate Theses and Dissertations, Fall 2023 to Present

A variety of physical phenomena can be modeled by systems of nonlinear, dispersive wave equations. Such examples include the propagation of a wave through a canal, deep ocean waves with small amplitude and long wavelength, and even the propagation of long-crested waves on the surface of lakes. An important task in the study of water wave equations is to determine whether a solution exists. This thesis aims to determine whether there exists solutions that both travel at a constant speed and are periodic for several systems of water wave equations. The work done in this thesis contributes to the subfields …


Ianova: Multi-Sample Means Comparisons For Imprecise Interval Data, Zachary Rios May 2024

Ianova: Multi-Sample Means Comparisons For Imprecise Interval Data, Zachary Rios

All Graduate Theses and Dissertations, Fall 2023 to Present

In recent years, interval data has become an increasingly popular tool to solve modern data problems. Intervals are now often used for dimensionality reduction, data aggregation, privacy censorship, and quantifying awareness of various uncertainties. Among many statistical methods that are being studied and developed for interval data, the significance test is particularly of importance due to its fundamental value both in theory and practice. The difficulty in developing such tests mainly lies in the fact that the concept of normality does not extend naturally to interval data (due the range of an interval being necessarily non-negative), causing the exact tests …


Assessing Extant Methods For Generating G-Optimal Designs And A Novel Methodology To Compute The G-Score Of A Candidate Design, Hyrum John Hansen May 2024

Assessing Extant Methods For Generating G-Optimal Designs And A Novel Methodology To Compute The G-Score Of A Candidate Design, Hyrum John Hansen

All Graduate Theses and Dissertations, Fall 2023 to Present

Experimental designs are used by scientists to allocate treatments such that statistical inference is appropriate. Most traditional experimental designs have mathematical properties that make them desirable under certain conditions. Optimal experimental designs are those where the researcher can exercise total control over the treatment levels to maximize a chosen mathematical property. As is common in literature, the experimental design is represented as a matrix where each column represents a variable, and each row represents a trial. We define a function that takes as input the design matrix and outputs its score. We then algorithmically adjust each entry until a design …


A Comprehensive Uncertainty Quantification Methodology For Metrology Calibration And Method Comparison Problems Via Numeric Solutions To Maximum Likelihood Estimation And Parametric Bootstrapping, Aloka B. S. N. Dayarathne May 2024

A Comprehensive Uncertainty Quantification Methodology For Metrology Calibration And Method Comparison Problems Via Numeric Solutions To Maximum Likelihood Estimation And Parametric Bootstrapping, Aloka B. S. N. Dayarathne

All Graduate Theses and Dissertations, Fall 2023 to Present

In metrology, the science of measurements, straight line calibration models are frequently employed. These models help understand the instrumental response to an analyte, whose chemical constituents are unknown, and predict the analyte’s concentration in a sample. Techniques such as ordinary least squares and generalized least squares are commonly used to fit these calibration curves. However, these methods may yield biased estimates of slope and intercept when the calibrant, substance used to calibrate an analytical procedure with known chemical constituents (x-values), carries uncertainty. To address this, Ripley and Thompson (1987) proposed functional relationship estimation by maximum likelihood (FREML), which considers uncertainties …


Comparing North American Professional Sports League Season Formats Using Monte Carlo Simulation, Lathan Gregg May 2024

Comparing North American Professional Sports League Season Formats Using Monte Carlo Simulation, Lathan Gregg

Industrial Engineering Undergraduate Honors Theses

Each NFL, NBA, and MLB season consists of a regular season, in which teams play a set number of scheduled games and a playoff, in which qualifying teams compete for a championship. At the conclusion of each season, teams are ranked based on their performance throughout the season. This study aims to investigate the ability of each league's season format to accurately rank teams using Monte Carlo simulation. Matches between two teams are simulated by using the team’s assigned strength ranks to calculate a winning probability for each team. The winning probabilities are simulated with different skill values, dictating how …


Information Based Approach For Detecting Change Points In Inverse Gaussian Model With Applications, Alexis Anne Wallace May 2024

Information Based Approach For Detecting Change Points In Inverse Gaussian Model With Applications, Alexis Anne Wallace

Electronic Theses, Projects, and Dissertations

Change point analysis is a method used to estimate the time point at which a change in the mean or variance of data occurs. It is widely used as changes appear in various datasets such as the stock market, temperature, and quality control, allowing statisticians to take appropriate measures to mitigate financial losses, operational disruptions, or other adverse impacts. In this thesis, we develop a change point detection procedure in the Inverse Gaussian (IG) model using the Modified Information Criterion (MIC). The IG distribution, originating as the distribution of the first passage time of Brownian motion with positive drift, offers …


Cost-Risk Analysis Of The Ercot Region Using Modern Portfolio Theory, Megan Sickinger May 2024

Cost-Risk Analysis Of The Ercot Region Using Modern Portfolio Theory, Megan Sickinger

Master's Theses

In this work, we study the use of modern portfolio theory in a cost-risk analysis of the Electric Reliability Council of Texas (ERCOT). Based upon the risk-return concepts of modern portfolio theory, we develop an n-asset minimization problem to create a risk-cost frontier of portfolios of technologies within the ERCOT electricity region. The levelized cost of electricity for each technology in the region is a step in evaluating the expected cost of the portfolio, and the historical data of cost factors estimate the variance of cost for each technology. In addition, there are several constraints in our minimization problem to …


Selected Topics On Sequential Designs For Decision Making, Caroline Kerfonta May 2024

Selected Topics On Sequential Designs For Decision Making, Caroline Kerfonta

All Dissertations

This dissertation is comprised of three parts. The first proposes a sequential approach to determine the experimental setting with the minimum variance (Kerfonta et al., 2024). Two acquisition functions are developed to assist developing the approach. Theoretical results along with a case study using data from crystallization experiments is conducted to show the ability of the proposed method to correctly select the experiment with the minimum variance. The second and third parts propose adaptations to the Bayesian optimization algorithm using transformed additive Gaussian processes (TAG) as the surrogate model. The goal of using the TAG framework is to decompose the …


Efficient Fully Bayesian Approaches To Brain Activity Mapping With Complex-Valued Fmri Data: Analysis Of Real And Imaginary Components In A Cartesian Model And Extension To Magnitude And Phase In A Polar Model, Zhengxin Wang May 2024

Efficient Fully Bayesian Approaches To Brain Activity Mapping With Complex-Valued Fmri Data: Analysis Of Real And Imaginary Components In A Cartesian Model And Extension To Magnitude And Phase In A Polar Model, Zhengxin Wang

All Dissertations

Functional magnetic resonance imaging (fMRI) plays a crucial role in neuroimaging, enabling the exploration of brain activity through complex-valued signals. Traditional fMRI analyses have largely focused on magnitude information, often overlooking the potential insights offered by phase data, and therefore, lead to underutilization of available data and flawed statistical assumptions. This dissertation proposes two efficient, fully Bayesian approaches for the analysis of complex-valued functional magnetic resonance imaging (cv-fMRI) time series.

Chapter 2 introduces the model, referred to as CV-sSGLMM, using the real and imaginary components of cv-fMRI data and sparse spatial generalized linear mixed model prior. This model extends the …


A Novel Correction For The Multivariate Ljung-Box Test, Minhao Huang May 2024

A Novel Correction For The Multivariate Ljung-Box Test, Minhao Huang

Computational and Data Sciences (PhD) Dissertations

This research introduces an analytical improvement to the Multivariate Ljung-Box test that addresses significant deviations of the original test from the nominal Type I error rates under almost all scenarios. Prior attempts to mitigate this issue have been directed at modification of the test statistics or correction of the test distribution to achieve precise results in finite samples. In previous studies, focused on designing corrections to the univariate Ljung-Box, a method that specifically adjusts the test rejection region has been the most successful of attaining the best Type I error rates. We adopt the same approach for the more complex, …


Modeling Prices In Limit Order Book Using Univariate Hawkes Point Process, Wenqing Jiang May 2024

Modeling Prices In Limit Order Book Using Univariate Hawkes Point Process, Wenqing Jiang

University of New Orleans Theses and Dissertations

This thesis presents a time-changed geometric Brownian price model with the univariate Hawkes processes to trace the price changes in a limit order book. Limit order books are the core mechanism for trading in modern financial markets, continuously collecting outstanding buy and sell orders from market participants. The arrival of orders causes fluctuations in prices over time. A Hawkes process is a type of point process that exhibits self-exciting behavior, where the occurrence of one event increases the probability of other events happening in the near future. This makes Hawkes processes well-suited for capturing the clustered arrival patterns of orders …


Stability Of Quantum Computers, Samudra Dasgupta May 2024

Stability Of Quantum Computers, Samudra Dasgupta

Doctoral Dissertations

Quantum computing's potential is immense, promising super-polynomial reductions in execution time, energy use, and memory requirements compared to classical computers. This technology has the power to revolutionize scientific applications such as simulating many-body quantum systems for molecular structure understanding, factorization of large integers, enhance machine learning, and in the process, disrupt industries like telecommunications, material science, pharmaceuticals and artificial intelligence. However, quantum computing's potential is curtailed by noise, further complicated by non-stationary noise parameter distributions across time and qubits. This dissertation focuses on the persistent issue of noise in quantum computing, particularly non-stationarity of noise parameters in transmon processors. It …