Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Statistics and Probability

Theses/Dissertations

Articles 1 - 30 of 3697

Full-Text Articles in Entire DC Network

Towards A New Role Of Mitochondrial Hydrogen Peroxide In Synaptic Function, Cliyahnelle Z. Alexander May 2024

Student Theses and Dissertations

Aerobic metabolism is known to generate damaging reactive oxygen species (ROS), particularly hydrogen peroxide (H2O2). ROS are highly reactive oxygen-containing atoms or molecules that rapidly interact with other molecules within a cell and can damage cells and tissues in the body. Their intracellular accumulation can result in oxidative damage, dysfunction, and cell death. Due to the limitations of H2O2 detectors, other impacts of ROS exposure may have been missed. HyPer7, a genetically encoded sensor, measures hydrogen peroxide emissions precisely and sensitively, even at sublethal levels, during …


A Novel Correction For The Multivariate Ljung-Box Test, Minhao Huang May 2024

Computational and Data Sciences (PhD) Dissertations

This research introduces an analytical improvement to the Multivariate Ljung-Box test that addresses significant deviations of the original test from the nominal Type I error rates under almost all scenarios. Prior attempts to mitigate this issue have been directed at modification of the test statistics or correction of the test distribution to achieve precise results in finite samples. In previous studies focused on designing corrections to the univariate Ljung-Box test, a method that specifically adjusts the test's rejection region has been the most successful at attaining the best Type I error rates. We adopt the same approach for the more complex, …
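For orientation, the standard univariate Ljung-Box statistic that such corrections start from is Q = n(n+2) Σ_{k=1..h} r_k² / (n − k), where r_k is the lag-k sample autocorrelation and h is the number of lags tested. The sketch below computes only that textbook statistic; it is not the correction proposed in the dissertation, and the function name `ljung_box_q` is an illustrative assumption.

```python
def ljung_box_q(series, max_lag):
    """Textbook univariate Ljung-Box Q statistic (illustrative helper name)."""
    n = len(series)
    if not 0 < max_lag < n:
        raise ValueError("max_lag must satisfy 0 < max_lag < len(series)")

    # Center the series; the denominator is the lag-0 sum of squares.
    mean = sum(series) / n
    centered = [x - mean for x in series]
    denom = sum(c * c for c in centered)

    total = 0.0
    for k in range(1, max_lag + 1):
        # Lag-k sample autocorrelation r_k.
        num = sum(centered[t] * centered[t - k] for t in range(k, n))
        r_k = num / denom
        total += r_k ** 2 / (n - k)

    return n * (n + 2) * total


if __name__ == "__main__":
    print(ljung_box_q([1.0, 2.0, 3.0, 4.0], max_lag=1))
```

Under the null hypothesis of no autocorrelation, Q is conventionally compared with a chi-square reference distribution; the finite-sample inaccuracy of that comparison is the kind of problem the corrections described above target.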


High-Dimensional Mediation Analysis Of Multi-Omics Data, Sunyi Chi May 2024

Dissertations & Theses (Open Access)

Environmental exposures such as cigarette smoking influence health outcomes through intermediate molecular phenotypes, such as the methylome, transcriptome, and metabolome. Mediation analysis is a useful tool for investigating the role of potentially high-dimensional intermediate phenotypes in the relationship between environmental exposures and health outcomes. Rapid development of high-throughput technologies has made mediation analysis of multi-omics data critical to gaining groundbreaking insights into the biological mechanisms underlying disease etiology. This dissertation aims to develop mediation analysis methods that utilize the enormous amount of multi-omics data in assessing mechanisms of disease etiology. It contains three projects where I propose advanced mediation …


Code For Care: Hypertension Prediction In Women Aged 18-39 Years, Kruti Sheth May 2024

Electronic Theses, Projects, and Dissertations

The longstanding prevalence of hypertension, often undiagnosed, poses significant risks of severe chronic and cardiovascular complications if left untreated. This study investigated the causes and underlying risks of hypertension in females aged 18-39 years. The research questions were: (Q1.) What factors affect the occurrence of hypertension in females aged 18-39 years? (Q2.) What machine learning algorithms are suited for effectively predicting hypertension? (Q3.) How can SHAP values be leveraged to analyze the factors from model outputs? The findings are: (Q1.) Performing feature selection using a binary-classification logistic regression algorithm reveals an array of the 30 most influential factors at an …


Factors Predictive Of The Development Of Surgical Site Infection In Thyroidectomy, A Replication Study Of Myssiorek (2018), Kaitlyn M. Kenig May 2024

Capstone Experience

The original study aimed to show that thyroidectomy does not result in surgical site infection (SSI) in most cases, and thus routine prescription of antibiotics is not necessary. The study looked to see what risk factors could predict the incidence of SSI. This would highlight those individuals who were at greatest risk of developing SSI, so that antibiotics would be prescribed only to these individuals instead of all or most individuals who undergo thyroidectomy.

This study used NSQIP data to look at incidence of SSI and look for risk factors that may be predictive of SSI. Only surgeries that were …


Using The History Of Statistics To Teach Introductory Statistics, Melissa Hansen May 2024

All Graduate Reports and Creative Projects, Fall 2023 to Present

While often taught in high school and required as part of a college degree, statistics classes are sometimes viewed by students as an obstacle rather than a support for their overall goals. One way to increase student engagement in a statistics course is to use the history of statistics. Within the literature review, the advantages to using the history of statistics are discussed as well as the more extensive research on using the history of mathematics in mathematics courses. Included are instructional strategies for using the context around the development of mathematical ideas in math classrooms which can be extended …


The Quantitative Analysis And Visualization Of NFL Passing Routes, Sandeep Chitturi May 2024

Computer Science and Computer Engineering Undergraduate Honors Theses

The strategic planning of offensive passing plays in the NFL incorporates numerous variables, including defensive coverages, player positioning, and historical data. This project develops an application using an analytical framework and an interactive model to simulate and visualize an NFL offense's passing strategy under varying conditions. Using R programming and data management, the model dynamically represents potential passing routes in response to different defensive schemes. The system architecture integrates data from historical NFL league years to generate quantified route scores through designed mathematical equations. This allows for the prediction of potential passing routes for offensive skill players in response to the …


Exploring Application Of The Coordinate Exchange To Generate Optimal Designs Robust To Data Loss, Asher Hanson May 2024

All Graduate Theses and Dissertations, Fall 2023 to Present

The primary objective of this study is to evaluate the efficacy of the coordinate exchange (CEXCH) algorithm in the generation of robust optimal designs. The assessment involves a comparative analysis, wherein designs produced by the Point Exchange (PEXCH) Algorithm are employed as benchmarks for evaluating the efficiency of CEXCH designs. Three modified criteria, selected from the traditional alphabet criteria pool, are utilized to score each algorithm. To enhance the reliability of the comparative analysis, multiple rounds of validation are conducted, focusing on visual assessments, design scores, and criteria efficiencies. The findings from each round of validation contribute to a comprehensive …


Comparing North American Professional Sports League Season Formats Using Monte Carlo Simulation, Lathan Gregg May 2024

Industrial Engineering Undergraduate Honors Theses

Each NFL, NBA, and MLB season consists of a regular season, in which teams play a set number of scheduled games, and a playoff, in which qualifying teams compete for a championship. At the conclusion of each season, teams are ranked based on their performance throughout the season. This study aims to investigate the ability of each league's season format to accurately rank teams using Monte Carlo simulation. Matches between two teams are simulated by using the teams' assigned strength ranks to calculate a winning probability for each team. The winning probabilities are simulated with different skill values, dictating how …
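The simulation idea described above can be sketched in a few lines of Python. The abstract does not give the formula that maps strengths to winning probabilities, so the Bradley-Terry-style ratio below, the `skill` exponent, and the function names `win_probability` and `simulate_round_robin` are all assumptions made for illustration, not the thesis's model.

```python
import random


def win_probability(strength_a, strength_b, skill=1.0):
    """Assumed Bradley-Terry-style chance that team A beats team B.

    Strengths must be positive. skill=0 makes every game a coin flip;
    larger values make the stronger team win more often.
    """
    a = strength_a ** skill
    b = strength_b ** skill
    return a / (a + b)


def simulate_round_robin(strengths, games_per_pair, skill=1.0, seed=0):
    """Simulate a round robin and return a dict of win counts per team."""
    rng = random.Random(seed)  # seeded so runs are reproducible
    teams = list(strengths)
    wins = {team: 0 for team in teams}
    for i, team_a in enumerate(teams):
        for team_b in teams[i + 1:]:
            p = win_probability(strengths[team_a], strengths[team_b], skill)
            for _ in range(games_per_pair):
                winner = team_a if rng.random() < p else team_b
                wins[winner] += 1
    return wins


if __name__ == "__main__":
    # Hypothetical team strengths, not taken from the thesis.
    table = simulate_round_robin({"A": 3.0, "B": 2.0, "C": 1.0}, 10, skill=2.0)
    print(sorted(table, key=table.get, reverse=True))
```

Repeating such a simulated season many times and comparing the resulting standings with the known strength order is one way to measure how reliably a format ranks teams.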


Information Based Approach For Detecting Change Points In Inverse Gaussian Model With Applications, Alexis Anne Wallace May 2024

Electronic Theses, Projects, and Dissertations

Change point analysis is a method used to estimate the time point at which a change in the mean or variance of data occurs. It is widely used as changes appear in various datasets such as the stock market, temperature, and quality control, allowing statisticians to take appropriate measures to mitigate financial losses, operational disruptions, or other adverse impacts. In this thesis, we develop a change point detection procedure in the Inverse Gaussian (IG) model using the Modified Information Criterion (MIC). The IG distribution, originating as the distribution of the first passage time of Brownian motion with positive drift, offers …


Cost-Risk Analysis Of The ERCOT Region Using Modern Portfolio Theory, Megan Sickinger May 2024

Master's Theses

In this work, we study the use of modern portfolio theory in a cost-risk analysis of the Electric Reliability Council of Texas (ERCOT). Based upon the risk-return concepts of modern portfolio theory, we develop an n-asset minimization problem to create a risk-cost frontier of portfolios of technologies within the ERCOT electricity region. The levelized cost of electricity for each technology in the region is a step in evaluating the expected cost of the portfolio, and the historical data of cost factors estimate the variance of cost for each technology. In addition, there are several constraints in our minimization problem to …
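As a rough illustration of the risk-cost frontier idea, the sketch below treats the two-technology special case of the n-asset problem: for a share w in technology A and 1 − w in technology B, the expected cost is w·c_A + (1 − w)·c_B and the cost variance is w²σ_A² + (1 − w)²σ_B² + 2w(1 − w)σ_AB. The function names and all numbers are hypothetical, and the thesis's additional constraints are not modeled.

```python
def portfolio_cost_and_variance(w, cost_a, cost_b, var_a, var_b, cov_ab):
    """Expected cost and cost variance for share w in A and 1 - w in B."""
    expected_cost = w * cost_a + (1 - w) * cost_b
    variance = (w * w * var_a
                + (1 - w) ** 2 * var_b
                + 2 * w * (1 - w) * cov_ab)
    return expected_cost, variance


def min_variance_weight(var_a, var_b, cov_ab):
    """Share of A that minimizes variance (unconstrained two-asset case)."""
    return (var_b - cov_ab) / (var_a + var_b - 2 * cov_ab)


def frontier(cost_a, cost_b, var_a, var_b, cov_ab, steps=10):
    """Sweep w from 0 to 1 and return (w, expected cost, variance) tuples."""
    points = []
    for i in range(steps + 1):
        w = i / steps
        cost, var = portfolio_cost_and_variance(
            w, cost_a, cost_b, var_a, var_b, cov_ab)
        points.append((w, cost, var))
    return points


if __name__ == "__main__":
    # Hypothetical costs and variances for two technologies.
    for w, cost, var in frontier(10.0, 20.0, 1.0, 3.0, 0.0, steps=4):
        print(f"w={w:.2f} cost={cost:.2f} variance={var:.3f}")
```

With more than two technologies the same objective becomes a quadratic program over the full covariance matrix, which is typically handed to a numerical solver rather than solved in closed form.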


Selected Topics On Sequential Designs For Decision Making, Caroline Kerfonta May 2024

All Dissertations

This dissertation comprises three parts. The first proposes a sequential approach to determine the experimental setting with the minimum variance (Kerfonta et al., 2024). Two acquisition functions are developed to assist in developing the approach. Theoretical results, along with a case study using data from crystallization experiments, show the ability of the proposed method to correctly select the experiment with the minimum variance. The second and third parts propose adaptations to the Bayesian optimization algorithm using transformed additive Gaussian processes (TAG) as the surrogate model. The goal of using the TAG framework is to decompose the …


Efficient Fully Bayesian Approaches To Brain Activity Mapping With Complex-Valued fMRI Data: Analysis Of Real And Imaginary Components In A Cartesian Model And Extension To Magnitude And Phase In A Polar Model, Zhengxin Wang May 2024

All Dissertations

Functional magnetic resonance imaging (fMRI) plays a crucial role in neuroimaging, enabling the exploration of brain activity through complex-valued signals. Traditional fMRI analyses have largely focused on magnitude information, often overlooking the potential insights offered by phase data, which leads to underutilization of available data and flawed statistical assumptions. This dissertation proposes two efficient, fully Bayesian approaches for the analysis of complex-valued functional magnetic resonance imaging (cv-fMRI) time series.

Chapter 2 introduces the model, referred to as CV-sSGLMM, using the real and imaginary components of cv-fMRI data and a sparse spatial generalized linear mixed model prior. This model extends the …


Exploring Optimal Design Of Experiments For Random Effects Models, Ryan C. Bushman May 2024

All Graduate Theses and Dissertations, Fall 2023 to Present

The majority of research in the field of optimal design of experiments has focused on producing designs for fixed effects models. The purpose of this thesis is to explore how the optimal design framework applies to nested random effects models. The object that is being optimized is the model information matrix. We explore the full derivation of the random effects information matrix to highlight the complexity of the problem and show how the optimization is a function of the model's parameters. In conjunction with this research, the ODVC (Optimal Design for Variance Components) package was built to provide tools that …


On The Existence Of Periodic Traveling-Wave Solutions To Certain Systems Of Nonlinear, Dispersive Wave Equations, Jacob Daniels May 2024

All Graduate Theses and Dissertations, Fall 2023 to Present

A variety of physical phenomena can be modeled by systems of nonlinear, dispersive wave equations. Examples include the propagation of a wave through a canal, deep ocean waves with small amplitude and long wavelength, and even the propagation of long-crested waves on the surface of lakes. An important task in the study of water wave equations is to determine whether a solution exists. This thesis aims to determine whether there exist solutions that both travel at a constant speed and are periodic for several systems of water wave equations. The work done in this thesis contributes to the subfields …


Ianova: Multi-Sample Means Comparisons For Imprecise Interval Data, Zachary Rios May 2024

All Graduate Theses and Dissertations, Fall 2023 to Present

In recent years, interval data has become an increasingly popular tool to solve modern data problems. Intervals are now often used for dimensionality reduction, data aggregation, privacy censorship, and quantifying awareness of various uncertainties. Among many statistical methods that are being studied and developed for interval data, the significance test is of particular importance due to its fundamental value both in theory and practice. The difficulty in developing such tests mainly lies in the fact that the concept of normality does not extend naturally to interval data (due to the range of an interval being necessarily non-negative), causing the exact tests …


Assessing Extant Methods For Generating G-Optimal Designs And A Novel Methodology To Compute The G-Score Of A Candidate Design, Hyrum John Hansen May 2024

All Graduate Theses and Dissertations, Fall 2023 to Present

Experimental designs are used by scientists to allocate treatments such that statistical inference is appropriate. Most traditional experimental designs have mathematical properties that make them desirable under certain conditions. Optimal experimental designs are those where the researcher can exercise total control over the treatment levels to maximize a chosen mathematical property. As is common in the literature, the experimental design is represented as a matrix where each column represents a variable, and each row represents a trial. We define a function that takes as input the design matrix and outputs its score. We then algorithmically adjust each entry until a design …


A Comprehensive Uncertainty Quantification Methodology For Metrology Calibration And Method Comparison Problems Via Numeric Solutions To Maximum Likelihood Estimation And Parametric Bootstrapping, Aloka B. S. N. Dayarathne May 2024

All Graduate Theses and Dissertations, Fall 2023 to Present

In metrology, the science of measurements, straight line calibration models are frequently employed. These models help understand the instrumental response to an analyte, whose chemical constituents are unknown, and predict the analyte’s concentration in a sample. Techniques such as ordinary least squares and generalized least squares are commonly used to fit these calibration curves. However, these methods may yield biased estimates of slope and intercept when the calibrant, the substance used to calibrate an analytical procedure with known chemical constituents (x-values), carries uncertainty. To address this, Ripley and Thompson (1987) proposed functional relationship estimation by maximum likelihood (FREML), which considers uncertainties …


A Survey Of The Murray State University CSIS Department Of Student And Instructor Attitudes In Relation To Earlier Introduction Of Version Control Systems, Gavin Johnson Apr 2024

Honors College Theses

Over the previous 20 years, the software development industry has overseen an evolution in the application of Version Control Systems (VCS) from a Centralized Version Control System (CVCS) format to a Decentralized Version Control System (DVCS) format. Examples of the former include Perforce and Subversion, while examples of the latter include GitHub and Bitbucket. As DVCS models allow software contributors to maintain their respective local repositories of relevant code bases, developers are able to work offline and maintain their work with relative fault tolerance. This contrasts with CVCS models, which require software contributors to be connected online to a main server. …


Assessment Of Method Effects Of Keying And Wording In Instruments: A Mixed-Methods Explanatory Sequential Study, Lin Ma Mar 2024

Electronic Theses and Dissertations

This dissertation presents an innovative approach to examining the keying method, wording method, and construct validity on psychometric instruments. By employing a mixed methods explanatory sequential design, the effects of keying and wording in two psychometric assessments were examined and validated. Those two self-report psychometric assessments were the Effortful Control assessment (Ellis & Rothbart, 2001) and the Grit assessment (Duckworth & Quinn, 2009). Moreover, the quantitative phase utilized structural equation modeling to analyze 2,104 students’ responses and assess the construct of keying and wording. Various hypothetical models were investigated and evaluated. The reliability of each construct in each method was …


Modeling Of COVID-19 Clinical Outcomes In Mexico: An Analysis Of Demographic, Clinical, And Chronic Disease Factors, Livia Clarete Feb 2024

Dissertations, Theses, and Capstone Projects

This study explores COVID-19 clinical outcomes in Mexico, focusing on demographic, clinical, and chronic disease variables to develop predictive models. In the binary classification task, the Ada Boost Classifier distinguishes survivors from non-survivors, with age, sex, ethnicity, and chronic medical conditions influencing outcomes. In multiclass classification, the Gradient Boosting Classifier categorizes patients into outcome groups.

Demographic variables, especially age, are crucial for predicting COVID-19 outcomes for both the binary and multiclass classification tasks. Clinical information about previous conditions, including chronic diseases, also holds relevance, especially diabetes, immunocompromise, and cardiovascular diseases. These insights inform public health measures and healthcare strategies, emphasizing …


A Causal Inference Approach For Spike Train Interactions, Zach Saccomano Feb 2024

Dissertations, Theses, and Capstone Projects

Since the 1960s, neuroscientists have worked on the problem of estimating synaptic properties, such as connectivity and strength, from simultaneously recorded spike trains. Recent years have seen renewed interest in the problem, coinciding with rapid advances in experimental technologies, including an approximately exponential increase in the number of neurons that can be recorded in parallel and perturbation techniques such as optogenetics that can be used to calibrate and validate causal hypotheses about functional connectivity. This thesis presents a mathematical examination of synaptic inference from two perspectives: (1) using in vivo data and biophysical models, we ask in what cases the …


Making Sense Of Making Parole In New York, Alexandra Mcglinchy Feb 2024

Dissertations, Theses, and Capstone Projects

For many individuals incarcerated in New York, the initial step toward freedom begins with an interview with the Board of Parole. This process, however, is frequently a complex and challenging one, characterized by repeated denials and extended incarcerations. The disparity in outcomes – where one individual may receive over 20 denials and another is granted parole on their first attempt – highlights the ambiguity and inconsistency in the parole decision-making process. This project aims to clarify the factors that influence parole decisions by concentrating on measurable variables. These include age, race, duration of sentence served, proportion of sentence served, type …


Statistical Consulting In Academia: A Review, Ke Xiao Jan 2024

Major Papers

This paper reviews the state of statistical consulting in academia by performing a literature review on this topic in Chapters 1 and 2. Chapter 1 gives an overview of general aspects of statistical consulting and the types of centers that conduct such services in academia. In Chapter 2 we summarize the literature about the common logistics and processes for conducting statistical consulting in academia. In Chapters 3 and 4, we analyze data on statistical consulting centers for the 100 largest universities in the USA. We also review the literature on the future of statistical consulting in academia in the era of big data and …


A Bayesian Inversion For Emissions And Export Productivity Across The End-Cretaceous Boundary, Alexander A. Cox Jan 2024

Dartmouth College Master’s Theses

The end-Cretaceous mass extinction was marked by both the Chicxulub impact and the ongoing emplacement of the Deccan Traps flood basalt province. Both of these events perturbed the environment by the emission of climate-active volatiles, primarily CO2 and SO2. To understand the mechanism of extinction, we must disentangle the timing, duration, and intensity of volcanic and meteoritic environmental forcings. In this thesis, we used a parallel Markov chain Monte Carlo approach to invert for the aforementioned volatile emissions, export productivity, and remineralization from 67 to 65 million years ago using the LOSCAR (Long-term Ocean-atmosphere-Sediment CArbon cycle Reservoir) model. The parallel …


MS Environmental Biology Capstone Project, Denise Corona Jan 2024

Regis University Student Publications (comprehensive collection)

Land-use change (LUC) is a key driver of biodiversity loss, altering the structure and function of ecosystems through human activities such as urbanization and agriculture. This change has led to habitat loss and fragmentation, resulting in the rapid decline of avian populations globally. Wildlife rehabilitation centers are the primary responders for injured birds and their records provide valuable data to monitor potential factors impacting bird populations. However, these datasets are underutilized in research. This study examined how LUC in the Front Range affects the likelihood and circumstances of admission of injured birds to the Rocky Mountain Wildlife Alliance (RMWA) in …


Multiscale Modelling Of Brain Networks And The Analysis Of Dynamic Processes In Neurodegenerative Disorders, Hina Shaheen Jan 2024

Theses and Dissertations (Comprehensive)

The complex nature of the human brain, with its intricate organic structure and multiscale spatio-temporal characteristics ranging from synapses to the entire brain, presents a major obstacle in brain modelling. Capturing this complexity poses a significant challenge for researchers. The complex interplay of coupled multiphysics and biochemical activities within this intricate system shapes the brain's capacity, functioning within a structure-function relationship that necessitates a specific mathematical framework. Advanced mathematical modelling approaches that incorporate the coupling of brain networks and the analysis of dynamic processes are essential for advancing therapeutic strategies aimed at treating neurodegenerative diseases (NDDs), which afflict millions of …


Applications Of Independent And Identically Distributed (IID) Random Processes In Polarimetry And Climatology, Dan Kestner Jan 2024

Dissertations, Master's Theses and Master's Reports

The unifying theme of this thesis is the characterization of “perfect randomness,” i.e., independent and identically distributed (IID) stochastic processes as these are applied in physical science. Two specific and mathematically distinct applications are chosen: (i) Radar and optical polarimetry; (ii) Analysis of time series in meteorology. In (i), an IID process of a special kind, namely one with a distribution defined by symmetry, is used to link its multivariate Gaussian density to uniformity on the Poincaré sphere. This “statistical ellipsometry” approach is then used to relate polarimetric mismatches or imbalances to ellipsometric variables and suitably chosen cross-correlation measures. In (ii), recently …


On Generative Models And Joint Architectures For Document-Level Relation Extraction, Aviv Brokman Jan 2024

Theses and Dissertations--Statistics

Biomedical text is being generated at a high rate in scientific literature publications and electronic health records. Within these documents lies a wealth of potentially useful information in biomedicine. Relation extraction (RE), the process of automating the identification of structured relationships between entities within text, represents a highly sought-after goal in biomedical informatics, offering the potential to unlock deeper insights and connections from this vast corpus of data. In this dissertation, we tackle this problem with a variety of approaches.

We review the recent history of the field of document-level RE. Several themes emerge. First, graph neural networks dominate the …


Tropical Fish Study In Tahiti, French Polynesia, Miranda Brainard, Caitlyn Swango, Paityn Houglan, Richard Londraville Jan 2024

Williams Honors College, Honors Research Projects

In May of 2023, I embarked on an exciting research journey to Moorea, French Polynesia, alongside fellow students and faculty members from the University of Akron and Syracuse University. This expedition was part of the university-sponsored Tropical Vertebrate Biology course, where we delved into the exploration of various tropical species inhabiting the island, including sea urchins, geckos, and my primary focus, the blackspotted rockskipper.

My research team, composed of my co-authors and me, was particularly intrigued by the unique refuge-seeking behavior displayed by blackspotted rockskippers. These amphibious fish are renowned for their remarkable ability to inhabit tide pools and rocky …