Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Theses/Dissertations

Statistics and Probability

Articles 1 - 30 of 3600

Full-Text Articles in Physical Sciences and Mathematics

High-Dimensional Mediation Analysis Of Multi-Omics Data, Sunyi Chi May 2024

Dissertations & Theses (Open Access)

Environmental exposures such as cigarette smoking influence health outcomes through intermediate molecular phenotypes, such as the methylome, transcriptome, and metabolome. Mediation analysis is a useful tool for investigating the role of potentially high-dimensional intermediate phenotypes in the relationship between environmental exposures and health outcomes. Rapid development of high-throughput technologies has made mediation analysis of multi-omics data critical to gain groundbreaking insights into the biological mechanisms underlying disease etiology. This dissertation aims to develop mediation analysis methods that utilize the enormous amount of multi-omics data in assessing mechanisms of disease etiology. It contains three projects where I propose advanced mediation …


Exploring Application Of The Coordinate Exchange To Generate Optimal Designs Robust To Data Loss, Asher Hanson May 2024

All Graduate Theses and Dissertations, Fall 2023 to Present

The primary objective of this study is to evaluate the efficacy of the coordinate exchange (CEXCH) algorithm in the generation of robust optimal designs. The assessment involves a comparative analysis, wherein designs produced by the Point Exchange (PEXCH) Algorithm are employed as benchmarks for evaluating the efficiency of CEXCH designs. Three modified criteria, selected from the traditional alphabet criteria pool, are utilized to score each algorithm. To enhance the reliability of the comparative analysis, multiple rounds of validation are conducted, focusing on visual assessments, design scores, and criteria efficiencies. The findings from each round of validation contribute to a comprehensive …


Exploring Optimal Design Of Experiments For Random Effects Models, Ryan C. Bushman May 2024

All Graduate Theses and Dissertations, Fall 2023 to Present

The majority of research in the field of optimal design of experiments has focused on producing designs for fixed effects models. The purpose of this thesis is to explore how the optimal design framework applies to nested random effects models. The object that is being optimized is the model information matrix. We explore the full derivation of the random effects information matrix to highlight the complexity of the problem and show how the optimization is a function of the model's parameters. In conjunction with this research, the ODVC (Optimal Design for Variance Components) package was built to provide tools that …


A Causal Inference Approach For Spike Train Interactions, Zach Saccomano Feb 2024

Dissertations, Theses, and Capstone Projects

Since the 1960s, neuroscientists have worked on the problem of estimating synaptic properties, such as connectivity and strength, from simultaneously recorded spike trains. Recent years have seen renewed interest in the problem coinciding with rapid advances in experimental technologies, including an approximate exponential increase in the number of neurons that can be recorded in parallel and perturbation techniques such as optogenetics that can be used to calibrate and validate causal hypotheses about functional connectivity. This thesis presents a mathematical examination of synaptic inference from two perspectives: (1) using in vivo data and biophysical models, we ask in what cases the …


Making Sense Of Making Parole In New York, Alexandra Mcglinchy Feb 2024

Dissertations, Theses, and Capstone Projects

For many individuals incarcerated in New York, the initial step toward freedom begins with an interview with the Board of Parole. This process, however, is frequently a complex and challenging one, characterized by repeated denials and extended incarcerations. The disparity in outcomes – where one individual may receive over 20 denials and another is granted parole on their first attempt – highlights the ambiguity and inconsistency in the parole decision-making process. This project aims to clarify the factors that influence parole decisions by concentrating on measurable variables. These include age, race, duration of sentence served, proportion of sentence served, type …


Modeling Of Covid-19 Clinical Outcomes In Mexico: An Analysis Of Demographic, Clinical, And Chronic Disease Factors, Livia Clarete Feb 2024

Dissertations, Theses, and Capstone Projects

This study explores COVID-19 clinical outcomes in Mexico, focusing on demographic, clinical, and chronic disease variables to develop predictive models. In the binary classification task, the Ada Boost Classifier distinguishes survivors from non-survivors, with age, sex, ethnicity, and chronic medical conditions influencing outcomes. In multiclass classification, the Gradient Boosting Classifier categorizes patients into outcome groups.

Demographic variables, especially age, are crucial for predicting COVID-19 outcomes for both the binary and multiclass classification tasks. Clinical information about previous conditions, including chronic diseases, also holds relevance, especially diabetes, immunocompromise, and cardiovascular diseases. These insights inform public health measures and healthcare strategies, emphasizing …


Statistical Consulting In Academia: A Review, Ke Xiao Jan 2024

Major Papers

This paper reviews the state of statistical consulting in academia by performing a literature review on this topic in chapters 1 and 2. Chapter 1 overviews general aspects of statistical consulting and types of centers that conduct such services in academia. In Chapter 2 we summarise the literature about the common logistics and processes for conducting statistical consulting in academia. In Chapters 3 and 4, we analyze data on statistical consulting centers for the largest 100 universities in the USA. We also review the literature on the future of statistical consulting in academia in the era of big data and …


A Bayesian Inversion For Emissions And Export Productivity Across The End-Cretaceous Boundary, Alexander A. Cox Jan 2024

Dartmouth College Master’s Theses

The end-Cretaceous mass extinction was marked by both the Chicxulub impact and the ongoing emplacement of the Deccan Traps flood basalt province. Both of these events perturbed the environment by the emission of climate-active volatiles, primarily CO2 and SO2. To understand the mechanism of extinction, we must disentangle the timing, duration, and intensity of volcanic and meteoritic environmental forcings. In this thesis, we used a parallel Markov chain Monte Carlo approach to invert for the aforementioned volatile emissions, export productivity, and remineralization from 67 to 65 million years ago using the LOSCAR (Long-term Ocean-atmosphere-Sediment CArbon cycle Reservoir) model. The parallel …


Multiscale Modelling Of Brain Networks And The Analysis Of Dynamic Processes In Neurodegenerative Disorders, Hina Shaheen Jan 2024

Theses and Dissertations (Comprehensive)

The complex nature of the human brain, with its intricate organic structure and multiscale spatio-temporal characteristics ranging from synapses to the entire brain, presents a major obstacle in brain modelling. Capturing this complexity poses a significant challenge for researchers. The complex interplay of coupled multiphysics and biochemical activities within this intricate system shapes the brain's capacity, functioning within a structure-function relationship that necessitates a specific mathematical framework. Advanced mathematical modelling approaches that incorporate the coupling of brain networks and the analysis of dynamic processes are essential for advancing therapeutic strategies aimed at treating neurodegenerative diseases (NDDs), which afflict millions of …


Applications Of Independent And Identically Distributed (Iid) Random Processes In Polarimetry And Climatology, Dan Kestner Jan 2024

Dissertations, Master's Theses and Master's Reports

The unifying theme of this thesis is the characterization of “perfect randomness,” i.e., independent and identically distributed (IID) stochastic processes as these are applied in physical science. Two specific and mathematically distinct applications are chosen: (i) Radar and optical polarimetry; (ii) Analysis of time series in meteorology. In (i), IID process of a special kind, namely, with a distribution defined by symmetry, is used to link its multivariate Gaussian density to uniformity on the Poincaré sphere. This “statistical ellipsometry” approach is then used to relate polarimetric mismatches or imbalances to ellipsometric variables and suitably chosen cross-correlation measures. In (ii), recently …


Tropical Fish Study In Tahiti, French Polynesia, Miranda Brainard, Caitlyn Swango, Paityn Houglan, Richard Londraville Jan 2024

Williams Honors College, Honors Research Projects

In May of 2023, I embarked on an exciting research journey to Moorea, French Polynesia, alongside fellow students and faculty members from the University of Akron and Syracuse University. This expedition was part of the university-sponsored Tropical Vertebrate Biology course, where we delved into the exploration of various tropical species inhabiting the island, including sea urchins, geckos, and my primary focus, the blackspotted rockskipper.

My research team, composed of my co-authors and me, was particularly intrigued by the unique refuge-seeking behavior displayed by blackspotted rockskippers. These amphibious fish are renowned for their remarkable ability to inhabit tide pools and rocky …


Reinforcement Learning: Applying Low Discrepancy Action Selection To Deep Deterministic Policy Gradient, Aleksandr Svishchev Jan 2024

Electronic Theses and Dissertations

Reinforcement learning (RL) is a subfield of machine learning concerned with agents learning to behave optimally by interacting with an environment. One of the most important topics in RL is how the agent should explore, that is, how to choose actions in order to rate their impact on long-term reward. For example, a simple baseline strategy might be uniformly random action selection. This thesis investigates the heuristic idea that agents will learn faster if they explore by factoring the environment’s state into their decision and intentionally choose actions which are as different as possible from what they have previously observed. …


Interpretable Word-Level Sentiment Analysis With Attention-Based Multiple Instance Classification Models, Chenyu Yang Dec 2023

Statistical Science Theses and Dissertations

In this study, our main objective is to tackle the black-box nature of popular machine learning models in sentiment analysis and enhance model interpretability. We aim to gain more insight into the decision-making process of sentiment analysis models, which is often obscure in those complex models. To achieve this goal, we introduce two word-level sentiment analysis models.

The first model is called the attention-based multiple instance classification (AMIC) model. It combines the transparent model structure of multiple instance classification and the self-attention mechanism in deep learning to incorporate the contextual information from documents. As demonstrated by a wine review dataset …


Microplate-Like Metal Pyrophosphate Engineered On Ni-Foam Towards Multifunctional Electrode Material For Energy Conversion And Storage, Rishabh Srivastava Dec 2023

Electronic Theses & Dissertations

High demand for clean energy, the dire need for sustainable development, and the push for low carbon footprints are among the challenges leading researchers to pursue research and development of high-performance energy devices. The development of materials used in energy devices is currently focused on enhancing the performance, electronic properties, and durability of devices. Tuning the attributes of transition metals using pyrophosphate (P2O7) ligand moieties can be a promising approach to meet the requirements of energy devices such as water electrolyzers and supercapacitors, although such a material’s configuration is rarely explored for this purpose.

Herein, we grow …


Investigating The Effects Of A Southward Flow In The Southeastern Florida Shelf Using Robotic Instruments, Alfredo Quezada Dec 2023

All HCAS Student Capstones, Theses, and Dissertations

We deployed a Slocum G3 glider fitted with an acoustic Doppler current profiler (ADCP), a Conductivity-Temperature-Depth sensor (CTD), optics sensor channels, and a propeller on the Southeastern Florida shelf. The ADCP and CTD provide continuous measurements of Northern and Eastern current velocity components, salinity, temperature, and density, throughout the water column in a high-current environment. The optics sensor channels are able to provide measurements of chlorophyll concentrations, colored dissolved organic matter (CDOM), and backscatter particle counts. Additionally, for one of the glider deployments, we deployed a Wirewalker wave-powered profiling platform system also fitted with an ADCP and a CTD in …


Development Of An App For The Kalamazoo Nature Center, Ernest Au Dec 2023

Honors Theses

Kalamazoo Nature Center (KNC), which has been recognized by its peers as one of the top nature centers in the country, is home to over 14 miles of hiking trails winding through woods, wetlands, and prairies. There are numerous places/plots in KNC that have an interesting and impressive history besides being home to a variety of animals and hundreds of wildflowers and other plant life. To improve the visitor’s experience at KNC, we will design a software app via the senior capstone project at the department of Computer Science at WMU. As the first step towards establishing a reference model …


Analyzing The Efficacy Of Covid-19 Travel Bans: A Regression Analysis Approach, Mallory Kochanek Dec 2023

Honors Projects

Some might associate the term ‘public health’ with the pandemic that occurred in 2020. COVID-19 spread like most have never seen in their lifetime. It is useful to look at the effectiveness of the travel restrictions in mitigating the spread of the global pandemic. Using linear regression and network regression, we obtain parameter estimates to determine the relation of predictors, such as network effect, percentage of urban population and GDP, on the COVID-19 incidence rate for the months January to April of 2020. Linear regression does not account for the correlation structure of the data. Network regression, on …


Random Variable Spaces: Mathematical Properties And An Extension To Programming Computable Functions, Mohammed Kurd-Misto Dec 2023

Computational and Data Sciences (PhD) Dissertations

This dissertation aims to extend the boundaries of Programming Computable Functions (PCF) by introducing a novel collection of categories referred to as Random Variable Spaces. Originating as a generalization of Quasi-Borel Spaces, Random Variable Spaces are rigorously defined as categories where objects are sets paired with a collection of random variables from an underlying measurable space. These spaces offer a theoretical foundation for extending PCF to natively handle stochastic elements.

The dissertation is structured into seven chapters that provide a multi-disciplinary background, from PCF and Measure Theory to Category Theory with special attention to Monads and the Giry Monad. The …


Static And Dynamic State Estimation Applications In Power Systems Protection And Control Engineering, Ibukunoluwa Olayemi Korede Dec 2023

Doctoral Dissertations

The developed methodologies are proposed to serve as support for control centers and fault analysis engineers. These approaches provide a dependable and effective means of pinpointing and resolving faults, which ultimately enhances power grid reliability. The algorithm uses the Least Absolute Value (LAV) method to estimate the augmented states of the PCB, enabling supervisory monitoring of the system. In addition, statistical analysis based on projection statistics of the system Jacobian is applied as a virtual sensor to detect faults on transmission lines. This approach is particularly valuable for detecting anomalies in transmission line data, such as bad data or …


Parameter Estimation For Patient Enrollment In Clinical Trials, Junyan Liu Dec 2023

Undergraduate Honors Theses

In this paper, we study the Poisson-gamma model for recruitment time in clinical trials. We proved several properties of this model that match our intuitions from a reliability perspective, did simulations on this model, and used different optimization methods to estimate the parameters. Although the behaviors of the optimization methods were unfavorable and unstable, we identified certain conditions and provided potential explanations for this phenomenon and further insights into the Poisson-gamma model.
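
As a rough illustration of the Poisson-gamma setup described in this abstract, the sketch below assumes each center has its own enrollment rate drawn from a gamma distribution and recruits patients as a Poisson process at that rate. The function names, parameter values, and the method-of-moments estimator are illustrative assumptions; the thesis itself reports using optimization methods, which are not reproduced here.

```python
import random
import statistics


def simulate_enrollment(n_centers, shape, rate, horizon, rng):
    """Simulate per-center patient counts over the window [0, horizon].

    Hypothetical helper: each center's rate is Gamma(shape, rate) and
    arrivals within a center follow a Poisson process with that rate.
    """
    counts = []
    for _ in range(n_centers):
        # random.gammavariate takes (shape, scale), so scale = 1 / rate.
        lam = rng.gammavariate(shape, 1.0 / rate)
        # Count Poisson-process arrivals by summing exponential gaps.
        elapsed, arrivals = 0.0, 0
        while True:
            elapsed += rng.expovariate(lam)
            if elapsed > horizon:
                break
            arrivals += 1
        counts.append(arrivals)
    return counts


def moment_estimates(counts, horizon):
    """Method-of-moments estimates of (shape, rate), or None if undefined.

    Uses E[N] = shape * T / rate and Var[N] = E[N] + shape * T^2 / rate^2,
    so shape = mean^2 / (var - mean) and rate = shape * T / mean.
    """
    mean = statistics.mean(counts)
    var = statistics.variance(counts)
    if var <= mean or mean == 0:
        return None  # no overdispersion, so the moment equations fail
    shape = mean * mean / (var - mean)
    return shape, shape * horizon / mean


rng = random.Random(0)
counts = simulate_enrollment(2000, shape=2.0, rate=0.5, horizon=10.0, rng=rng)
print(moment_estimates(counts, horizon=10.0))
```

The moment estimator is undefined whenever the sample variance does not exceed the sample mean, which is one simple way an estimation procedure for this model can become unstable on small samples.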


Aspects Of Stochastic Geometric Mechanics In Molecular Biophysics, David Frost Dec 2023

All Dissertations

In confocal single-molecule FRET experiments, the joint distribution of FRET efficiency and donor lifetime can reveal underlying molecular conformational dynamics via deviation from their theoretical Förster relationship. This shift is referred to as a dynamic shift. In this study, we investigate the influence of the free energy landscape in protein conformational dynamics on the dynamic shift by simulation of the associated continuum reaction coordinate Langevin dynamics, yielding a deeper understanding of the dynamic and structural information in the joint FRET efficiency and donor lifetime distribution. We develop novel Langevin models for the dye linker dynamics, including rotational dynamics, based …


Integrating Machine Learning Methods For Medical Diagnosis, Jazmin Quezada Dec 2023

Open Access Theses & Dissertations

The rapid advancement of machine learning techniques has revolutionized the field of medical diagnosis by offering powerful tools to analyze complex data sets and make accurate predictions. In this proposed method, we present a novel approach that integrates machine learning and optimization models to enhance the accuracy of medical diagnoses. Our method focuses on fine-tuning and optimizing the parameters of machine learning algorithms commonly used in medical diagnosis, such as logistic regression, support vector machines, and neural networks. By employing optimization techniques, we systematically explore the parameter space of these algorithms to discover the optimal configurations. Moreover, by representing …


Metrics For Comparison Of Complex Networks, Clarissa Reyes Dec 2023

Open Access Theses & Dissertations

Heuristic network statistics are used as a preliminary approach to identify change across networks. In networks where there is known node correspondence (KNC), conventional network comparison methods include taking a norm of the difference matrix, or calculating dissimilarity measures like DeltaCon and cut distance. Since different KNC measures provide varying insight into the network comparison problem, we propose employing Rank Score Characteristic Functions (RSCFs) and the rank-score process as a method for reaching a consensus when ranking quantified change across multiple pairs of networks, which is particularly useful for ranking change across subpopulations or subgraphs. Additionally, we propose a …
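
The simplest comparison named in this abstract, a norm of the difference matrix, can be sketched as follows. The choice of the Frobenius norm, the function name, and the two toy graphs are assumptions for illustration only; the abstract does not say which norm the thesis uses.

```python
import math


def frobenius_distance(a, b):
    """Frobenius norm of the difference between two adjacency matrices.

    Assumes known node correspondence: row/column i refers to the same
    node in both networks. Matrices are given as lists of lists.
    """
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("matrices must have the same shape")
    return math.sqrt(
        sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb))
    )


# Toy example: a 3-node path versus a 3-node triangle on the same nodes.
path = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
triangle = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]
print(frobenius_distance(path, triangle))
```

The two toy graphs differ by a single undirected edge, which appears twice in a symmetric adjacency matrix, so the distance is the square root of 2.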


Causal Inference For The Effect Of Continuous Treatment On Time-To-Event Outcomes And Mediation Analysis On Health Disparities In Observational Studies., Triparna Poddar Dec 2023

Electronic Theses and Dissertations

The dissertation comprises two projects related to causal inference based on observational data. In healthcare research, where abundant observational data such as claims data and electronic records are available, researchers often aim to study the treatment effect and the pathway of that effect. However, estimating treatment effects in observational data presents challenges due to confounding factors. The first project focuses on estimating continuous treatment effects for survival outcomes, while the second concentrates on mediation analysis, allowing the exploration of the pathway of the causal effect. Both projects involve addressing confounding variables. In the first project, I investigate estimation of the …


Wavelet Compression As An Observational Operator In Data Assimilation Systems For Sea Surface Temperature, Bradley J. Sciacca Dec 2023

University of New Orleans Theses and Dissertations

The ocean remains severely under-observed, in part due to its sheer size. Containing nearly billion of water with most of the subsurface being invisible because water is extremely difficult to penetrate using electromagnetic radiation, as is typically used by satellite measuring instruments. For this reason, most observations of the ocean have very low spatial-temporal coverage to get a broad capture of the ocean’s features. However, recent “dense but patchy” data have increased the availability of high-resolution – low spatial coverage observations. These novel data sets have motivated research into multi-scale data assimilation methods. Here, we demonstrate a new assimilation approach …


Exploration And Statistical Modeling Of Profit, Caleb Gibson Dec 2023

Undergraduate Honors Theses

For any company involved in sales, maximization of profit is the driving force that guides all decision-making. Many factors can influence how profitable a company can be, including external factors like changes in inflation or consumer demand or internal factors like pricing and product cost. Understanding specific trends in one's own internal data, a company can readily identify problem areas or potential growth opportunities to help increase profitability.

In this discussion, we use an extensive data set to examine how a company might analyze their own data to identify potential changes the company might investigate to drive better performance. Based …


The Private Pilot Check Ride: Applying The Spacing Effect Theory To Predict Time To Proficiency For The Practical Test, Michael Scott Harwin Dec 2023

Theses and Dissertations

This study examined the relationship between a set of targeted factors and the total flight time students needed to become ready to take the private pilot check ride. The study was grounded in Ebbinghaus’s (1885/1913/2013) forgetting curve theory and spacing effect, and Ausubel’s (1963) theory of meaningful learning. The research factors included (a) training time to proficiency, which represented the number of training days needed to become check-ride ready; (b) flight training program (Part 61 vs. Part 141); (c) organization offering the training program (2- or 4-year college/university vs. FBO); (d) scheduling policy (mandated vs. student-driven); and demographical variables, which …


Bayesian Strategies For Propensity Score Estimation In Causal Inference., Uthpala I. Wanigasekara Dec 2023

Electronic Theses and Dissertations

Causal inference is a method used in various fields to draw causal conclusions based on data. It involves using assumptions, study designs, and estimation strategies to minimize the impact of confounding variables. Propensity scores are used to estimate outcome effects, through matching methods, stratification, weighting methods, and the Covariate Balancing Propensity Score method. However, they can be sensitive to estimation techniques and can lead to unstable findings. Researchers have proposed integrating weighting with regression adjustment in parametric models to improve causal inference validity. The first project focuses on Bayesian joint and two-stage methods for propensity score analysis. Propensity score modeling …


Analyses Of Effect Indices Across Single-Case Research Designs In Counseling, Cian L. Brown Dec 2023

Graduate Theses and Dissertations

Single case research design (SCRD) is a common methodology used across clinical disciplines to determine treatment effectiveness by comparing treatment conditions to baseline conditions in individual cases, usually among researchers working with smaller samples. Although popular within behavioral disciplines such as special education and behavioral analysis, studies have begun to emerge in counseling. However, guidance and current understanding of the use of SCRD in counseling are limited. A content analysis of counseling journals from 2003 to 2014 yielded only 7 studies using SCRD. In 2015, the flagship counseling journal, Journal of Counseling and Development, published a special issue on the …


Foundations Of Memory Capacity In Models Of Neural Cognition, Chandradeep Chowdhury Dec 2023

Master's Theses

A central problem in neuroscience is to understand how memories are formed as a result of the activities of neurons. Valiant’s neuroidal model attempted to address this question by modeling the brain as a random graph and memories as subgraphs within that graph. However, the question of memory capacity within that model has not been explored: how many memories can the brain hold? Valiant introduced the concept of interference between memories as the defining factor for capacity; excessive interference signals the model has reached capacity. Since then, exploration of capacity has been limited, but recent investigations have delved into the …