Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Theses/Dissertations

Statistics and Probability

2023

Institution
Keyword
Publication

Articles 1 - 30 of 218

Full-Text Articles in Physical Sciences and Mathematics

Interpretable Word-Level Sentiment Analysis With Attention-Based Multiple Instance Classification Models, Chenyu Yang Dec 2023

Interpretable Word-Level Sentiment Analysis With Attention-Based Multiple Instance Classification Models, Chenyu Yang

Statistical Science Theses and Dissertations

In this study, our main objective is to tackle the black-box nature of popular machine learning models in sentiment analysis and enhance model interpretability. We aim to gain more insight into the decision-making process of sentiment analysis models, which is often obscure in those complex models. To achieve this goal, we introduce two word-level sentiment analysis models.

The first model is called the attention-based multiple instance classification (AMIC) model. It combines the transparent model structure of multiple instance classification and the self-attention mechanism in deep learning to incorporate the contextual information from documents. As demonstrated by a wine review dataset …


Microplate-Like Metal Pyrophosphate Engineered On Ni-Foam Towards Multifunctional Electrode Material For Energy Conversion And Storage, Rishabh Srivastava Dec 2023

Microplate-Like Metal Pyrophosphate Engineered On Ni-Foam Towards Multifunctional Electrode Material For Energy Conversion And Storage, Rishabh Srivastava

Electronic Theses & Dissertations

High clean energy demand, dire need for sustainable development, and low carbon footprints are the few intuitive challenges, leading researchers to aim for research and development for high-performance energy devices. The development of materials used in energy devices is currently focused on enhancing the performance, electronic properties, and durability of devices. Tunning the attributes of transition metals using pyrophosphate (P2O7) ligand moieties can be a promising approach to meet the requirements of energy devices such as water electrolyzers and supercapacitors, although such a material’s configuration is rarely exposed for this purpose of study.

Herein, we grow …


Investigating The Effects Of A Southward Flow In The Southeastern Florida Shelf Using Robotic Instruments, Alfredo Quezada Dec 2023

Investigating The Effects Of A Southward Flow In The Southeastern Florida Shelf Using Robotic Instruments, Alfredo Quezada

All HCAS Student Capstones, Theses, and Dissertations

We deployed a Slocum G3 glider fitted with an acoustic Doppler current profiler (ADCP), a Conductivity-Temperature-Depth sensor (CTD), optics sensor channels, and a propeller on the Southeastern Florida shelf. The ADCP and CTD provide continuous measurements of Northern and Eastern current velocity components, salinity, temperature, and density, throughout the water column in a high-current environment. The optics sensor channels are able to provide measurements of chlorophyll concentrations, colored dissolved organic matter (CDOM), and backscatter particle counts. Additionally, for one of the glider deployments, we deployed a Wirewalker wave-powered profiling platform system also fitted with an ADCP and a CTD in …


Development Of An App For The Kalamazoo Nature Center, Ernest Au Dec 2023

Development Of An App For The Kalamazoo Nature Center, Ernest Au

Honors Theses

Kalamazoo Nature Center (KNC), which has been recognized by its peers as one of the top nature centers in the country, is home to over 14 miles of hiking trails winding through woods, wetlands, and prairies. There are numerous places/plots in KNC that have an interesting and impressive history besides being home to a variety of animals and hundreds of wildflowers and other plant life. To improve the visitor’s experience at KNC, we will design a software app via the senior capstone project at the department of Computer Science at WMU. As the first step towards establishing a reference model …


Analyzing The Efficacy Of Covid-19 Travel Bans: A Regression Analysis Approach, Mallory Kochanek Dec 2023

Analyzing The Efficacy Of Covid-19 Travel Bans: A Regression Analysis Approach, Mallory Kochanek

Honors Projects

Some might associate the term ‘public health’ with the pandemic that occurred in 2020. COVID-19 spread like most have never seen in their lifetime. It is useful to look at the effectiveness of the travel re- strictions in mitigating the spread of the global pandemic. Using linear regression and network regression, we obtain parameter estimates to determine the relation of predictors, such as network effect, percentage of urban population and GDP, on the COVID-19 incidence rate for the months January to April of 2020. Linear regression does not ac- count for the correlation structure of the data. Network regression, on …


Random Variable Spaces: Mathematical Properties And An Extension To Programming Computable Functions, Mohammed Kurd-Misto Dec 2023

Random Variable Spaces: Mathematical Properties And An Extension To Programming Computable Functions, Mohammed Kurd-Misto

Computational and Data Sciences (PhD) Dissertations

This dissertation aims to extend the boundaries of Programming Computable Functions (PCF) by introducing a novel collection of categories referred to as Random Variable Spaces. Originating as a generalization of Quasi-Borel Spaces, Random Variable Spaces are rigorously defined as categories where objects are sets paired with a collection of random variables from an underlying measurable space. These spaces offer a theoretical foundation for extending PCF to natively handle stochastic elements.

The dissertation is structured into seven chapters that provide a multi-disciplinary background, from PCF and Measure Theory to Category Theory with special attention to Monads and the Giry Monad. The …


Static And Dynamic State Estimation Applications In Power Systems Protection And Control Engineering, Ibukunoluwa Olayemi Korede Dec 2023

Static And Dynamic State Estimation Applications In Power Systems Protection And Control Engineering, Ibukunoluwa Olayemi Korede

Doctoral Dissertations

The developed methodologies are proposed to serve as support for control centers and fault analysis engineers. These approaches provide a dependable and effective means of pinpointing and resolving faults, which ultimately enhances power grid reliability. The algorithm uses the Least Absolute Value (LAV) method to estimate the augmented states of the PCB, enabling supervisory monitoring of the system. In addition, the application of statistical analysis based on projection statistics of the system Jacobian as a virtual sensor to detect faults on transmission lines. This approach is particularly valuable for detecting anomalies in transmission line data, such as bad data or …


Aspects Of Stochastic Geometric Mechanics In Molecular Biophysics, David Frost Dec 2023

Aspects Of Stochastic Geometric Mechanics In Molecular Biophysics, David Frost

All Dissertations

In confocal single-molecule FRET experiments, the joint distribution of FRET efficiency and donor lifetime distribution can reveal underlying molecular conformational dynamics via deviation from their theoretical Forster relationship. This shift is referred to as a dynamic shift. In this study, we investigate the influence of the free energy landscape in protein conformational dynamics on the dynamic shift by simulation of the associated continuum reaction coordinate Langevin dynamics, yielding a deeper understanding of the dynamic and structural information in the joint FRET efficiency and donor lifetime distribution. We develop novel Langevin models for the dye linker dynamics, including rotational dynamics, based …


Integrating Machine Learning Methods For Medical Diagnosis, Jazmin Quezada Dec 2023

Integrating Machine Learning Methods For Medical Diagnosis, Jazmin Quezada

Open Access Theses & Dissertations

Abstract:The rapid advancement of machine learning techniques has revolutionized the field of medical diagnosis by offering powerful tools to analyze complex data sets and make accurate predictions. In this proposed method, we present a novel approach that integrates machine learning and optimization models to enhance the accuracy of medical diagnoses. Our method focuses on fine-tuning and optimizing the parameters of machine learning algorithms commonly used in medical diagnosis, such as logistic regression, support vector machines, and neural networks. By employing optimization techniques, we systematically explore the parameter space of these algorithms to discover the most optimal configurations. Moreover, by representing …


Metrics For Comparison Of Complex Networks, Clarissa Reyes Dec 2023

Metrics For Comparison Of Complex Networks, Clarissa Reyes

Open Access Theses & Dissertations

Heuristic network statistics are used as a preliminary approach to identify change across networks. In networks where there is known node correspondence (KNC), conventional network comparison methods include taking a norm of the difference matrix, or calculating dissimilarity measures like DeltaCon and cut distance. Since different KNC measures provide varying insight to the network comparison problem, we propose employing Rank Score Characteristic Functions (RSCFs) and the rank-score process as a method for reaching a consensus when ranking quantified change across multiple pairs of networks â?? which is particularly useful for ranking change across subpopulations or subgraphs. Additionally, we propose a …


Wavelet Compression As An Observational Operator In Data Assimilation Systems For Sea Surface Temperature, Bradley J. Sciacca Dec 2023

Wavelet Compression As An Observational Operator In Data Assimilation Systems For Sea Surface Temperature, Bradley J. Sciacca

University of New Orleans Theses and Dissertations

The ocean remains severely under-observed, in part due to its sheer size. Containing nearly billion of water with most of the subsurface being invisible because water is extremely difficult to penetrate using electromagnetic radiation, as is typically used by satellite measuring instruments. For this reason, most observations of the ocean have very low spatial-temporal coverage to get a broad capture of the ocean’s features. However, recent “dense but patchy” data have increased the availability of high-resolution – low spatial coverage observations. These novel data sets have motivated research into multi-scale data assimilation methods. Here, we demonstrate a new assimilation approach …


Exploration And Statistical Modeling Of Profit, Caleb Gibson Dec 2023

Exploration And Statistical Modeling Of Profit, Caleb Gibson

Undergraduate Honors Theses

For any company involved in sales, maximization of profit is the driving force that guides all decision-making. Many factors can influence how profitable a company can be, including external factors like changes in inflation or consumer demand or internal factors like pricing and product cost. Understanding specific trends in one's own internal data, a company can readily identify problem areas or potential growth opportunities to help increase profitability.

In this discussion, we use an extensive data set to examine how a company might analyze their own data to identify potential changes the company might investigate to drive better performance. Based …


Parameter Estimation For Patient Enrollment In Clinical Trials, Junyan Liu Dec 2023

Parameter Estimation For Patient Enrollment In Clinical Trials, Junyan Liu

Undergraduate Honors Theses

In this paper, we study the Poisson-gamma model for recruitment time in clinical trials. We proved several properties of this model that match our intuitions from a reliability perspective, did simulations on this model, and used different optimization methods to estimate the parameters. Although the behaviors of the optimization methods were unfavorable and unstable, we identified certain conditions and provided potential explanations for this phenomenon and further insights into the Poisson-gamma model.


Bayesian Strategies For Propensity Score Estimation In Causal Inference., Uthpala I. Wanigasekara Dec 2023

Bayesian Strategies For Propensity Score Estimation In Causal Inference., Uthpala I. Wanigasekara

Electronic Theses and Dissertations

Causal inference is a method used in various fields to draw causal conclusions based on data. It involves using assumptions, study designs, and estimation strategies to minimize the impact of confounding variables. Propensity scores are used to estimate outcome effects, through matching methods, stratification, weighting methods, and the Covariate Balancing Propensity Score method. However, they can be sensitive to estimation techniques and can lead to unstable findings. Researchers have proposed integrating weighing with regression adjustment in parametric models to improve causal inference validity. The first project focuses on Bayesian joint and two-stage methods for propensity score analysis. Propensity score modeling …


Analyses Of Effect Indices Across Single-Case Research Designs In Counseling, Cian L. Brown Dec 2023

Analyses Of Effect Indices Across Single-Case Research Designs In Counseling, Cian L. Brown

Graduate Theses and Dissertations

Single case research design (SCRD) is a common methodology used across clinical disciplines to determine treatments effectiveness by comparing treatment conditions to baseline conditions in individual cases, usually among researchers working with smaller samples. Although popular within behavioral disciplines such as special education and behavioral analysis, studies have begun to emerge in counseling. However, guidance and current understanding of the use of SCRD in counseling is limited. A content analysis of counseling journals from 2003 to 2014 yielded only 7 studies using SCRD. In 2015, the flagship counseling journal, Journal of Counseling and Development, published a special issue on the …


Foundations Of Memory Capacity In Models Of Neural Cognition, Chandradeep Chowdhury Dec 2023

Foundations Of Memory Capacity In Models Of Neural Cognition, Chandradeep Chowdhury

Master's Theses

A central problem in neuroscience is to understand how memories are formed as a result of the activities of neurons. Valiant’s neuroidal model attempted to address this question by modeling the brain as a random graph and memories as subgraphs within that graph. However the question of memory capacity within that model has not been explored: how many memories can the brain hold? Valiant introduced the concept of interference between memories as the defining factor for capacity; excessive interference signals the model has reached capacity. Since then, exploration of capacity has been limited, but recent investigations have delved into the …


Radiation Exposure Calibration Of The Al2o3:C With Radium-226 And Cesium-137 Using The Osl Method, Selma Tepeli Aydin Dec 2023

Radiation Exposure Calibration Of The Al2o3:C With Radium-226 And Cesium-137 Using The Osl Method, Selma Tepeli Aydin

All Theses

Optically stimulated luminescence (OSL) dosimetry was utilized to calibrate Al2O3:C powder dosimeters, available commercially as the nanoDot® from Landauer Inc., and compare the dosimeter response to radium-226 (226Ra) and cesium-137 (137Cs). The signal from the OSL was quantified using a microSTARii® OSL reader also produced by Landauer Inc. Dose-response curves were developed for 226Ra and 137Cs experiments (5 dosimeters each) at thirteen absorbed doses. Individual dosimeter response was tracked by serial number. Linear regression analysis was performed to determine if there were significant differences between the intercepts of the …


Bayesian Learning Of Spatiotemporal Source Distribution For Beached Microplastic In The Gulf Of Mexico, David Pojunas Dec 2023

Bayesian Learning Of Spatiotemporal Source Distribution For Beached Microplastic In The Gulf Of Mexico, David Pojunas

Graduate Theses and Dissertations

Over the last several decades, plastic waste has gradually accumulated while slowly degrading in terrestrial and oceanic environments. Recently, there has been an increased effort to identify the possible sources of plastic to understand how they affect vulnerable beaches. This issue is of particular concern in the Gulf of Mexico due to the presence of oil, natural gas, and plastic production. In this thesis, we expand upon existing Bayesian plastic attribution models and develop a rigorous statistical framework to map observed beached microplastics to their sources. Within this framework, we combine Lagrangian backtracking simulations of floating particles using nurdle beaching …


Comparative Analysis Of Teacher Effects Parameters In Models Used For Assessing School Effectiveness: Value-Added Models & Persistence, Merlin J. Kamgue Dec 2023

Comparative Analysis Of Teacher Effects Parameters In Models Used For Assessing School Effectiveness: Value-Added Models & Persistence, Merlin J. Kamgue

Graduate Theses and Dissertations

Longitudinal measures for students have become increasingly popular to estimate the effects of individual teachers and schools. Value-added models are one of the approaches using longitudinal data to evaluate teachers and schools. In the value-added model (VAM) literature, many statistical approaches have been developed and used to estimate teacher or school effects on student learning. This study opted to use a Bayesian multivariate model for evaluating teacher effects. The generalized persistence models can handle longitudinal data, not vertically scaled, allowing for a below-par teacher’s effects correlation across test administrations. This study first generated longitudinal students’ test score data and used …


Interactions Between Sediment Mechanical Structure And Infaunal Community Structure Following Physical Disturbance, William Cyrus Roger Clemo Dec 2023

Interactions Between Sediment Mechanical Structure And Infaunal Community Structure Following Physical Disturbance, William Cyrus Roger Clemo

<strong> Theses and Dissertations </strong>

Shallow, river-influenced coastal sediments are important for global carbon storage and nutrient cycling and provide a habitat for diverse communities of invertebrates (infauna). Elevated bed shear stress from extreme storms can resuspend, transport, and deposit sediments, disrupting the cohesive structure of muds, and sorting and depositing sand eroded from beaches. These physical disruptions can also resuspend or smother infauna, decreasing abundances and changing community structure. Infaunal activities such as burrowing, tube construction, and feeding can impact sediment structure and stability. However, little is known about how physical disturbance impacts short and long-term sediment habitat suitability and whether disturbance-tolerant infauna influence …


Generative Adversarial Game With Tailored Quantum Feature Maps For Enhanced Classification, Anais Sandra Nguemto Guiawa Dec 2023

Generative Adversarial Game With Tailored Quantum Feature Maps For Enhanced Classification, Anais Sandra Nguemto Guiawa

Doctoral Dissertations

In the burgeoning field of quantum machine learning, the fusion of quantum computing and machine learning methodologies has sparked immense interest, particularly with the emergence of noisy intermediate-scale quantum (NISQ) devices. These devices hold the promise of achieving quantum advantage, but they grapple with limitations like constrained qubit counts, limited connectivity, operational noise, and a restricted set of operations. These challenges necessitate a strategic and deliberate approach to crafting effective quantum machine learning algorithms.

This dissertation revolves around an exploration of these challenges, presenting innovative strategies that tailor quantum algorithms and processes to seamlessly integrate with commercial quantum platforms. A …


Causal Inference For The Effect Of Continuous Treatment On Time-To-Event Outcomes And Mediation Analysis On Health Disparities In Observational Studies., Triparna Poddar Dec 2023

Causal Inference For The Effect Of Continuous Treatment On Time-To-Event Outcomes And Mediation Analysis On Health Disparities In Observational Studies., Triparna Poddar

Electronic Theses and Dissertations

The dissertation comprises two projects related to causal inference based on observational data. In healthcare research, where abundant observational data such as claims data and electronic records are available, researchers often aim to study the treatment effect and the pathway of that effect. However, estimating treatment effects in observational data presents challenges due to confounding factors. The first project focuses on estimating continuous treatment effects for survival outcomes, while the second concentrates on mediation analysis, allowing the exploration of the pathway of the causal effect. Both projects involve addressing confounding variables. In the first project, I investigate estimation of the …


The Private Pilot Check Ride: Applying The Spacing Effect Theory To Predict Time To Proficiency For The Practical Test, Michael Scott Harwin Dec 2023

The Private Pilot Check Ride: Applying The Spacing Effect Theory To Predict Time To Proficiency For The Practical Test, Michael Scott Harwin

Theses and Dissertations

This study examined the relationship between a set of targeted factors and the total flight time students needed to become ready to take the private pilot check ride. The study was grounded in Ebbinghaus’s (1885/1913/2013) forgetting curve theory and spacing effect, and Ausubel’s (1963) theory of meaningful learning. The research factors included (a) training time to proficiency, which represented the number of training days needed to become check-ride ready; (b) flight training program (Part 61 vs. Part 141); (c) organization offering the training program (2- or 4-year college/university vs. FBO); (d) scheduling policy (mandated vs. student-driven); and demographical variables, which …


An Ensemble Approach For Mapping Snow Water Equivalent In Utah, Logan Schneider Dec 2023

An Ensemble Approach For Mapping Snow Water Equivalent In Utah, Logan Schneider

All Graduate Theses and Dissertations, Fall 2023 to Present

Mountain snowpack is an important resource for water management planning in Utah. Snow water equivalent (SWE) is the amount of water contained in a snowpack. A few organizations predict SWE throughout the United States but struggle making accurate predictions in mountainous regions. Weather stations provide accurate measurements of SWE but have limited spatial coverage that hinders the ability to make accurate estimates statewide. This thesis examines the accuracy of current models and proposes using local weather measurements to improve upon national level predictions. An R statistical software package named rsnodas implements this process while allowing the public access to a …


Using Gamification To Foster Student Resilience And Motivation To Learn, And Using Games To Teach Significance Testing Concepts In The Statistics Classroom, Todd Partridge Dec 2023

Using Gamification To Foster Student Resilience And Motivation To Learn, And Using Games To Teach Significance Testing Concepts In The Statistics Classroom, Todd Partridge

All Graduate Theses and Dissertations, Fall 2023 to Present

Two studies are outlined in this dissertation.

In the first study, elements of Super Mario Bros. videos games were used to change the way college students in a beginners’ statistics course were graded on their work. This was part of an effort to help students remain optimistic in the face of challenging coursework and even failure on assignments and tests. The study shows that the changes made to the grading structure did help students to keep trying and to use the materials given to them by their professor until they achieved their desired grade in the course, and suggests ways …


Nonparametric Derivative Estimation Using Penalized Splines: Theory And Application, Bright Antwi Boasiako Nov 2023

Nonparametric Derivative Estimation Using Penalized Splines: Theory And Application, Bright Antwi Boasiako

Doctoral Dissertations

This dissertation is in the field of Nonparametric Derivative Estimation using
Penalized Splines. It is conducted in two parts. In the first part, we study the L2
convergence rates of estimating derivatives of mean regression functions using penalized splines. In 1982, Stone provided the optimal rates of convergence for estimating derivatives of mean regression functions using nonparametric methods. Using these rates, Zhou et. al. in their 2000 paper showed that the MSE of derivative estimators based on regression splines approach zero at the optimal rate of convergence. Also, in 2019, Xiao showed that, under some general conditions, penalized spline estimators …


Multi-Arm Randomized Control Trials In Inflammatory Bowel Disease: A Literature Review And An Illustration Of Methods For Analysis, Sahiba Saini Nov 2023

Multi-Arm Randomized Control Trials In Inflammatory Bowel Disease: A Literature Review And An Illustration Of Methods For Analysis, Sahiba Saini

Electronic Thesis and Dissertation Repository

This thesis aimed to review the literature on multiple-arm randomized control trials in inflammatory bowel disease (IBD) and to illustrate how to analyze these trials, focusing on appropriately controlling the type 1 error rates. The literature review found 247 trials published from the inception of each database to April 2014, of which 122 (49%) trials were multiple-arm trials and of those, 59 (48%) trials were on ulcerative colitis and 63 (52%) on Crohn’s disease. A published assessment tool was adopted to assess whether controlling of Type I error rates was needed. Despite the common use of this trial design and …


Using Social Network Analysis To Measure And Visualize Student Clustering Within Middle And High Schools, Geoffrey David West Nov 2023

Using Social Network Analysis To Measure And Visualize Student Clustering Within Middle And High Schools, Geoffrey David West

USF Tampa Graduate Theses and Dissertations

The dominant philosophy of American public schools has been to group students together based on similar characteristics. Known as tracking, high achieving students would take courses on the “college track” while others would take “career track” courses. It was not long until advocates noticed that this process unfairly advantaged affluent and White student over poor and minoritized groups. A new process called “ability grouping” took over where tracking left off, but to the same effect. It is difficult to measure the degree students are grouped together by a certain characteristic, and while a few research papers aim to do so, …


The Use Of Regularization To Detect Racial Inequities In Pay Equity Studies: An Empirical Study And Reflections On Regulation Methods, Christopher M. Peña Nov 2023

The Use Of Regularization To Detect Racial Inequities In Pay Equity Studies: An Empirical Study And Reflections On Regulation Methods, Christopher M. Peña

Electronic Theses and Dissertations

Since the late 1970s, multiple linear regression has been the preferred method for identifying discrimination in pay. An empirical study on this topic was conducted using quantitative critical methods. A literature review first examined conflicting views on using multiple linear regression in pay equity studies. The review found that multiple linear regression is used so prevalently in pay equity studies because the courts and practitioners have widely accepted it and because of its simplicity and ability to parse multiple sources of variance simultaneously. Commentaries in the literature cautioned about errors in model specification, the use of tainted variables, and the …


Bayesian Statistical Modeling Of Spatially Resolved Transcriptomics Data, Xi Jiang Oct 2023

Bayesian Statistical Modeling Of Spatially Resolved Transcriptomics Data, Xi Jiang

Statistical Science Theses and Dissertations

Spatially resolved transcriptomics (SRT) quantifies expression levels at different spatial locations, providing a new and powerful tool to investigate novel biological insights. As experimental technologies enhance both in capacity and efficiency, there arises a growing demand for the development of analytical methodologies.

One question in SRT data analysis is to identify genes whose expressions exhibit spatially correlated patterns, called spatially variable (SV) genes. Most current methods to identify SV genes are built upon the geostatistical model with Gaussian process, which could limit the models' ability to identify complex spatial patterns. In order to overcome this challenge and capture more types …