Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 19 of 19

Full-Text Articles in Entire DC Network

The Private Pilot Check Ride: Applying The Spacing Effect Theory To Predict Time To Proficiency For The Practical Test, Michael Scott Harwin Dec 2023

The Private Pilot Check Ride: Applying The Spacing Effect Theory To Predict Time To Proficiency For The Practical Test, Michael Scott Harwin

Theses and Dissertations

This study examined the relationship between a set of targeted factors and the total flight time students needed to become ready to take the private pilot check ride. The study was grounded in Ebbinghaus’s (1885/1913/2013) forgetting curve theory and spacing effect, and Ausubel’s (1963) theory of meaningful learning. The research factors included (a) training time to proficiency, which represented the number of training days needed to become check-ride ready; (b) flight training program (Part 61 vs. Part 141); (c) organization offering the training program (2- or 4-year college/university vs. FBO); (d) scheduling policy (mandated vs. student-driven); and demographical variables, which …


Stochastic Optimal Control Of Conditional Mckean-Vlasov Equations With Jump And Markovian Switching, Charles Samuel Conly Sharp Dec 2023

Stochastic Optimal Control Of Conditional Mckean-Vlasov Equations With Jump And Markovian Switching, Charles Samuel Conly Sharp

Theses and Dissertations

This thesis obtains a number of results in stochastic optimal control for conditional McKean-Vlasov equations with jump and Markovian switching. First, we prove the uniqueness of the solutions and derive a relevant version of Itô's formula. We provide the dynamic programming principle and prove the associated verification theorem. A stochastic maximum principle is established. Further, we derive the relationship between dynamic programming and the stochastic maximum principle. Additionally, we utilize our stochastic maximum principle result for a mean-variance portfolio selection problem.


Approaches To Detecting And Modeling Over-And Underdispersion In Alternative Count Data Distributions And An Application Of Logistic Regression And Random Forest Modeling To Improve Screening Tools For Tic Disorders In Children, Rebecca C. Wardrop Jul 2023

Approaches To Detecting And Modeling Over-And Underdispersion In Alternative Count Data Distributions And An Application Of Logistic Regression And Random Forest Modeling To Improve Screening Tools For Tic Disorders In Children, Rebecca C. Wardrop

Theses and Dissertations

This dissertation focuses on theory and application of discrete data methods, particularly approaches to over- and underdispersion relative to the Poisson distribution and an application of random forest and logistic regression modeling. The first chapter derives a score test for over- and underdispersion in the heaped generalized Poisson distribution. Equi-, over-, and underdispersed heaped generalized Poisson and heaped negative binomial data are simulated to evaluate the performance of the score test by comparing the power it achieves to that of Wald and likelihood ratio tests. We find that the score test we derive performs comparably to both the Wald and …


Statistical Methods For Single Cell Sequencing Data Analysis, Fei Qin Jul 2023

Statistical Methods For Single Cell Sequencing Data Analysis, Fei Qin

Theses and Dissertations

The recent emergence of single cell sequencing (SCS) technology has provided us with single-cell DNA or RNA sequencing (scDNA/RNA-seq) information to investigate cellular evolutionary relationships. Despite many analysis methods have been developed to infer intra-tumor genetic heterogeneity, cluster cellular subclones, detect genetic mutations, and investigate spatially variable (SV) genes, exploring SCS data remains statistically challenging due to its noisy nature.

To identify subclones with scDNA-seq data, many existing studies use an independent statistical model to detect copy number profile in the first step, followed by classical clustering methods for subclone identification in downstream analyses. However, spurious results might be generated …


A Bayesian Spatial Scan Statistic For Normal Data, Laasya Velamakanni Jul 2023

A Bayesian Spatial Scan Statistic For Normal Data, Laasya Velamakanni

Theses and Dissertations

Scan statistics are useful methods for detecting spatial clustering. While they were initially developed to detect regions with an excess of binomial or Poisson events, spatial scan statistics have been extended to detect hotspots in other types of data including continuous data. They have many applications in different fields such as epidemiology (e.g. detecting disease outbreaks), sociology (e.g. detecting crime hotspots), and environmental health (e.g. detecting high-pollution areas). Spatial scan statistics identify a ‘most likely cluster’ and then use a likelihood ratio test to determine if this cluster is statistically significant. Spatial scan statistics have been extended to the Bayesian …


Explorations In Baseball Analytics: Simulations, Predictions, And Evaluations For Games And Players, Katelyn Mongerson May 2023

Explorations In Baseball Analytics: Simulations, Predictions, And Evaluations For Games And Players, Katelyn Mongerson

Theses and Dissertations

From statistics being reported in newspapers in the 1840s, to present day, baseballhas always been one of the most data-driven sports. We make use of the endless publicly available baseball data to build models in R and Python that answer various baseball- related questions regarding predicting and optimizing run production, evaluating player effectiveness, and forecasting the postseason. To predict and optimize run production, we present three models. The first builds a common tool in baseball analysis called a Run Expectancy Matrix which is used to give a value (in terms of runs) to various in-game decisions. The second uses the …


Change Point Detection For A Process Having Several Regimes, Oliver Gerd Meister May 2023

Change Point Detection For A Process Having Several Regimes, Oliver Gerd Meister

Theses and Dissertations

In this dissertation, possible methods for multiple change point detection on Markovchain processes are studied. Related works for oine and online change point detection are discussed and their applicability on sequential multiple change point detection for several regimes is evaluated. We develop a method for a multiple change point detection for a process having three regimes. Its eciency is then evaluated on simulated Markov chain data by looking into dierent scenarios such as processes that signicantly dier between each other or probability distributions that are slightly similar. This approach is then applied on Covid- 19 hospital data. Therefore, the data …


A Machine Learning Approach To Evaluate The Effect Of Sodium-Glucose Cotransporter-2 Inhibitors On Chronic Kidney Disease In Diabetes Patients, Solomon Eshun May 2023

A Machine Learning Approach To Evaluate The Effect Of Sodium-Glucose Cotransporter-2 Inhibitors On Chronic Kidney Disease In Diabetes Patients, Solomon Eshun

Theses and Dissertations

Chronic kidney disease (CKD) is a significant complication that contributes to diabetes-related mortality in the United States, and there is growing evidence that sodium-glucose cotransporter 2 inhibitors (SGLT2i) can slow its progression. However, observational studies may suffer from confounding by indication, where patient characteristics and disease severity influence the decision to prescribe SGLT2i. This study utilized electronic health records of individuals with diabetes (from TriNetX) to investigate the effectiveness of SGLT2i on CKD progression. The database provided detailed information on patients’ CKD status, demographics, diagnosis, procedures, and medications, along with corresponding dates of diagnosis and prescription. The study comprised of …


A Machine Learning Approach To Obese-Inflammatory Phenotyping, Tania Mayleth Vargas May 2023

A Machine Learning Approach To Obese-Inflammatory Phenotyping, Tania Mayleth Vargas

Theses and Dissertations

Obesity is the accumulation of an abnormal, or excessive, amount of fat in the body, which can have negative effects on overall health. This excess accumulation of macronutrients in adipose tissue can cause the release of inflammatory mediators, leading to a proinflammatory state. Inflammation is a known risk factor for various health conditions, including cardiovascular diseases, metabolic syndrome, and diabetes. This study sought to examine the use of data mining methods, particularly clustering algorithms, to identify inflammatory biomarker phenotypes and their association with obesity in a local adolescent population. The algorithms evaluated in this study included: k-means, Ward's hierarchical …


Sparse Partitioned Empirical Bayes Ecm Algorithms For High-Dimensional Linear Mixed Effects And Heteroscedastic Regression, Anja Zgodic Apr 2023

Sparse Partitioned Empirical Bayes Ecm Algorithms For High-Dimensional Linear Mixed Effects And Heteroscedastic Regression, Anja Zgodic

Theses and Dissertations

Variable selection methods in both the frequentist and Bayesian frameworks are powerful techniques that provide prediction and inference in high-dimensional linear regression models. These methods often assume independence between observations and normally distributed errors with the same variance. In practice, these two assumptions are often violated. To mitigate this, we develop efficient and powerful Bayesian approaches for linear mixed modeling and heteroscedastic linear regression. These method offers increased flexibility through the development of empirical Bayes estimators for hyperparameters, with computationally efficient estimation through the Expectation Conditional-Minimization (ECM) algorithm. The novelty of these approaches lies in the partitioning and parameter expansion, …


Advancements In Parametric Modal Regression, Qingyang Liu Apr 2023

Advancements In Parametric Modal Regression, Qingyang Liu

Theses and Dissertations

This dissertation considers statistical inference methods for parametric modal regression models. In Chapter 1, we motivate the mode as the measure of central tendency instead of the median or the mean with an example. Following the motivational example, we include an overview of existing modal regression models. Later, in the same chapter, we explain advantages of the parametric modal regression models over existing nonparametric modal regression models. In Chapter 2, we address issues in statistical inference brought in by data contaminated with measurement error. With measurement error in covariates, statistical inference methods designed for modal regression models with error-free covariates …


Detecting Spatially Varying Coefficient Effects With Conditional Autoregressive Models: A Simulation Study Using Social Determinants Of Health Screening Data, Reid J. Demass Apr 2023

Detecting Spatially Varying Coefficient Effects With Conditional Autoregressive Models: A Simulation Study Using Social Determinants Of Health Screening Data, Reid J. Demass

Theses and Dissertations

Generalized linear models which include spatially varying coefficient terms allow researchers to determine if the association between predictor and outcome variables vary across geographic space. Such models are particularly applicable to research with public health data where interventions and limited health care resources must be allocated carefully. The integrated nested Laplace approximation (INLA) methodology available in the R INLA package is a popular tool to estimate spatially varying coefficients. To assess the performance of the estimation procedure, patient emergency department (ED) visits were simulated from data sourced from a pilot study at Prisma Health. The INLA technique was used to …


Bayesian Dependence Structure Analysis For Ordinal Data, Yang He Apr 2023

Bayesian Dependence Structure Analysis For Ordinal Data, Yang He

Theses and Dissertations

This dissertation explores different methods to study the dependence structure among many ordinal variables under the Bayesian framework.

Chapter 1 introduces ordinal data analysis methods, and the related literature works are briefly reviewed. An outline of the dissertation is put forward.

In Chapter 2, Gaussian copula graphical models with different priors of graphical Lasso, adaptive graphical Lasso, and spike-and-slab Lasso on the precision matrix are assessed and compared. The proposed models are well illustrated via simulations and a real ordinal survey data analysis.

In Chapter 3, adaptive spike-and-slab Lasso prior is proposed as an extension of Chapter 2. The developed …


Debris Survivability Study For Mega-Constellation Architectures, Joseph C. Canoy Mar 2023

Debris Survivability Study For Mega-Constellation Architectures, Joseph C. Canoy

Theses and Dissertations

The analysis for the overall theoretical debris survivabilty of mega-constellation architectures, with an emphasis on space-based ballistic missile defense constellation (SB-BMD), is explored via three extensive different Monte Carlo simulations: preliminary analysis of low Earth Orbit (LEO) mega-constellation survivabilty following a fragmentation event within the constellation, analysis of LEO mega-constellation survivability with a fragmentation event occurring on a satellite performing a maneuver to insert itself within the constellation, and the analysis of LEO mega-constellation survivabilty after a fragmentation event resulting from the destruction of a missile. The LEO mega-constellations represent the SB-BMD constellation. The first two analysis sections will include …


Probability Of Agreement As A Simulation Validation Methodology, Matthew C. Ledwith Mar 2023

Probability Of Agreement As A Simulation Validation Methodology, Matthew C. Ledwith

Theses and Dissertations

Determining whether a simulation model is operationally valid requires the rigorous assessment of agreement between observed functional responses of the simulation model and the corresponding real world system or process of interest. This research seeks to extend and formulate the probability of agreement approach to the operational validation of simulation models. The first paper provides a methodological approach and an initial demonstration which leverages bootstrapping to overcome situations where one’s ability to collect real-world data is limited. The second paper extends the probability of agreement approach to account for second-order heteroscedastic variability structures and establishes a weighted probability of agreement …


Examining Failures Of Kc-135 Boom Assemblies Using Survival Analysis, Benjamin D. Miller Mar 2023

Examining Failures Of Kc-135 Boom Assemblies Using Survival Analysis, Benjamin D. Miller

Theses and Dissertations

The purposes of this study are to confirm the applicability of survival analysis for predicting recurrent failures of a component of a military aircraft and to provide practical insights to maintenance managers and mission planners. The results of this study also can help the United States Department of Defense improve the CBM+ program. This study was able to predict recurrent failures of the component using Nelson-Aalen cumulative estimates. In addition, this study used a Cox proportional hazards regression model with shared frailty for measuring the effect of covariates on recurrent failures and unidentified heterogeneity in the model, which warranted future …


Examining Fuel Service System Failures Of The Usaf R11 Using Survival Analysis, Roed M.S. Mejia Mar 2023

Examining Fuel Service System Failures Of The Usaf R11 Using Survival Analysis, Roed M.S. Mejia

Theses and Dissertations

Recent events show that fuel supply is a large contributor to the success or failure of a military operation in response to a contingency. Any future near-peer conflict will stress the supply chain and require fully operational vehicles to be ready for the primary mission sets they support. In the United States Air Force (USAF), the readiness of fuel distribution trucks is crucial to meeting those mission sets in global operations. Utilizing non-parametric and semi-parametric survival models, which do not assume specific probability distributions, this study analyzes maintenance data for R-11 trucks that refuel aircraft.


Model-Based Imputation Of Below Detection Limit Missing Data And Group Selection In Bayesian Group Index Regression, Matthew Carli Jan 2023

Model-Based Imputation Of Below Detection Limit Missing Data And Group Selection In Bayesian Group Index Regression, Matthew Carli

Theses and Dissertations

Investigations into the association between chemical exposure and health outcomes are increasingly focused on the role of chemical mixtures, as opposed to individual chemicals. The analysis of chemical mixture data required the development of novel statistical methods, one of these being Bayesian group index regression. A statistical challenge common to all chemical mixture analyses is the ubiquitous presence of below detection limit (BDL) data. We propose an extension of Bayesian group index regression that treats both regression effects and missing BDL observations as parameters in a model estimated through a Markov Chain Monte Carlo algorithm that we refer to as …


Variability In Causal Effects On A Binary Outcome And Noncompliance In A Multisite Randomized Trial, Xinxin Sun Jan 2023

Variability In Causal Effects On A Binary Outcome And Noncompliance In A Multisite Randomized Trial, Xinxin Sun

Theses and Dissertations

Noncompliance to treatment assignment is widespread in randomized trials and presents challenges in causal inference. In the presence of noncompliance, the most commonly estimated effect of treatment assignment, also known as intent-to-treat (ITT) effect, is biased. Of interest in this setting is the complier average causal effect (CACE), the ITT effect among compliers. Further complication arises when the outcome variable is partially observed.

My research focuses on estimating the distribution of a site-specific CACE in a multisite randomized controlled trial (MRCT) by maximum likelihood (ML). Assuming compliance missing at random (MAR). We express the likelihood as an integral with respect …