Open Access. Powered by Scholars. Published by Universities.®

Statistical Methodology Commons

Open Access. Powered by Scholars. Published by Universities.®

PDF

2022

Discipline
Institution
Keyword
Publication
Publication Type

Articles 1 - 30 of 50

Full-Text Articles in Statistical Methodology

Examining The Relationship Between Stomiiform Fish Morphology And Their Ecological Traits, Mikayla L. Twiss Dec 2022

Examining The Relationship Between Stomiiform Fish Morphology And Their Ecological Traits, Mikayla L. Twiss

All HCAS Student Capstones, Theses, and Dissertations

Trait-based ecology characterizes individuals’ functional attributes to better understand and predict their interactions with other species and their environments. Utilizing morphological traits to describe functional groups has helped group species with similar ecological niches that are not necessarily taxonomically related. Within the deep-pelagic fishes, the Order Stomiiformes exhibits high morphological and species diversity, and many species undertake diel vertical migration (DVM). While the morphology and behavior of stomiiform fishes have been extensively studied and described through taxonomic assessments, the connection between their form and function regarding their DVM types, morphotypes, and daytime depth distributions is not well known. Here, three …


Statistical Methods For Modern Threats, Brandon Lumsden Dec 2022

Statistical Methods For Modern Threats, Brandon Lumsden

All Dissertations

More than ever before, technology is evolving at a rapid pace across the broad spectrum of biological sciences. As data collection becomes more precise, efficient, and standardized, a demand for appropriate statistical modeling grows as well. Throughout this dissertation, we examine a variety of new age data arising from modern technology of the 21st century. We begin by employing a suite of existing statistical techniques to address research questions surrounding three medical conditions presenting in public health sciences. Here we describe the techniques used, including generalized linear models and longitudinal models, and we summarize the significant associations identified between research …


Learning Graphical Models Of Multivariate Functional Data With Applications To Neuroimaging, Jiajing Niu Dec 2022

Learning Graphical Models Of Multivariate Functional Data With Applications To Neuroimaging, Jiajing Niu

All Dissertations

This dissertation investigates the functional graphical models that infer the functional connectivity based on neuroimaging data, which is noisy, high dimensional and has limited samples. The dissertation provides two recipes to infer the functional graphical model: 1) a fully Bayesian framework 2) an end-to-end deep model.

We first propose a fully Bayesian regularization scheme to estimate functional graphical models. We consider a direct Bayesian analog of the functional graphical lasso proposed by Qiao et al. (2019).. We then propose a regularization strategy via the graphical horseshoe. We compare both Bayesian approaches to the frequentist functional graphical lasso, and compare the …


Bayesian Methods For Graphical Models With Neighborhood Selection., Sagnik Bhadury Dec 2022

Bayesian Methods For Graphical Models With Neighborhood Selection., Sagnik Bhadury

Electronic Theses and Dissertations

Graphical models determine associations between variables through the notion of conditional independence. Gaussian graphical models are a widely used class of such models, where the relationships are formalized by non-null entries of the precision matrix. However, in high-dimensional cases, covariance estimates are typically unstable. Moreover, it is natural to expect only a few significant associations to be present in many realistic applications. This necessitates the injection of sparsity techniques into the estimation method. Classical frequentist methods, like GLASSO, use penalization techniques for this purpose. Fully Bayesian methods, on the contrary, are slow because they require iteratively sampling over a quadratic …


Statistical Methods For Meta-Analysis In Large-Scale Genomic Experiments, Wimarsha Thathsarani Jayanetti Dec 2022

Statistical Methods For Meta-Analysis In Large-Scale Genomic Experiments, Wimarsha Thathsarani Jayanetti

Mathematics & Statistics Theses & Dissertations

Recent developments in high throughput genomic assays have opened up the possibility of testing hundreds and thousands of genes simultaneously. With the availability of vast amounts of public databases, researchers tend to combine genomic analysis results from multiple studies in the form of a meta-analysis. Meta-analysis methods can be broadly classified into two main categories. The first approach is to combine the statistical significance (pvalues) of the genes from each individual study, and the second approach is to combine the statistical estimates (effect sizes) from the individual studies. In this dissertation, we will discuss how adherence to the standard null …


Functional Data Analysis Of Covid-19, Nichole L. Fluke Nov 2022

Functional Data Analysis Of Covid-19, Nichole L. Fluke

Mathematics & Statistics ETDs

This thesis deals with Functional Data Analysis (FDA) on COVID data. The Data involves counts for new COVID cases, hospitalized COVID patients, and new COVID deaths. The data used is for all the states and regions in the United States. The data starts in March 1st, 2020 and goes through March 31st, 2021. The FDA smooths the data and looks to see if there are similarities or differences between the states and regions in the data. The data also shows which states and regions stand out from the others and which ones are similar. Also shown …


Statistical Roles Of The G-Expectation Framework In Model Uncertainty: The Semi-G-Structure As A Stepping Stone, Yifan Li Oct 2022

Statistical Roles Of The G-Expectation Framework In Model Uncertainty: The Semi-G-Structure As A Stepping Stone, Yifan Li

Electronic Thesis and Dissertation Repository

The G-expectation framework is a generalization of the classical probability system based on the sublinear expectation to deal with phenomena that cannot be described by a single probabilistic model. These phenomena are closely related to the long-existing concern about model uncertainty in statistics. However, the distributions and independence in the G-framework are quite different from the classical setup. These distinctions bring difficulty when applying the idea of this framework to general statistical practice. Therefore, a fundamental and unavoidable problem is how to better understand G-version concepts from a statistical perspective.

To explore this problem, this thesis establishes a new substructure …


Regression-Based Methods For Dynamic Treatment Regimes With Mismeasured Covariates Or Misclassified Response, Dan Liu Sep 2022

Regression-Based Methods For Dynamic Treatment Regimes With Mismeasured Covariates Or Misclassified Response, Dan Liu

Electronic Thesis and Dissertation Repository

The statistical study of dynamic treatment regimes (DTRs) focuses on estimating sequential treatment decision rules tailored to patient-level information across multiple stages of intervention. Regression-based methods in DTR have been studied in the literature with a critical assumption that all the observed variables are precisely measured. However, this assumption is often violated in many applications. One example is the STAR*D study, in which the patient's depressive score is subject to measurement error. In this thesis, we explore problems in the context of DTR with measurement error or misclassification considered in the observed data.

The first project deals with covariate measurement …


Copulas, Maximal Dependence, And Anomaly Detection In Bi-Variate Time Series, Ning Sun Aug 2022

Copulas, Maximal Dependence, And Anomaly Detection In Bi-Variate Time Series, Ning Sun

Electronic Thesis and Dissertation Repository

This thesis focuses on discussing non-parametric estimators and their asymptotic behaviors for indices developed to characterize bi-variate time series. There are typically two types of indices depending on whether the distributional information is involved. For the indices containing the distributional information of the bivariate stationary time series, we particularly focus on the index called the tail order of maximal dependence (TOMD), which is an improvement of the tail order. For the indices without distributional information of the bivariate time series, we focus on an anomaly detection index for univariate input-output systems.

This thesis integrates three articles. The first article (Chapter …


Bias-Corrected Bagging In Active Learning With An Actuarial Application, Yangxuan Xu Aug 2022

Bias-Corrected Bagging In Active Learning With An Actuarial Application, Yangxuan Xu

Undergraduate Student Research Internships Conference

The variable annuity (VA) is a modern insurance product that offers certain guaranteed protection and tax-deferred treatment. Because of the inherent complexity of guarantees’ payoff, the closed-form solution of fair market values (FMVs) is often not available. Most insurance companies depend on Monte Carlo (MC) simulation to price the FMVs of these products, which is an extremely computational intensive and time-consuming approach. The metamodeling approach can be used to circumvent the heavy computation.

In the modeling stage, the bagged tree method has proved to outperform other parametric approaches. Also, a bias-corrected (BC) bagging model was tried and showed significant improvement …


The Q-Analogue Of The Extended Generalized Gamma Distribution, Wenhao Chen Aug 2022

The Q-Analogue Of The Extended Generalized Gamma Distribution, Wenhao Chen

Undergraduate Student Research Internships Conference

This project introduces a flexible univariate probability model referred to as the q-analogue of the Extended Generalized Gamma (or q-EGG) distribution, which encompasses the majority of the most frequently used continuous distributions, including the gamma, Weibull, logistic, type-1 and type-2 beta, Gaussian, Cauchy, Student-t and F. Closed form representations of its moments and cumulative distribution function are provided. Additionally, computational techniques are proposed for determining estimates of its parameters. Both the method of moments and the maximum likelihood approach are utilized. The effect of each parameter is also graphically illustrated. Certain data sets are modeled with q-EGG distributions; goodness of …


Dynamic Prediction For Alternating Recurrent Events Using A Semiparametric Joint Frailty Model, Jaehyeon Yun Aug 2022

Dynamic Prediction For Alternating Recurrent Events Using A Semiparametric Joint Frailty Model, Jaehyeon Yun

Statistical Science Theses and Dissertations

Alternating recurrent events data arise commonly in health research; examples include hospital admissions and discharges of diabetes patients; exacerbations and remissions of chronic bronchitis; and quitting and restarting smoking. Recent work has involved formulating and estimating joint models for the recurrent event times considering non-negligible event durations. However, prediction models for transition between recurrent events are lacking. We consider the development and evaluation of methods for predicting future events within these models. Specifically, we propose a tool for dynamically predicting transition between alternating recurrent events in real time. Under a flexible joint frailty model, we derive the predictive probability of …


To Logit Or Not To Logit Data In The Unit Interval: A Simulation Study, Kayode Idris Hamzat Aug 2022

To Logit Or Not To Logit Data In The Unit Interval: A Simulation Study, Kayode Idris Hamzat

Major Papers

In this paper, we recommend a mechanism for determining whether to logit or not to logit data in the unit interval which is based on quantile estimation of data between 0 and 1. By using a simulated dataset generated from a Beta regression model, the estimated quantile for this model perform better than those based on the linear quantile regression with logit transformation.

Further, we investigate the performance of the quantile regression estimators based on the LQR and we conclude that it is better than those based on the Beta regression when the distribution is contaminated with 10% uniform numbers …


Advanced High Dimensional Regression Techniques, Yuan Yang Aug 2022

Advanced High Dimensional Regression Techniques, Yuan Yang

All Dissertations

This dissertation focuses on developing high dimensional regression techniques to analyze large scale data using both Bayesian and frequentist approaches, motivated by data sets from various disciplines, such as public health and genetics. More specifically, Chapters 2 and Chapter 4 take a Bayesian approach to achieve modeling and parameter estimation simultaneously while Chapter 3 takes a frequentist approach. The main aspects of these techniques are that they perform variable selection and parameter estimation simultaneously, while also being easily adaptable to large-scale data. In particular, by embedding a logistic model into traditional spike and slab framework and selecting of proper prior …


New Developments On The Estimability And The Estimation Of Phase-Type Actuarial Models, Cong Nie Jul 2022

New Developments On The Estimability And The Estimation Of Phase-Type Actuarial Models, Cong Nie

Electronic Thesis and Dissertation Repository

This thesis studies the estimability and the estimation methods for two models based on Markov processes: the phase-type aging model (PTAM), which models the human aging process, and the discrete multivariate phase-type model (DMPTM), which can be used to model multivariate insurance claim processes.

The principal contributions of this thesis can be categorized into two areas. First, an objective measure of estimability is proposed to quantify estimability in the context of statistical models. Existing methods for assessing estimability require the subjective specification of thresholds, which potentially limits their usefulness. Unlike these methods, the proposed measure of estimability is objective. In …


Statistical Extensions Of Multi-Task Learning With Semiparametric Methods And Task Diagnostics, Nikolay Miller Jun 2022

Statistical Extensions Of Multi-Task Learning With Semiparametric Methods And Task Diagnostics, Nikolay Miller

Mathematics & Statistics ETDs

In this dissertation, I propose new approaches to multi-task learning, inspired by statistical model diagnostics and semiparametric and additive modeling. The newly designed additive multi-task model framework allows for flexible estimation of multi-task parametric and nonparametric effects by using an extension of the backfitting algorithm. Further, I propose new methods for statistical task diagnostics, which allow for the identification and remedy of outlier tasks, based on task-specific performance metrics and their empirical distributions. I perform a deep examination of the well-established multi-task kernel method and achieve theoretical and experimental contributions. Lastly, I propose a two-step modeling approach to multi-task modeling, …


A Bayesian Programming Approach To Car-Following Model Calibration And Validation Using Limited Data, Franklin Abodo Jun 2022

A Bayesian Programming Approach To Car-Following Model Calibration And Validation Using Limited Data, Franklin Abodo

FIU Electronic Theses and Dissertations

Traffic simulation software is used by transportation researchers and engineers to design and evaluate changes to roadway networks. Underlying these simulators are mathematical models of microscopic driver behavior from which macroscopic measures of flow and congestion can be recovered. Many models are intended to apply to only a subset of possible traffic scenarios and roadway configurations, while others do not have any explicit constraint on their applicability. Work zones on highways are one scenario for which no model invented to date has been shown to accurately reproduce realistic driving behavior. This makes it difficult to optimize for safety and other …


The Short-Term Effects Of Fine Airborne Particulate Matter And Climate On Covid-19 Disease Dynamics, El Hussain Shamsa, Kezhong Zhang Jun 2022

The Short-Term Effects Of Fine Airborne Particulate Matter And Climate On Covid-19 Disease Dynamics, El Hussain Shamsa, Kezhong Zhang

Medical Student Research Symposium

Background: Despite more than 60% of the United States population being fully vaccinated, COVID-19 cases continue to spike in a temporal pattern. These patterns in COVID-19 incidence and mortality may be linked to short-term changes in environmental factors.

Methods: Nationwide, county-wise measurements for COVID-19 cases and deaths, fine-airborne particulate matter (PM2.5), and maximum temperature were obtained from March 20, 2020 to March 20, 2021. Multivariate Linear Regression was used to analyze the association between environmental factors and COVID-19 incidence and mortality rates in each season. Negative Binomial Regression was used to analyze daily fluctuations of COVID-19 cases …


Adjusting Community Survey Data Benchmarks For External Factors, Allen Miller, Nicole M. Norelli, Robert Slater, Mingyang N. Yu Jun 2022

Adjusting Community Survey Data Benchmarks For External Factors, Allen Miller, Nicole M. Norelli, Robert Slater, Mingyang N. Yu

SMU Data Science Review

Abstract. Using U.S. resident survey data from the National Community Survey in combination with public data from the U.S. Census and additional sources, a Voting Regressor Model was developed to establish fair benchmark values for city performance. These benchmarks were adjusted for characteristics the city cannot easily influence that contribute to confidence in local government, such as population size, demographics, and income. This adjustment allows for a more meaningful comparison and interpretation of survey results among individual cities. Methods explored for the benchmark adjustment included cluster analysis, anomaly detection, and a variety of regression techniques, including random forest, ridge, decision …


Investigation Of The Association Of Exposures To Fire-Related Hazards With Pulmonary Function Of Firefighters, David G. Goldfarb Jun 2022

Investigation Of The Association Of Exposures To Fire-Related Hazards With Pulmonary Function Of Firefighters, David G. Goldfarb

Dissertations and Theses

Background. Firefighters are habitually exposed to hazardous toxicants which place them at an elevated risk for numerous adverse health outcomes. An example of this is the associations observed in other works between inhalation of combustion byproducts from urban structural fires and both acute and chronic pulmonary dysfunction. To-date, the characterization of firefighters’ exposures to dangerous chemicals in smoke from non-wildfire incidents, both directly through personal monitoring and indirectly from work-related records is scarce. Prior works investigating the association between routine firefighting and pulmonary function have relied on crude metrics such as years of service and numbers of responses to …


Statistical Modeling Of Longitudinal Medical Cost Data, Shikun Wang Jun 2022

Statistical Modeling Of Longitudinal Medical Cost Data, Shikun Wang

Dissertations & Theses (Open Access)

Projecting the future cancer care cost is critical in health economics research and policy making. An indispensable step is to estimate cost trajectories from an incident cohort of cancer patients using longitudinal medical cost data, accounting for terminal events such as death, and right censoring due to loss of follow-up. Since the cost of cancer care and survival are correlated, a scientifically meaningful quantity for inference in this context is the mean cost trajectory conditional on survival. Many standard approaches for longitudinal and survival analysis are not valid for the problem. The research for my Ph.D. dissertation consists of three …


Attempting To Predict The Unpredictable: March Madness, Coleton Kanzmeier May 2022

Attempting To Predict The Unpredictable: March Madness, Coleton Kanzmeier

Theses/Capstones/Creative Projects

Each year, millions upon millions of individuals fill out at least one if not hundreds of March Madness brackets. People test their luck every year, whether for fun, with friends or family, or to even win some money. Some people rely on their basketball knowledge whereas others know it is called March Madness for a reason and take a shot in the dark. Others have even tried using statistics to give them an edge. I intend to follow a similar approach, using statistics to my advantage. The end goal is to predict this year’s, 2022, March Madness bracket. To achieve …


Assessing The Influence Of Health Policy And Population Mobility On Covid-19 Spread In Arkansas, Tayden Barretto May 2022

Assessing The Influence Of Health Policy And Population Mobility On Covid-19 Spread In Arkansas, Tayden Barretto

Industrial Engineering Undergraduate Honors Theses

The outbreak of COVID-19 has created a major crisis across the world since its start in 2019, and its influence on every realm of society is undeniable. Globally, more than 500 million cases have been recorded since March 2020, with almost 6 million deaths. In the wake of this crisis, many governments and health organizations have taken steps and precautions to mitigate its spread. These steps involve public mandates of information, reducing frequency of personal contact, and use of masks to minimize the risk of transmission. Current access to mobility data released from Google detailing population movements has provided a …


Advancements In Gaussian Process Learning For Uncertainty Quantification, John C. Nicholson May 2022

Advancements In Gaussian Process Learning For Uncertainty Quantification, John C. Nicholson

All Dissertations

Gaussian processes are among the most useful tools in modeling continuous processes in machine learning and statistics. The research presented provides advancements in uncertainty quantification using Gaussian processes from two distinct perspectives. The first provides a more fundamental means of constructing Gaussian processes which take on arbitrary linear operator constraints in much more general framework than its predecessors, and the other from the perspective of calibration of state-aware parameters in computer models. If the value of a process is known at a finite collection of points, one may use Gaussian processes to construct a surface which interpolates these values to …


Aberrant Responding With Underlying Dominance And Unfolding Response Processes: Examining Model Fit And Performance Of Person-Fit Statistics, Jennifer A. Reimers May 2022

Aberrant Responding With Underlying Dominance And Unfolding Response Processes: Examining Model Fit And Performance Of Person-Fit Statistics, Jennifer A. Reimers

Graduate Theses and Dissertations

Researchers have recognized that respondents may not answer items in a way that accurately reflects their attitude or trait level being measured. The resulting response data that deviates from what would be expected has been shown to have significant effects on the psychometric properties of a scale and analytical results. However, many studies that have investigated the detection of aberrant data and its effects have done so using dominance item response theory (IRT) models. It is unknown whether the impacts of aberrant data and the methodology used to identify aberrant responding when using dominance IRT models apply similarly when scales …


A Statistical Study Into The Relationship Between The Student Age And Their Academic Performance, Umangkumar Patel, Akeisha Belgrave Apr 2022

A Statistical Study Into The Relationship Between The Student Age And Their Academic Performance, Umangkumar Patel, Akeisha Belgrave

Harrisburg University Research Symposium: Highlighting Research, Innovation, & Creativity

This project will conduct a research in order to find out the relationship between the age of a student and their academic performance. This project will survey at least 100 students. (Class Project)


Relationship Between Higher Education And Health Insurance Coverage And Healthcare Utilization For Black Or African American Individuals, Emma Revoir Apr 2022

Relationship Between Higher Education And Health Insurance Coverage And Healthcare Utilization For Black Or African American Individuals, Emma Revoir

Psychology Student Works

Higher education may decrease mortality due to higher income and health insurance availability (Buckles et al., 2016). Education can increase understanding and utilizing health insurance (Gallo et al., 2020). There is a gap in how this relationship affects racial minorities. This study aims to understand how health insurance is affected by education level for Black or African American individuals.


The Association Between Sibling Relationships And Personality, Rebecca Ramsey Apr 2022

The Association Between Sibling Relationships And Personality, Rebecca Ramsey

Psychology Student Works

Differences in self perception (specifically aggression) was discovered in participants with siblings, and no difference of intelligence, outgoing personalities, creativity, competitiveness and family orientation (Van Volkom, Guerguis, & Kramer, 2017). Sisters were found to have more empathy than the brothers along with higher intimacy, knowledge, and emotional support (Walęcka-Matyja, 2017). There is a gap in my research between sibling jealousy and depression, sibling negativity, and intimacy in young adulthood (Hamwey & Whiteman, 2020). My study examines the similarity of personality to siblings. Some personality traits may include jealousy, outgoingness, competitiveness, and creativity.


How Quality Of Paternal Relationship Impacts Depression Development In Adulthood, Rachel Lane Apr 2022

How Quality Of Paternal Relationship Impacts Depression Development In Adulthood, Rachel Lane

Psychology Student Works

Previous studies show that paternal involvement directly correlates to a decrease in the child’s probability of developing a mental illness (O’Gara, et.al. 2019). Research shows, the more supportive and the more affection a parent gives, the less likely a child is to develop depressive tendencies (Del Barrio, et. al. 2016) There is a gap in investigating whether or not having a loving father postpones the age at which someone is diagnosed with depression. This study provides insight into paternal closeness and the age at which someone is diagnosed with depression.


The Association Between Promiscuity And Marital Satisfaction, Shelby Kate Christopher Apr 2022

The Association Between Promiscuity And Marital Satisfaction, Shelby Kate Christopher

Psychology Student Works

Research has suggested that premarital cohabitation is linked to a heightened risk of divorce and dissatisfaction if it occurs with someone other than the marriage partner (Teachman, 2004). The number of partners is linked to marital dissatisfaction for both men and women (Legkauskas & Stankevičienė, 2008). Involvement in church groups is linked to a 5x increase in likelihood of abstinence (Paul et al, 2000). Previous studies have used small sample sizes with little diversity and few men. However, my research will use a large, diverse sample size with both men and women.