Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability Commons

Open Access. Powered by Scholars. Published by Universities.®

Theses/Dissertations

2022

Discipline
Institution
Keyword
Publication

Articles 1 - 30 of 268

Full-Text Articles in Statistics and Probability

Application Of Distributed Fiber-Optic Sensing For Pressure Predictions And Multiphase Flow Characterization, Gerald Kelechi Ekechukwu Dec 2022

Application Of Distributed Fiber-Optic Sensing For Pressure Predictions And Multiphase Flow Characterization, Gerald Kelechi Ekechukwu

LSU Doctoral Dissertations

In the oil and gas industry, distributed fiber optics sensing (DFOS) has the potential to revolutionize well and reservoir surveillance applications. Using fiber optic sensors is becoming increasingly common because of its chemically passive and non-magnetic interference properties, the possibility of flexible installations that could be behind the casing, on the tubing, or run on wireline, as well as the potential for densely distributed measurements along the entire length of the fiber. The main objectives of my research are to develop and demonstrate novel signal processing and machine learning computational techniques and workflows on DFOS data for a variety of …


Regression Modeling Of Complex Survival Data Based On Pseudo-Observations, Rong Rong Dec 2022

Regression Modeling Of Complex Survival Data Based On Pseudo-Observations, Rong Rong

Statistical Science Theses and Dissertations

The restricted mean survival time (RMST) is a clinically meaningful summary measure in studies with survival outcomes. Statistical methods have been developed for regression analysis of RMST to investigate impacts of covariates on RMST, which is a useful alternative to the Cox regression analysis. However, existing methods for regression modeling of RMST are not applicable to left-truncated right-censored data that arise frequently in prevalent cohort studies, for which the sampling bias due to left truncation and informative censoring induced by the prevalent sampling scheme must be properly addressed. Meanwhile, statistical methods have been developed for regression modeling of the cumulative …


Mitigation Impact Of Statewide Non-Pharmaceutical Policies On Covid-19: An Application Of Infectious Disease Transmission Model And Partially Observed Markov Process To New Mexico, Xingya Ma Dec 2022

Mitigation Impact Of Statewide Non-Pharmaceutical Policies On Covid-19: An Application Of Infectious Disease Transmission Model And Partially Observed Markov Process To New Mexico, Xingya Ma

Mathematics & Statistics ETDs

This thesis is an application of epidemiological models for infectious disease transmission and the use of partially observed Markov process (POMP) for model fitting. It focuses on COVID-19 pandemic in the state of New Mexico. The analysis covered March 2020 to June 2021. Daily data of COVID19 cases and deaths and a daily index of eleven statewide government non-pharmaceutical intervention (NPI) policies were collected from six public sources and were validated. These data were integrated through the Susceptible-Exposed-Infected-Removed (SEIR) model. Estimated daily transmission rates between the model compartments quantify the impact of the mitigation policies, and show that transmission rates …


Dealing With Dimensionality: Problems And Techniques In High-Dimensional Statistics, Cezareo Rodriguez Dec 2022

Dealing With Dimensionality: Problems And Techniques In High-Dimensional Statistics, Cezareo Rodriguez

Arts & Sciences Electronic Theses and Dissertations

In modern data analysis, problems involving high dimensional data with more variables than subjects is increasingly common. Two such cases are mediation analysis and distributed optimization. In Chapter 2 we start with an overview of high dimensional statistics and mediation analysis. In Chapter 3 we motivate and prove properties for a new marginal screening procedure for performing high dimensional mediation analysis. This screening procedure is shown via simulation to perform better than benchmark approaches and is applied to a DNA methylation study. In Chapter 4 we construct a cryptosystem that accurately performs distributed penalized quantile regression in the high-dimensional setting …


Kernel Estimation Of Spot Volatility And Its Application In Volatility Functional Estimation, Bei Wu Dec 2022

Kernel Estimation Of Spot Volatility And Its Application In Volatility Functional Estimation, Bei Wu

Arts & Sciences Electronic Theses and Dissertations

It\^o semimartingale models for the dynamics of asset returns have been widely studied in financial econometrics. A key component of the model, spot volatility, plays a crucial role in option pricing, portfolio management, and financial risk assessment. In this dissertation, we consider three problems related to the estimation of spot volatility using high-frequency asset returns. We first revisit the problem of estimating the spot volatility of an It\^o semimartingale using a kernel estimator. We prove a Central Limit Theorem with an optimal convergence rate for a general two-sided kernel under quite mild assumptions, which includes leverage effects and jumps of …


Contribution To Data Science: Time Series, Uncertainty Quantification And Applications, Dhrubajyoti Ghosh Dec 2022

Contribution To Data Science: Time Series, Uncertainty Quantification And Applications, Dhrubajyoti Ghosh

Arts & Sciences Electronic Theses and Dissertations

Time series analysis is an essential tool in modern world statistical analysis, with a myriad of real data problems having temporal components that need to be studied to gain a better understanding of the temporal dependence structure in the data. For example, in the stock market, it is of significant importance to identify the ups and downs of the stock prices, for which time series analysis is crucial. Most of the existing literature on time series deals with linear time series, or with Gaussianity assumption. However, there are multiple instances where the time series shows nonlinear trends, or when the …


Dynamics Of Redox-Driven Molecular Processes In Local And Systemic Plant Immunity, Philip Berg Dec 2022

Dynamics Of Redox-Driven Molecular Processes In Local And Systemic Plant Immunity, Philip Berg

Theses and Dissertations

The work here presents two main parts. In the first part, chapters 1 – 3 focus on dynamical systems modeling in plant immunity, whereas chapters 4 – 6 describe contributions to computational modeling and analysis of proteomics and genomics data. Chapter 1 investigates dynamical and biochemical patterns of reversibly oxidized cysteines (RevOxCys) during effector-triggered immunity (ETI) in Arabidopsis, examines the regulatory patterns associated with Arabidopsis thimet oligopeptidase 1 and 2’s (TOP1 and TOP2), roles in the RevOxCys events during ETI, and analyzes the redox phenotype of the top1top2 mutant. The second chapter investigates the peptidome dynamics during ETI …


Examining The Relationship Between Stomiiform Fish Morphology And Their Ecological Traits, Mikayla L. Twiss Dec 2022

Examining The Relationship Between Stomiiform Fish Morphology And Their Ecological Traits, Mikayla L. Twiss

All HCAS Student Capstones, Theses, and Dissertations

Trait-based ecology characterizes individuals’ functional attributes to better understand and predict their interactions with other species and their environments. Utilizing morphological traits to describe functional groups has helped group species with similar ecological niches that are not necessarily taxonomically related. Within the deep-pelagic fishes, the Order Stomiiformes exhibits high morphological and species diversity, and many species undertake diel vertical migration (DVM). While the morphology and behavior of stomiiform fishes have been extensively studied and described through taxonomic assessments, the connection between their form and function regarding their DVM types, morphotypes, and daytime depth distributions is not well known. Here, three …


Active Vs Passive Investing, Garret Buchheit Dec 2022

Active Vs Passive Investing, Garret Buchheit

Honors Theses

With the increased popularity of passive investing, the long-term investment success of active management is being questioned more frequently. For this reason, this research seeks to find whether actively managed funds produce sufficient returns that cover the fees and management costs associated with them. A comparative analysis was made with 5401 actively managed U.S. mutual funds and several common market indices over three, five, and ten-year time spans ranging from 2012 to 2021. Additionally, an analysis was made comparing active and passive management in the recessionary period of 2007 to 2009. Finally, analysis was conducted on annual holdings turnover rates …


Towards Structured Planning And Learning At The State Fisheries Agency Scale, Caleb A. Aldridge Dec 2022

Towards Structured Planning And Learning At The State Fisheries Agency Scale, Caleb A. Aldridge

Theses and Dissertations

Inland recreational fisheries has grown philosophically and scientifically to consider economic and sociopolitical aspects (non-biological) in addition to the biological. However, integrating biological and non-biological aspects of inland fisheries has been challenging. Thus, an opportunity exists to develop approaches and tools which operationalize planning and decision-making processes which include biological and non-biological aspects of a fishery. This dissertation expands the idea that a core set of goals and objectives is shared among and within inland fisheries agencies; that many routine operations of inland fisheries managers can be regimented or standardized; and the novel concept that current information and operations can …


Larval Ecology Of Atlantic Bluefin Tuna (Thunnus Thynnus): New Insights From Otolith Microstructure, Biotic, And Abiotic Analyses From The Gulf Of Mexico And Mediterranean Sea, Estrella Malca Dec 2022

Larval Ecology Of Atlantic Bluefin Tuna (Thunnus Thynnus): New Insights From Otolith Microstructure, Biotic, And Abiotic Analyses From The Gulf Of Mexico And Mediterranean Sea, Estrella Malca

All HCAS Student Capstones, Theses, and Dissertations

Atlantic bluefin tuna (ABT), Thunnus thynnus, spawn in the Gulf of Mexico (GoM) and the Mediterranean Sea (MED). Spawning occurs within narrow temporal and environmental parameters. Efforts to characterize growth of ABT in wild conditions revealed a wide range of growth variability during the early life stages. This series of studies examined potential biotic and abiotic influences of larval growth from seven ABT cohorts, and identified several key drivers of growth for this commercially valuable species. A detailed investigation of larval dynamics using otolith microstructure was conducted as follows. First, companion growth curves and stable isotope analysis from the same …


Evaluation Of Effect Of Preprocessing Algorithms On Resting State Fmri Data, Hortencia Josefina Hernandez Dec 2022

Evaluation Of Effect Of Preprocessing Algorithms On Resting State Fmri Data, Hortencia Josefina Hernandez

Open Access Theses & Dissertations

Graph theory modeling is a common modeling approach in neurobiology research studies. These models are useful since they describe patterns of connection for regions of interest in the brain using resting state fMRI images. The standard rule of thumb is to threshold the observed activation levels prior to model building. It is reasonable to assume that the use of this threshold affects the statistical distribution of commonly reported centrality metrics from the graph theory model, such as degree, betweenness, and closeness. In this study we examine the differential effect of using the standard approaches versus alternative direct thresholds and incorporation …


Mle And Eap Methods For Estimating Ability Scores For Data Of Varying Sample Size And Item Length, Sahar Taji Dec 2022

Mle And Eap Methods For Estimating Ability Scores For Data Of Varying Sample Size And Item Length, Sahar Taji

Graduate Theses and Dissertations

In this research, the performance of two popular estimators, Maximum Likelihood Estimator(MLE) and Bayesian Expected a Posteriori (EAP) is studied and compared in estimating the latent ability score in an Item Response Theory (IRT) model. The 2-Parameter Logistic (2PL) IRT model which is characterized by difficulty and discrimination item parameters is used to estimate the latent ability scores. Several datasets are generated for variety of sample size and item length values. The Monte-Carlo simulation is used to analyze the performance of the estimators. Results show that MLE produces reliable results with low root mean square error (RMSE) across all datasets. …


Power Approximations For Generalized Linear Mixed Models In R Using Steep Priors On Variance Components, Sydney Geisler Dec 2022

Power Approximations For Generalized Linear Mixed Models In R Using Steep Priors On Variance Components, Sydney Geisler

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

When designing an experiment, researchers often want to know how likely they are to detect statistically significant effects in the resulting data, i.e., they want to estimate their statistical power. The probability distribution method is a flexible way to do this, and it is currently implemented in the statistical software package SAS. This method requires a hypothetical data set (showing the magnitude of hypothesized effects) and constant values of variance components, which are critical elements of the statistical models used. The statistical software package R is increasingly popular, but the probability distribution method has not yet been implemented in R, …


Statistical Challenges And Methods For Missing And Imbalanced Data, Rose Adjei Dec 2022

Statistical Challenges And Methods For Missing And Imbalanced Data, Rose Adjei

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Missing data remains a prevalent issue in every area of research. The impact of missing data, if not carefully handled, can be detrimental to any statistical analysis. Some statistical challenges associated with missing data include, loss of information, reduced statistical power and non-generalizability of findings in a study. It is therefore crucial that researchers pay close and particular attention when dealing with missing data. This multi-paper dissertation provides insight into missing data across different fields of study and addresses some of the above mentioned challenges of missing data through simulation studies and application to real datasets. The first paper of …


Bayesian Adaptive Clinical Trial Design, Mengyi Lu Dec 2022

Bayesian Adaptive Clinical Trial Design, Mengyi Lu

Dissertations & Theses (Open Access)

The landscape of drug development in oncology has changed from conventional chemotherapies to molecular targeted therapies and immunotherapies, which provide innovative therapeutic modalities for treating cancers. These novel therapeutic agents work through mechanisms that fundamentally differ from standard chemotherapeutic agents, making the conventional trial design paradigm inefficient and dysfunctional. Specifically, the focus of dose-finding trials has shifted from finding the maximum tolerated dose (MTD) to the optimal biological dose (OBD), defined as the dose that optimizes the risk–benefit tradeoff. How to accurately identify the OBD and its dosing schedule is of great importance to maximize efficacy and safety of targeted …


Bayesian Methods For Graphical Models With Neighborhood Selection., Sagnik Bhadury Dec 2022

Bayesian Methods For Graphical Models With Neighborhood Selection., Sagnik Bhadury

Electronic Theses and Dissertations

Graphical models determine associations between variables through the notion of conditional independence. Gaussian graphical models are a widely used class of such models, where the relationships are formalized by non-null entries of the precision matrix. However, in high-dimensional cases, covariance estimates are typically unstable. Moreover, it is natural to expect only a few significant associations to be present in many realistic applications. This necessitates the injection of sparsity techniques into the estimation method. Classical frequentist methods, like GLASSO, use penalization techniques for this purpose. Fully Bayesian methods, on the contrary, are slow because they require iteratively sampling over a quadratic …


Evaluating The Health Impacts Of Fruit And Vegetable Intake At The Individual Level And Food Pantry Level Among Food Pantry Users, Jiacheng Chen Dec 2022

Evaluating The Health Impacts Of Fruit And Vegetable Intake At The Individual Level And Food Pantry Level Among Food Pantry Users, Jiacheng Chen

Legacy Theses & Dissertations (2009 - 2024)

Background: Chronic diseases impose heavy burdens on individuals and the healthcare system in the US. Many factors were found to be associated with chronic diseases, including demographics, family history, social environmental factors, and individual behavioral factors such as diet and physical activity. Among those factors, fruit and vegetable intake can have substantial health impacts via a variety of causal pathways. Fruit and vegetable (F&V) consumption is generally lower among individuals living in households experiencing food insecurity and rely on food assistance programs. Decreased F&V intake among food pantry users may negatively impact health. However, conducting quantitative analysis on this population …


Metabolic Alterations And Cardiovascular Risk After Hepatitis C Cure In Subjects With Or At Risk For Hiv, Christophe Maxime Fokoua Dongmo Dec 2022

Metabolic Alterations And Cardiovascular Risk After Hepatitis C Cure In Subjects With Or At Risk For Hiv, Christophe Maxime Fokoua Dongmo

Legacy Theses & Dissertations (2009 - 2024)

Background. Hepatitis C virus (HCV) infection engenders substantial metabolic changes. These changes are altered when the virus is cleared after successful treatment. We measured these metabolic alterations that occur after HCV cure; further, we assessed whether these alterations differed in subgroups defined by patients’ characteristics.


Green On The Map - The Influence Of Conservation Easements On The Naturalness Of Landscapes In The United States, Nakisha Fouch Dec 2022

Green On The Map - The Influence Of Conservation Easements On The Naturalness Of Landscapes In The United States, Nakisha Fouch

All Dissertations

Large protected areas have long been the cornerstone of conservation biology, however, in an era branded by the human dominance of ecosystems, regional landscape structure and function are often a consequence of accumulated land-use decisions that may or may not include a nod to conservation planning. With underrepresentation of habitats in publicly protected areas, attention has focused on the function of alternative land conservation mechanisms. Private conservation easements (CEs) have proliferated in the United States, yet assessing landscape-level function is confounded by holder and donor intent, national and regional policy, regional landscape contexts, varying extents, resolution, and temporal scale. Over …


Learning Graphical Models Of Multivariate Functional Data With Applications To Neuroimaging, Jiajing Niu Dec 2022

Learning Graphical Models Of Multivariate Functional Data With Applications To Neuroimaging, Jiajing Niu

All Dissertations

This dissertation investigates the functional graphical models that infer the functional connectivity based on neuroimaging data, which is noisy, high dimensional and has limited samples. The dissertation provides two recipes to infer the functional graphical model: 1) a fully Bayesian framework 2) an end-to-end deep model.

We first propose a fully Bayesian regularization scheme to estimate functional graphical models. We consider a direct Bayesian analog of the functional graphical lasso proposed by Qiao et al. (2019).. We then propose a regularization strategy via the graphical horseshoe. We compare both Bayesian approaches to the frequentist functional graphical lasso, and compare the …


Lindley Processes With Correlated Changes, John Grant Dec 2022

Lindley Processes With Correlated Changes, John Grant

All Dissertations

This dissertation studies a Lindley random walk model when the increment process driving the walk is strictly stationary. Lindley random walks govern customer waiting times in many queueing models and several natural and business processes, including snow depths, frozen soil depths, inventory quantities, etc. Probabilistic properties of a Lindley process with time-correlated stationary changes are explored. We provide a streamlined argument that the process admits a limiting stationary distribution when the mean of the incremental changes is negative and that the Lindley process is strictly stationary when starting from this stationary distribution. The Markov characteristics of the process are explored …


Weather Parameters Influencing The Incidence Of Citrus Canker Caused By Aw Strain In The Rio Grande Valley, Amit Sharma Dec 2022

Weather Parameters Influencing The Incidence Of Citrus Canker Caused By Aw Strain In The Rio Grande Valley, Amit Sharma

Theses and Dissertations

Citrus canker caused by bacterium Xanthomonas citri subsp. citri (Xcc) seriously affects the citrus industry by making the fruit unmarketable due to unsightly lesions on the fruit. Canker caused by Aw strain of Xcc was reported in the citrus trees located in the residential areas of the Rio Grande Valley (RGV). Canker severity differs amongst cultivars/varieties, and it is influenced by prevailing environmental conditions. Multiple regression modeling of the disease incidence with the environmental variables such as temperature, humidity, windspeed, wind gust, and rainfall was performed to understand the environmental conditions that are favorable for spread of citrus …


Estimation Of The Parameters In A Mixture Of Two Normal Distributions And The Generalized Pivotal Quantity Method, Md Faruk Hossain Dec 2022

Estimation Of The Parameters In A Mixture Of Two Normal Distributions And The Generalized Pivotal Quantity Method, Md Faruk Hossain

UNLV Theses, Dissertations, Professional Papers, and Capstones

A pivotal quantity is a random variable that is a function of both the random data and the unknown population parameters and whose probability distribution does not depend on any of the unknown parameters. The population parameters here may include nuisance parameters. Historically, pivotal quantities have been used for the construction of test statistics for hypothesis testing of some of these unknown parameters. They have also been used for the construction of confidence intervals for some of these parameters.Generalized pivotal quantities (GPQ) were introduced by Tsui and Weerahandi (1989) and Weerahandi (1993). A GPQ is a function, not only of …


Statistical Methods For Modern Threats, Brandon Lumsden Dec 2022

Statistical Methods For Modern Threats, Brandon Lumsden

All Dissertations

More than ever before, technology is evolving at a rapid pace across the broad spectrum of biological sciences. As data collection becomes more precise, efficient, and standardized, a demand for appropriate statistical modeling grows as well. Throughout this dissertation, we examine a variety of new age data arising from modern technology of the 21st century. We begin by employing a suite of existing statistical techniques to address research questions surrounding three medical conditions presenting in public health sciences. Here we describe the techniques used, including generalized linear models and longitudinal models, and we summarize the significant associations identified between research …


Retrospective Varying Coefficient Association Analysis Of Longitudinal Binary Traits, Gang Xu Dec 2022

Retrospective Varying Coefficient Association Analysis Of Longitudinal Binary Traits, Gang Xu

UNLV Theses, Dissertations, Professional Papers, and Capstones

Many genetic studies contain rich information on longitudinal phenotypes that require powerful analytical tools for optimal analysis. Genetic analysis of longitudinal data that incorporates temporal variation is important for understanding the genetic architecture and biological variation of complex diseases. Most of the existing methods assume that the contribution of genetic variants is constant over time and fails to capture the dynamic pattern of disease progression. However, the relative influence of genetic variants on complex traits fluctuates over time.We developed several tests to fill the gap of analyzing time-varying genetic effects in longitudinal GWAS for binary traits. First, we propose a …


Use Of Healthcare Utilization Records For Analyzing Trends In Clinical Toxoplasmosis: A Comparison Of Nevada And The United States, Elijah Kreutzer Dec 2022

Use Of Healthcare Utilization Records For Analyzing Trends In Clinical Toxoplasmosis: A Comparison Of Nevada And The United States, Elijah Kreutzer

UNLV Theses, Dissertations, Professional Papers, and Capstones

Toxoplasmosis, a zoonotic disease caused by the parasitic protist Toxoplasma gondii, is a ubiquitous, global public health concern with a wide variety of clinical manifestations. Surveillance for the disease is lacking even in developed countries, and what surveillance is present most often focuses on pregnant women. This research investigated trends in clinical toxoplasmosis in Nevada and nationally to address the lack of knowledge concerning how Nevada discharges compare to national discharges in cases of toxoplasmosis. Specifically, this research sought to determine what characterizes toxoplasmosis in Nevada across inpatient, outpatient, and emergency department settings, as well as how these cases differ …


Statistical Methods For Meta-Analysis In Large-Scale Genomic Experiments, Wimarsha Thathsarani Jayanetti Dec 2022

Statistical Methods For Meta-Analysis In Large-Scale Genomic Experiments, Wimarsha Thathsarani Jayanetti

Mathematics & Statistics Theses & Dissertations

Recent developments in high throughput genomic assays have opened up the possibility of testing hundreds and thousands of genes simultaneously. With the availability of vast amounts of public databases, researchers tend to combine genomic analysis results from multiple studies in the form of a meta-analysis. Meta-analysis methods can be broadly classified into two main categories. The first approach is to combine the statistical significance (pvalues) of the genes from each individual study, and the second approach is to combine the statistical estimates (effect sizes) from the individual studies. In this dissertation, we will discuss how adherence to the standard null …


Convexity Of Regularized Optimal Transport Dissimilarity Measures For Signed Signals, Christian P. Fowler Nov 2022

Convexity Of Regularized Optimal Transport Dissimilarity Measures For Signed Signals, Christian P. Fowler

Mathematics & Statistics ETDs

Debiased Sinkhorn divergence (DS divergence) is a distance function of

regularized optimal transport that measures the dissimilarity between two

probability measures of optimal transport. This thesis analyzes the advantages of

using DS divergence when compared to the more computationally expensive

Wasserstein distance as well as the classical Euclidean norm. Specifically, theory

and numerical experiments are used to show that Debiased Sinkhorn divergence

has geometrically desirable properties such as maintained convexity after data

normalization. Data normalization is often needed to calculate Sinkhorn

divergence as well as Wasserstein distance, as these formulas only accept

probability distributions as inputs and do not directly …


Statistical Methods For Differential Gene Expression Analysis Under The Case-Cohort Design, Lidong Wang Nov 2022

Statistical Methods For Differential Gene Expression Analysis Under The Case-Cohort Design, Lidong Wang

Mathematics & Statistics ETDs

Differential gene expression analysis has the potential to discover candidate biomarkers, therapeutic targets, and gene signatures. How to save money when using an unaffordable sample is a practical question. The case-cohort (CCH) study design can blend the economy of case-control studies with the advantages of cohort studies. But it has not been seen in the medical research literature where high-throughput genomic data were involved.

A score test does not need to fit the Cox PH model iteratively; hence, it can save computing time and avoid potential convergence issues. We developed a score test under the CCH design to identify DEGs …