Open Access. Powered by Scholars. Published by Universities.®

Biostatistics Commons

Open Access. Powered by Scholars. Published by Universities.®

Theses/Dissertations

2021

Discipline
Institution
Keyword
Publication

Articles 1 - 30 of 60

Full-Text Articles in Biostatistics

Dependent Censoring In Survival Analysis, Zhongcheng Lin Dec 2021

Dependent Censoring In Survival Analysis, Zhongcheng Lin

Dissertations

This dissertation mainly consists of two parts. In the first part, some properties of bivariate Archimedean Copulas formed by two time-to-event random variables are discussed under the setting of left censoring, where these two variables are subject to one left-censored independent variable respectively. Some distributional results for their joint cdf under different censoring patterns are presented. Those results are expected to be useful in both model fitting and checking procedures for Archimedean copula models with bivariate left-censored data. As an application of the theoretical results that are obtained, a moment estimator of the dependence parameter in Archimedean copula models is …


Approximate Likelihood Based Estimations For Joint Models With Intractable Likelihoods, Karl Stessy M. Bisselou Dec 2021

Approximate Likelihood Based Estimations For Joint Models With Intractable Likelihoods, Karl Stessy M. Bisselou

Theses & Dissertations

This dissertation focuses on the development of approximation approaches for the joint modeling (JM) of repeated measures data and time-to-event data in the presence of analytically or numerically intractable likelihoods. Current likelihood-based inferences for JMs show several limitations including (i) intractability of integrals during marginal likelihood derivations due to the complexity in computations, and (ii) the large number of nuisance parameters (unobserved) posing a problem with convergence. The h-likelihood (HL) and synthetic likelihood (SL) are two computationally efficient estimation approaches that overcome these challenges.

In the presence of extremely high censoring rates, the HL can produce bias parameter estimates. We …


Smoking, Alcohol Consumption, And Depression In Association With Incidence Of Type 2 Diabetes Among Mexican Americans In Starr County, Texas, Gabriela Rubannelsonkumar Dec 2021

Smoking, Alcohol Consumption, And Depression In Association With Incidence Of Type 2 Diabetes Among Mexican Americans In Starr County, Texas, Gabriela Rubannelsonkumar

Honors Program Theses and Research Projects

Previous studies on conditions like obesity, hypertension, and type 2 diabetes mellitus (T2DM) have explored the correlations between them and various other human conditions, including aortic stiffness, left ventricular hypertrophy and sleep apnea, as they predict possibilities of developing certain diseases in Mexican Americans. This study aims to observe the correlation between lifestyle decisions that could relate to the onset of the depression in normal, prediabetic, and diabetic individuals. These include smoking habits and alcohol consumption. Many papers have previously conducted research on these lifestyle habits as they relate to obesity, hypertension, diabetes, however, have done so in a singular …


Confidence Interval For The Mean Of A Beta Distribution, Sean Rangel Dec 2021

Confidence Interval For The Mean Of A Beta Distribution, Sean Rangel

Electronic Theses and Dissertations

Statistical inference for the mean of a beta distribution has become increasingly popular in various fields of academic research. In this study, we developed a novel statistical model from likelihood-based techniques to evaluate various confidence interval techniques for the mean of a beta distribution. Simulation studies will be implemented to compare the performance of the confidence intervals. In addition to the development and study involving confidence intervals, we will also apply the confidence intervals to real biological data that was gathered by the Department of Biology at Stephen F. Austin State University and provide recommendations on the best practice.


Predictors Of Poor Glycemic Control In Diabetic Clients With Mental Health Illness, Community Alliance, Omaha, Nebraska, Rachelle Flick Dec 2021

Predictors Of Poor Glycemic Control In Diabetic Clients With Mental Health Illness, Community Alliance, Omaha, Nebraska, Rachelle Flick

Capstone Experience

People with severe mental illness tend to die 10-25 years earlier than the general population (WHO). Main contributors to these premature deaths include comorbidities such as hypertension, cardiovascular disease, and diabetes. Diabetes prevalence in mentally ill people is 2 times higher than the general population (WHO). The World Health Organization is taking action to improve the health of people with severe mental illness. These efforts include creating protocols of prevention, identification, assessment, and treatment for mentally ill people, as well as improving access to general health services through the integration of physical and mental health services. Community Alliance, located in …


Estimating Treatment Effect On Medical Cost And Examining Medical Cost Trajectory Using Splines And Change Point Techniques., Indranil Ghosh Dec 2021

Estimating Treatment Effect On Medical Cost And Examining Medical Cost Trajectory Using Splines And Change Point Techniques., Indranil Ghosh

Electronic Theses and Dissertations

In the world of growing medical needs, other than the clinical outcomes, the cost of healthcare is one of the important aspects to evaluate. The cost of treatment could act as a decisive factor on which one to choose from two equally likely effective treatment options. In literature, the most used quantity for the cost of treatment is cumulative lifetime cost since the diagnosis of a disease. While it provides a bird' eye view of the treatment cost, it fails to capture the underlying pattern of the treatment cost trajectory. We developed a marginal structural functional model (MSFM) using an …


Data-Driven Statin Initiation Evaluation And Optimization For Prediabetes Population, Muhenned A. Abdulsahib Dec 2021

Data-Driven Statin Initiation Evaluation And Optimization For Prediabetes Population, Muhenned A. Abdulsahib

Graduate Theses and Dissertations

This dissertation develops quantitative models to support medical decision making of statininitiation considering the uncertainty in disease progression for prediabetes patients. A mathematical model is built to help medical decision-makers take action of statin initiation under uncertainty in future prediabetes progressions. The association between cholesterol drug use, such as statin, and elevating glucose level attracted considerable amounts of attention in the literature. Statin effects on glucose vary with respect to different levels of glucose. The first chapter of this dissertation introduces the problem and an overview of the tools that will be used to solve it. In the second chapter …


A Copula Model Approach To Identify The Differential Gene Expression, Prasansha Liyanaarachchi Dec 2021

A Copula Model Approach To Identify The Differential Gene Expression, Prasansha Liyanaarachchi

Mathematics & Statistics Theses & Dissertations

Deoxyribonucleic acid, more commonly known as DNA, is a complex double helix-shaped molecule present in all living organisms and hosts thousands of genes. However, only a few genes exhibit differential expression and play a vital role in a particular disease such as breast cancer. Microarray technology is one of the modern technologies developed to study these gene expressions. There are two major microarray technologies available for expression analysis: Spotted cDNA array and oligonucleotide array. The focus of our research is the statistical analysis of data that arises from the spotted cDNA microarray. Numerous models have been proposed in the literature …


Differential Privacy For Regression Modeling In Health: An Evaluation Of Algorithms, Joseph Ficek Nov 2021

Differential Privacy For Regression Modeling In Health: An Evaluation Of Algorithms, Joseph Ficek

USF Tampa Graduate Theses and Dissertations

Background: There is a need for rigorous and standardized methods of privacy protection for shared data in the health sciences. Differential privacy is one such method that has gained much popularity due to its versatility and robustness. This study evaluates differential privacy for explanatory regression modeling in the context of health research.

Methods: Surveyed and newly proposed algorithms were evaluated with respect to the accuracy (bias and RMSE) of coefficient estimates, the empirical coverage probability of confidence intervals, and the power and type I error rates of hypothesis tests. Evaluations took place in both simulated and real data from a …


High-Dimensional Feature Selection And Multi-Level Causal Mediation Analysis With Applications To Human Aging And Cluster-Based Intervention Studies, Hachem Saddiki Oct 2021

High-Dimensional Feature Selection And Multi-Level Causal Mediation Analysis With Applications To Human Aging And Cluster-Based Intervention Studies, Hachem Saddiki

Doctoral Dissertations

Many questions in public health and medicine are fundamentally causal in that our objective is to learn the effect of some exposure, randomized or not, on an outcome of interest. As a result, causal inference frameworks and methodologies have gained interest as a promising tool to reliably answer scientific questions. However, the tasks of identifying and efficiently estimating causal effects from observed data still pose significant challenges under complex data generating scenarios. We focus on (1) high-dimensional settings where the number of variables is orders of magnitude higher than the number of observations; and (2) multi-level settings, where study participants …


Monitoring Mammals At Multiple Scales: Case Studies From Carnivore Communities, Kadambari Devarajan Oct 2021

Monitoring Mammals At Multiple Scales: Case Studies From Carnivore Communities, Kadambari Devarajan

Doctoral Dissertations

Carnivores are distributed widely and threatened by habitat loss, poaching, climate change, and disease. They are considered integral to ecosystem function through their direct and indirect interactions with species at different trophic levels. Given the importance of carnivores, it is of high conservation priority to understand the processes driving carnivore assemblages in different systems. It is thus essential to determine the abiotic and biotic drivers of carnivore community composition at different spatial scales and address the following questions: (i) What factors influence carnivore community composition and diversity? (ii) How do the factors influencing carnivore communities vary across spatial and temporal …


Bayesian Calibration Of The Icrp Zirconium Biokinetic Model And Use Of Canned Priors For The Evaluation Of Bioassay, Thomas Raymond Labone Oct 2021

Bayesian Calibration Of The Icrp Zirconium Biokinetic Model And Use Of Canned Priors For The Evaluation Of Bioassay, Thomas Raymond Labone

Theses and Dissertations

The International Commission on Radiological Protection (ICRP) publishes biokinetic models that relate measurements of radioactive material in the body and excreta (bioassay) to the amount of the material taken into the body (intake). Given the intake and the biokinetic model, radiation dose to organs and tissues can be calculated. The ICRP approximates the biokinetics of radioactive materials in the body with compartmental models expressed mathematically as a system of ordinary differential equations, for which they provide point estimates of the rate constants. Inaccurate estimates of intake and radiation dose can result in cases where the biokinetics of an individual differ …


Marginally Interpretable Models And Multilevel Models For Quantile Regression With Random-Effects, Nahid Sultana Sumi Oct 2021

Marginally Interpretable Models And Multilevel Models For Quantile Regression With Random-Effects, Nahid Sultana Sumi

Theses and Dissertations

The quantile regression model is an active area of statistical research that has received a lot of attention. This complements the most widely used statistical tool, that is, mean regression analysis. Quantile regression analysis It has become more flexible because of its properties that include no assumption on the distribution of the response variable, equivalent to monotone transformations, and robustness to outliers. However, regression analysis offers methodological challenges if the observations are not independent. Cluster, multilevel, and repeated measures (longitudinal data) designs introduce such dependence. The correlation between observations on the same units or clusters should be accounted for to …


The Classification Of Basket Neural Cells In The Mammalian Neocortex, Sreya Pudi Oct 2021

The Classification Of Basket Neural Cells In The Mammalian Neocortex, Sreya Pudi

Senior Theses

Basket neuronal cells of the mammalian neocortex have been classically categorized into two or more groups. Originally, it was thought that the large and small types are the naturally occurring groups that emerge from reasons that relate to neurobiological function and anatomical position. Later, a study based on anatomical and physiological features of these neurons introduced a third type, the net basket cell which is intermediate in size as compared to the large and small types. In this study, multivariate analysis was used to test the hypothesis that the large and small types are morphologically distinct groups. The results of …


A Network-Based Approach For Computational Drug Repurposing On Cancer Data, Ann Reba, Thomas Alexander Oct 2021

A Network-Based Approach For Computational Drug Repurposing On Cancer Data, Ann Reba, Thomas Alexander

Electronic Theses and Dissertations

In this thesis, we are interested in finding the best drugs that can be repurposed for the disease and able to find the adverse effects such drugs that are FDA-Approved. Developing an effective drug can be a time-consuming and expensive crucible method. Network-based machine learning methods are used for predicting a given drug for A that can be used for B. It aims at finding new indications for already existing drugs and therefore increases the available therapeutic choices at a fraction of the cost of new drug development. The perturbation gene expression data corresponding to the MCF7 cell line was …


Correcting For Measurement Error In The Outcome When Estimating The Distribution Of Time To Pregnancy With The Current Duration Approach, Nicole Nasrallah Oct 2021

Correcting For Measurement Error In The Outcome When Estimating The Distribution Of Time To Pregnancy With The Current Duration Approach, Nicole Nasrallah

Theses and Dissertations

The current duration approach to modeling time-to-pregnancy (TTP) models the length of pregnancy attempt for women that are currently attempting pregnancy. There is a scarcity of studies, let alone TTP studies, that account for measurement error in the outcome. Previously, the benefits of a piecewise constant model with regards to bias in estimates of the survival function with measurement error and the parametric modelling of TTP was shown. In this thesis, correcting for measurement error in the outcome with the current duration approach is explored through piecewise constant models with log-normal measurement error. Five different methods are compared to determine …


Multiple Frailty Model For Spatially Correlated Interval-Censored, Wanfang Zhang Oct 2021

Multiple Frailty Model For Spatially Correlated Interval-Censored, Wanfang Zhang

Theses and Dissertations

In this paper, we consider the problem of multiple frailty selection for general interval-censored spatial survival data, which often occurs in clinical trials and epidemiological studies. The general interval-censored data is a mixture of left-, right- and interval-censored data. We propose a Bayesian semiparametric approach based on the Cox proportional hazard model, where monotone splines were used for non-parametrical modeling of the cumulative baseline hazards where the variable selection priors were used for frailty selection. A two-stage data augmentation with Poisson latent variables is developed for efficient computation. The approach is evaluated based a simulation study and illustrated using a …


Association Between The Beta Band Neural Response And The Behavioral Performance In Aphasic And Neurologically Intact Individuals, Yilun Zhang Oct 2021

Association Between The Beta Band Neural Response And The Behavioral Performance In Aphasic And Neurologically Intact Individuals, Yilun Zhang

Theses and Dissertations

The complex motor act of speech requires integrating linguistic and sensorimotor processes. Sensorimotor interaction mainly supports speech production in the form of state feedback control architecture. While speaking, subjects react to perturbations in the pitch of voice auditory feedback by changing their tone in the opposite direction to pitch-shift stimuli to compensate for the perceived pitch shift. Aphasia is a communication impairment affecting patients’ speaking, understanding, reading, and writing. The present study aims to examine the association between brain neural activity and the ability for speech auditory feedback error correction in both post-stroke aphasia and neurologically intact individuals. There are …


Urinary Bile Acid Indices As Prognostic Biomarkers For The Complications Of Liver Diseases, Wenkuan Li Aug 2021

Urinary Bile Acid Indices As Prognostic Biomarkers For The Complications Of Liver Diseases, Wenkuan Li

Theses & Dissertations

Hepatobilary diseases cause the accumulation of toxic bile acids (BA) in the liver, blood, and other tissues, which may lead to an unfavorable prognosis. In this study, we compared the urinary BA profile in 257 patients with hepatobilary diseases during a 7-year follow-up period. We investigated the use of the urinary BA profile to develop logistic regression models to predict the prognosis of hepatobiliary diseases in terms of developing disease-related complications, especially for ascites. The urinary BA profile was characterized by calculating BA indices, which quantify the composition, metabolism, hydrophilicity, and toxicity of the BA profile. All patients had high …


Predictive Modeling Of Clinical Outcomes For Hospitalized Covid-19 Patients Utilizing Cytof And Clinical Data., Onajia Stubblefield Aug 2021

Predictive Modeling Of Clinical Outcomes For Hospitalized Covid-19 Patients Utilizing Cytof And Clinical Data., Onajia Stubblefield

Electronic Theses and Dissertations

In December 2019, an outbreak of a novel coronavirus initiated a global pandemic. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a virus that causes the disease coronavirus disease 2019 (COVID-19). Symptoms of infection with COVID-19 vary widely between individuals. While some infected individuals are asymptomatic, others need more extensive care and require hospitalization. Indeed, the COVID-19 pandemic was characterized by a shortage of hospital beds which presented additional complications in providing adequate care for patients. In this study, we used a combination of T cell population data collected from mass cytometry analysis and clinical markers to form a predictive …


Bayesian Variable Selection Strategies In Longitudinal Mixture Models And Categorical Regression Problems., Md Nazir Uddin Aug 2021

Bayesian Variable Selection Strategies In Longitudinal Mixture Models And Categorical Regression Problems., Md Nazir Uddin

Electronic Theses and Dissertations

In this work, we seek to develop a variable screening and selection method for Bayesian mixture models with longitudinal data. To develop this method, we consider data from the Health and Retirement Survey (HRS) conducted by University of Michigan. Considering yearly out-of-pocket expenditures as the longitudinal response variable, we consider a Bayesian mixture model with $K$ components. The data consist of a large collection of demographic, financial, and health-related baseline characteristics, and we wish to find a subset of these that impact cluster membership. An initial mixture model without any cluster-level predictors is fit to the data through an MCMC …


Impact Of Inconsistent Imputation Models In Mediation Analysis, Bo Ye Aug 2021

Impact Of Inconsistent Imputation Models In Mediation Analysis, Bo Ye

Legacy Theses & Dissertations (2009 - 2024)

In this dissertation, we study the impact of inconsistent imputation methods in mediation analysis and its application. We present the study in three papers.


Conditional And Marginal Imputation Models For Multilevel Data, Gang Liu Aug 2021

Conditional And Marginal Imputation Models For Multilevel Data, Gang Liu

Legacy Theses & Dissertations (2009 - 2024)

This dissertation study extends sequential hierarchical regression imputation (SHRIMP) methods to multilevel datasets with three levels of nesting and proposes a marginal method based on marginalized multilevel model (MMM) framework. Specifically, the proposed model consists of two levels such that the first level relates the marginal mean of responses with covariates through a generalized regression model and the second level includes subject specific random effects within the same generalized regression model. To draw the inference on the population-averaged or subject-specified coefficients, the hierarchical regression and/or MMM is applied as the imputation and estimation models. We employ Markov Chain Monte Carlo …


Identification And Characterization Of De Novo Germline Tp53 Mutation Carriers In Families With Li-Fraumeni Syndrome, Carlos C. Vera Recio Aug 2021

Identification And Characterization Of De Novo Germline Tp53 Mutation Carriers In Families With Li-Fraumeni Syndrome, Carlos C. Vera Recio

Dissertations & Theses (Open Access)

Li-Fraumeni syndrome (LFS) is an inherited cancer syndrome caused by a deleterious mutation in TP53. An estimated 48% of LFS patients present due to a de novo mutation (DNM) in TP53. The knowledge of DNM status, DNM or familial mutation (FM), of an LFS patient requires genetic testing of both parents which is often inaccessible, making de novo LFS patients difficult to study. Famdenovo.TP53 is a Mendelian Risk prediction model used to predict DNM status of TP53 mutation carriers based on the cancer-family history and several input genetic parameters, including disease-gene penetrance. The good predictive performance of Famdenovo.TP53 was demonstrated …


Evaluating Public Masking Mandates On Covid-19 Growth Rates In U.S. States, Angus K. Wong Jul 2021

Evaluating Public Masking Mandates On Covid-19 Growth Rates In U.S. States, Angus K. Wong

Masters Theses

U.S. state governments have implemented numerous policies to help mitigate the spread of COVID-19. While there is strong biological evidence supporting the wearing of face masks or coverings in public spaces, the impact of public masking policies remains unclear. We aimed to evaluate how early versus delayed implementation of state-level public masking orders impacted subsequent COVID-19 growth rates. We defined “early” implementation as having a state-level mandate in place before September 1, 2020, the approximate start of the school-year. We defined COVID-19 growth rates as the relative increase in confirmed cases 7, 14, 21, 30, 45, 60-days after September 1. …


Statistical Modeling For High-Dimensional Compositional Data With Applications To The Human Microbiome, Thy Dao Jul 2021

Statistical Modeling For High-Dimensional Compositional Data With Applications To The Human Microbiome, Thy Dao

Graduate Theses and Dissertations

Compositional data refer to the data that lie on a simplex, which are common in many scientific domains such as genomics, geology, and economics. As the components in a composition must sum to one, traditional tests based on unconstrained data become inappropriate, and new statistical methods are needed to analyze this special type of data. This dissertation is motivated by some statistical problems arising in the analysis of compositional data. In particular, we focus on the high-dimensional and over-dispersed setting, where the dimensionality of compositions is greater than the sample size and the dispersion parameter is moderate or large. In …


Accurate And Integrative Detection Of Copy Number Variants With High-Throughput Data, Xizhi Luo Jul 2021

Accurate And Integrative Detection Of Copy Number Variants With High-Throughput Data, Xizhi Luo

Theses and Dissertations

Copy number variation, as a major source of genetic variation in the human genome, are gains or losses of the DNA segments. Copy number variation has gained considerable interest as it plays important roles in human complex diseases. Therefore, accurate detection of CNVs with data generated by modern genotyping technologies, such as SNP array and whole-exome sequencing (WES), comprises a critical step toward a better understanding of disease etiology. However, current statistical methodologies for CNV detection still face analytical challenges due to numerous genetic and technological factors that may lead to spurious findings. First, existing methods assume the independent observations …


A Comparison Of Spatial Clustering Assessment Methods, Nadeesha Dilhani Vidanapathirana Jul 2021

A Comparison Of Spatial Clustering Assessment Methods, Nadeesha Dilhani Vidanapathirana

Theses and Dissertations

Spatial clustering detection methods are widely used in many fields of research including sociology, epidemiology, ecology, and criminology. The objective of this study is to assess the performance of four spatial clustering detection methods: the average nearest neighbor ratio, Ripley’s K function, local Moran’s I and Getis-Ord Gi* statistics. We conduct a simulation study to evaluate the performance of each method for areal data under different types of spatial dependence and three different areal structures; a 20x20 regular grid, United States counties in six states and Canadian forward sortation areas (FSAs) in three provinces. The results shows that the empirical …


Addressing Bias In Non-Experimental Studies Assessing Treatment Outcomes In Prostate Cancer, David E. Guy Jun 2021

Addressing Bias In Non-Experimental Studies Assessing Treatment Outcomes In Prostate Cancer, David E. Guy

Electronic Thesis and Dissertation Repository

We evaluated the ability of matching techniques to balance baseline characteristics between treatment groups using non-experimental data. We identified a set of balance diagnostics that assessed key differences in baseline covariates with potential for confounding. These diagnostics were used in a novel systematic approach to developing and evaluating models for use in propensity score matching that optimized balance and data retention. We then compared the performance of propensity score and coarsened exact matching strategies in optimizing balance and data retention, using non-experimental data from a pan-Canadian prostate cancer database. Both matching techniques balanced baseline covariates adequately and retained approximately 70% …


Bayesian Multivariate Joint Modeling For Skewed-Longitudinal And Time-To-Event Data, Lan Xu Jun 2021

Bayesian Multivariate Joint Modeling For Skewed-Longitudinal And Time-To-Event Data, Lan Xu

USF Tampa Graduate Theses and Dissertations

In epidemiologic and clinical studies, a relatively large number of biomarkers are repeatedly measured in patients over time, often associated with data on epidemiologic and clinical interest events. So, much attention is focused on developing the specific patterns of the longitudinal measurements, and the associations between those patterns and the time to a certain event, such as heart attack, diagnose of disease, time to transplantation, or death. In the last two decades, the research into joint modeling of longitudinal and time-to-event data has received a tremendous amount of attention.

Numerous researchers have proposed joint modeling approaches for a single longitudinal …