Open Access. Powered by Scholars. Published by Universities.®

Statistical Models Commons

2021

Biostatistics

Articles 1 - 21 of 21

Full-Text Articles in Statistical Models

Approximate Likelihood Based Estimations For Joint Models With Intractable Likelihoods, Karl Stessy M. Bisselou Dec 2021

Theses & Dissertations

This dissertation focuses on the development of approximation approaches for the joint modeling (JM) of repeated measures data and time-to-event data in the presence of analytically or numerically intractable likelihoods. Current likelihood-based inferences for JMs show several limitations, including (i) intractable integrals in the derivation of the marginal likelihood, owing to computational complexity, and (ii) a large number of unobserved nuisance parameters, which hinders convergence. The h-likelihood (HL) and synthetic likelihood (SL) are two computationally efficient estimation approaches that overcome these challenges.

In the presence of extremely high censoring rates, the HL can produce biased parameter estimates. We …


Comparing Machine Learning Techniques With State-Of-The-Art Parametric Prediction Models For Predicting Soybean Traits, Susweta Ray Dec 2021

Department of Statistics: Dissertations, Theses, and Student Work

Soybean is a significant source of protein and oil, and is also widely used as animal feed. Thus, developing lines that are superior in terms of yield, protein and oil content is important to feed the ever-growing population. Unlike high-cost phenotyping, genotyping is both cost- and time-efficient for breeders, whereas evaluating new lines in different environments (location-year combinations) can be costly. Several genomic prediction (GP) methods have been developed to use the marker and environment data effectively to predict the yield or other relevant phenotypic traits of crops. Our study compares a conventional GP method (GBLUP), a …


Confidence Interval For The Mean Of A Beta Distribution, Sean Rangel Dec 2021

Electronic Theses and Dissertations

Statistical inference for the mean of a beta distribution has become increasingly popular in various fields of academic research. In this study, we developed a novel statistical model from likelihood-based techniques to evaluate various confidence interval techniques for the mean of a beta distribution. Simulation studies will be implemented to compare the performance of the confidence intervals. In addition to the development and study involving confidence intervals, we will also apply the confidence intervals to real biological data that was gathered by the Department of Biology at Stephen F. Austin State University and provide recommendations on the best practice.
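
As a rough illustration of the kind of interval such a comparison might use as a baseline, the sketch below computes a large-sample (Wald) interval for the mean of simulated beta data. It is not the likelihood-based method developed in the study; the function name `wald_ci_for_mean`, the shape parameters, and the sample size are all assumptions chosen for illustration.

```python
import math
import random
from statistics import mean, stdev


def wald_ci_for_mean(sample, z=1.96):
    """Large-sample (Wald) interval for a mean, clipped to [0, 1].

    Hypothetical helper: a generic baseline, not the study's method.
    """
    n = len(sample)
    center = mean(sample)
    half_width = z * stdev(sample) / math.sqrt(n)
    # A beta mean lies in [0, 1], so clip the interval to that range.
    return max(0.0, center - half_width), min(1.0, center + half_width)


random.seed(0)                      # fixed seed so the run is repeatable
alpha, beta = 2.0, 5.0              # assumed shape parameters
sample = [random.betavariate(alpha, beta) for _ in range(200)]

lower, upper = wald_ci_for_mean(sample)
true_mean = alpha / (alpha + beta)  # mean of Beta(alpha, beta)
print(f"interval: ({lower:.3f}, {upper:.3f}), true mean: {true_mean:.3f}")
```

Repeating the simulation many times and counting how often the interval contains `true_mean` would give an empirical coverage rate, which is one common way to compare interval procedures.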


Monitoring Mammals At Multiple Scales: Case Studies From Carnivore Communities, Kadambari Devarajan Oct 2021

Doctoral Dissertations

Carnivores are distributed widely and threatened by habitat loss, poaching, climate change, and disease. They are considered integral to ecosystem function through their direct and indirect interactions with species at different trophic levels. Given the importance of carnivores, it is of high conservation priority to understand the processes driving carnivore assemblages in different systems. It is thus essential to determine the abiotic and biotic drivers of carnivore community composition at different spatial scales and address the following questions: (i) What factors influence carnivore community composition and diversity? (ii) How do the factors influencing carnivore communities vary across spatial and temporal …


Ecological Risk Assessment For The Temperate Demersal Elasmobranch Resource, Department Of Primary Industries And Regional Development, Western Australia Oct 2021

Fisheries research reports

No abstract provided.


Squid And Cuttlefish Resources Of Western Australia, Daniel Yeoh, Danielle J. Johnston PhD, David C. Harris Sep 2021

Fisheries research reports

No abstract provided.


Otoliths Of South-Western Australian Fish: A Photographic Catalogue, Chris Dowling, Kim Smith, Elain Lek, Joshua Brown Sep 2021

Fisheries research reports

No abstract provided.


Determining Malignancy: Can Mammogram Results Help Predict The Diagnosis Of Breast Tumors?, Taylor Behrens Aug 2021

Symposium of Student Scholars

Even with advancements in treatment and preventative care, breast cancer remains an epidemic claiming more than 40,000 American male and female lives each year. The mammogram dataset that I am analyzing was initially compiled in the early 1990s by a team from the University of Wisconsin - Madison. Past research diagnoses breast cancer from fine-needle aspirates. My research focuses on predicting whether we can determine breast cancer diagnoses without the use of invasive procedures and, in particular, whether we can predict breast cancer based on mammogram data. Do measures of gray-scale texture, radius, concavity, perimeter, compactness, area, and smoothness of …


Predictive Modeling Of Clinical Outcomes For Hospitalized COVID-19 Patients Utilizing CyTOF And Clinical Data., Onajia Stubblefield Aug 2021

Electronic Theses and Dissertations

In December 2019, an outbreak of a novel coronavirus initiated a global pandemic. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a virus that causes the disease coronavirus disease 2019 (COVID-19). Symptoms of infection with COVID-19 vary widely between individuals. While some infected individuals are asymptomatic, others need more extensive care and require hospitalization. Indeed, the COVID-19 pandemic was characterized by a shortage of hospital beds which presented additional complications in providing adequate care for patients. In this study, we used a combination of T cell population data collected from mass cytometry analysis and clinical markers to form a predictive …


Bayesian Variable Selection Strategies In Longitudinal Mixture Models And Categorical Regression Problems., Md Nazir Uddin Aug 2021

Electronic Theses and Dissertations

In this work, we seek to develop a variable screening and selection method for Bayesian mixture models with longitudinal data. To develop this method, we consider data from the Health and Retirement Survey (HRS) conducted by the University of Michigan. Considering yearly out-of-pocket expenditures as the longitudinal response variable, we consider a Bayesian mixture model with $K$ components. The data consist of a large collection of demographic, financial, and health-related baseline characteristics, and we wish to find a subset of these that impact cluster membership. An initial mixture model without any cluster-level predictors is fit to the data through an MCMC …


Identification And Characterization Of De Novo Germline TP53 Mutation Carriers In Families With Li-Fraumeni Syndrome, Carlos C. Vera Recio Aug 2021

Dissertations & Theses (Open Access)

Li-Fraumeni syndrome (LFS) is an inherited cancer syndrome caused by a deleterious mutation in TP53. An estimated 48% of LFS patients present due to a de novo mutation (DNM) in TP53. The knowledge of DNM status, DNM or familial mutation (FM), of an LFS patient requires genetic testing of both parents which is often inaccessible, making de novo LFS patients difficult to study. Famdenovo.TP53 is a Mendelian Risk prediction model used to predict DNM status of TP53 mutation carriers based on the cancer-family history and several input genetic parameters, including disease-gene penetrance. The good predictive performance of Famdenovo.TP53 was demonstrated …


Species In Vernal Pools: ANOVA, Lisa Manne May 2021

Open Educational Resources

A one-way analysis of variance exercise using data on species diversities from vernal pools. Data are from vernal pools in Willowbrook Park (adjacent to College of Staten Island's campus) in spring.

The typical ANOVA gives a straightforward result (significant ANOVA, easily interpreted Tukey-Kramer analysis). This data set requires more nuanced interpretation, as the ANOVA is marginally significant, and Tukey-Kramer yields one significant pairwise comparison between groups. Relative lack of variation within groups explains this apparent enigma.
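
To make the role of within-group variation concrete, here is a minimal sketch of the one-way ANOVA F statistic computed from first principles. The function name `one_way_anova_f` and the species counts are hypothetical and are not the Willowbrook Park data; the sketch stops at the F ratio and does not perform the Tukey-Kramer comparisons.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA.

    F is the between-group mean square divided by the within-group
    mean square, so small within-group variation inflates F.
    """
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total

    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of observations around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    df_between = k - 1
    df_within = n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Hypothetical species counts for three pools (illustrative only).
pools = [[4, 5, 6], [5, 6, 7], [8, 9, 10]]
f_stat, df_b, df_w = one_way_anova_f(pools)
print(f"F({df_b}, {df_w}) = {f_stat:.2f}")
```

Because the within-group mean square sits in the denominator, groups with little internal variation can yield a sizable F ratio and a significant pairwise difference even when the group means are not far apart.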


Understanding The Effect Of Adaptive Mutations On The Three-Dimensional Structure Of RNA, Justin Cook Apr 2021

Undergraduate Research and Scholarship Symposium

Single-nucleotide polymorphisms (SNPs) are variations in the genome where one base pair can differ between individuals.¹ SNPs occur throughout the genome and can correlate to a disease-state if they occur in a functional region of DNA.¹ According to the central dogma of molecular biology, any variation in the DNA sequence will have a direct effect on the RNA sequence and will potentially alter the identity or conformation of a protein product. A single RNA molecule, due to intramolecular base pairing, can acquire a plethora of 3-D conformations that are described by its structural ensemble. One SNP, rs12477830, which …


Regression Analyses Assessing The Impact Of Environmental Factors On COVID-19 Transmission And Mortality, El Hussain Shamsa, Kezhong Zhang Feb 2021

Medical Student Research Symposium

No abstract provided.


A Bayesian Hierarchical Mixture Model With Continuous-Time Markov Chains To Capture Bumblebee Foraging Behavior, Max Thrush Hukill Jan 2021

Honors Projects

The standard statistical methodology for analyzing complex case-control studies in ethology is often limited by approaches that force researchers to model distinct aspects of biological processes in a piecemeal, disjointed fashion. By developing a hierarchical Bayesian model, this work demonstrates that statistical inference in this context can be done using a single coherent framework. To do this, we construct a continuous-time Markov chain (CTMC) to model bumblebee foraging behavior. To connect the experimental design with the CTMC, we employ a mixture model controlled by a logistic regression on the two-factor design matrix. We then show how to infer these model …


The Need To Incorporate Communities In Compartmental Models, Michael J. Kane, Owais Gilani Jan 2021

Faculty Journal Articles

Tian et al. provide a framework for assessing population-level interventions of disease outbreaks through the construction of counterfactuals in a large-scale, natural experiment assessing the efficacy of mild, but early interventions compared to delayed interventions. The technique is applied to the recent SARS-CoV-2 outbreak with the population of Shenzhen, China acting as the mild-but-early treatment group and a combination of several US counties resembling Shenzhen but enacting a delayed intervention acting as the control. To help further the development of this framework and identify an avenue for further enhancement, we focus on the use and potential limitations of compartmental …


Statistical Approaches For Estimation And Comparison Of Brain Functional Connectivity, Jifang Zhao Jan 2021

Theses and Dissertations

Drug addiction can lead to many health-related problems and social concerns. Functional connectivity obtained from functional magnetic resonance imaging (fMRI) data promotes a variety of fundamental understandings in such association. Due to its complex correlation structure and large dimensionality, the modeling and analysis of the functional connectivity from neuroimage are challenging. By proposing a spatio-temporal model for multi-subject neuroimage data, we incorporate voxel-level spatio-temporal dependencies of whole-brain measurements to improve the accuracy of statistical inference. To tackle large-scale spatio-temporal neuroimage data, we develop a computationally efficient algorithm to estimate the parameters. Our method is used to identify functional connectivity and …


Bayesian Techniques For Relating Genetic Polymorphisms To Diffusion Tensor Images Of Cocaine Users, Tmader Alballa Jan 2021

Theses and Dissertations

Past investigations utilizing Diffusion Tensor Imaging (DTI) have demonstrated that cocaine use disorder (CUD) yields white matter changes. We proposed three Bayesian techniques in order to explore the relationship between Fractional Anisotropy (FA), genetic data, and years of cocaine use (YCU). CUD participants exhibit abnormality in different areas of the brain versus non-drug using controls, which is measured by DTI. This dissertation is motivated by a neuroimaging genetic study in cocaine dependence, which found that there were relationships between several genes such as GAD and 5-HT2R and CUD subjects.

In the first chapter, there is background on the …


Sexual Behaviors Associated With Online Partner-Seeking Among Men Who Have Sex With Men From Small/Midsized Towns Or Rural Areas In Kentucky, Vira Pravosud Jan 2021

Theses and Dissertations--Epidemiology and Biostatistics

The HIV epidemic remains one of the most significant public health issues in the United States, particularly among men who have sex with men (MSM). New avenues for partner-seeking have emerged over the past three decades, including through the Internet, social media, and geosocial networking applications. Consisting of three cross-sectional studies, this dissertation research aimed to determine associations between the use of various online tools for partner-seeking (hereafter collectively referred to as “apps”) and HIV-related sexual behaviors among 252 young adult MSM residing in small/midsized towns or rural areas in Central Kentucky, a group that has been under-represented in the …


Statistical Methods In Genetic Studies, Cheng Gao Jan 2021

Dissertations, Master's Theses and Master's Reports

This dissertation includes three chapters. A brief description of each chapter follows.

In Chapter 1, we proposed a new method, called MF-TOWmuT, for genome-wide association studies with multiple genetic variants and multiple phenotypes using family samples. MF-TOWmuT uses a kinship matrix to account for sample relatedness. It is worth mentioning that in simulations, we considered hidden polygenic effects and varied the proportion of variance they contribute when generating phenotypes. Simulation studies show that MF-TOWmuT can preserve the type I error rates and is more powerful than several existing methods in different simulation scenarios. MF-TOWmuT is also quite …


Construction And Analysis Of Genetic Regulatory Networks With RNA-Seq Data From Arabidopsis Thaliana, Tessa Kriz Jan 2021

Dissertations, Master's Theses and Master's Reports

Reconstruction of gene regulatory networks (GRNs) is a fundamental aspect of genetic engineering and provides a deeper understanding of the biological processes of an organism. Two methods were implemented to reconstruct the gene regulatory networks of Arabidopsis thaliana under two treatments: methyl jasmonate (MeJa) and salicylic acid (SA). The Joint Reconstruction of multiple Gene Regulatory Networks (JRmGRN) method was utilized to construct a joint network for identifying hub genes common to both conditions in addition to networks specific to each condition. The Differential Network Analysis with False Discovery Rate Control method constructed a network of connections unique to only one …