Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability Commons

Articles 1 - 8 of 8

Full-Text Articles in Statistics and Probability

Mathematical Modeling Suggests Cooperation Of Plant-Infecting Viruses, Joshua Miller, Vitaly V. Ganusov, Tessa Burch-Smith May 2022

Chancellor’s Honors Program Projects

No abstract provided.


Statistical Theory For Specialized Linear Regression Adjustment Methods Compared To Multiple Linear Regression In The Presence And Absence Of Interaction Effects, Leon Su Jan 2022

Theses and Dissertations--Statistics

When building models to investigate outcomes and variables of interest, researchers often want to adjust for other variables. There is a variety of ways that these adjustments are performed. In this work, we will consider four approaches to adjustment utilized by researchers in various fields. We will compare the efficacy of these methods to what we call the “true model method”, fitting a multiple linear regression model in which adjustment variables are model covariates. Our goal is to show that these adjustment methods have inferior performance to the true model method by comparing model parameter estimates, power, type I error, …


Dot: Gene-Set Analysis By Combining Decorrelated Association Statistics, Olga A. Vsevolozhskaya, Min Shi, Fengjiao Hu, Dmitri V. Zaykin Apr 2020

Biostatistics Faculty Publications

Historically, the majority of statistical association methods have been designed assuming availability of SNP-level information. However, modern genetic and sequencing data present new challenges to access and sharing of genotype-phenotype datasets, including cost of management, difficulties in consolidation of records across research groups, etc. These issues make methods based on SNP-level summary statistics particularly appealing. The most common form of combining statistics is a sum of SNP-level squared scores, possibly weighted, as in burden tests for rare variants. The overall significance of the resulting statistic is evaluated using its distribution under the null hypothesis. Here, we demonstrate that this basic …
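
The combining step described above (a possibly weighted sum of squared SNP-level scores, judged against its null distribution) can be sketched as follows. This is a minimal illustration, not the method of the paper: it assumes the Z-scores are independent standard normal under the null, which correlated SNPs generally violate, and it approximates the null distribution by simulation. The function names, the default `n_sim`, and the seed are hypothetical choices.

```python
import random


def weighted_sum_of_squares(z_scores, weights=None):
    """Combine SNP-level Z-scores into one statistic: sum of w_i * z_i^2."""
    if weights is None:
        weights = [1.0] * len(z_scores)  # unweighted case
    if len(weights) != len(z_scores):
        raise ValueError("weights and z_scores must have the same length")
    return sum(w * z * z for w, z in zip(weights, z_scores))


def monte_carlo_p_value(z_scores, weights=None, n_sim=10000, seed=0):
    """Approximate the p-value by simulating the statistic under the null.

    Assumption (not from the source): scores are independent N(0, 1)
    under the null hypothesis.
    """
    rng = random.Random(seed)
    observed = weighted_sum_of_squares(z_scores, weights)
    exceed = 0
    for _ in range(n_sim):
        null_z = [rng.gauss(0.0, 1.0) for _ in z_scores]
        if weighted_sum_of_squares(null_z, weights) >= observed:
            exceed += 1
    # Add-one correction keeps the estimate strictly positive.
    return (exceed + 1) / (n_sim + 1)


if __name__ == "__main__":
    scores = [2.1, -0.4, 1.7, 3.0]  # made-up Z-scores for illustration
    print(weighted_sum_of_squares(scores))
    print(monte_carlo_p_value(scores))
```

With unit weights and independent scores the statistic follows a chi-square distribution with as many degrees of freedom as there are SNPs, so the simulation is only needed for the weighted case; it is used here for both to keep the sketch short.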


On The Quantification Of Complexity And Diversity From Phenotypes To Ecosystems, Zachary Harrison Marion Dec 2016

Doctoral Dissertations

A cornerstone of ecology and evolution is comparing and explaining the complexity of natural systems, be they genomes, phenotypes, communities, or entire ecosystems. These comparisons and explanations then beget questions about how complexity should be quantified in theory and estimated in practice. Here I embrace diversity partitioning using Hill or effective numbers to move the empirical side of the field forward regarding the quantification of biological complexity.

First, at the level of phenotypes, I show that traditional multivariate analyses ignore individual complexity and provide relatively abstract representations of variation among individuals. I then suggest using well-known diversity indices from community ecology …
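
For readers unfamiliar with the Hill (effective) numbers mentioned in this abstract, a minimal sketch of the standard definition follows. It is the textbook formula, not code from the dissertation, and the function name and input format (raw abundances) are assumptions. For order q other than 1 the Hill number is (sum of p_i^q) raised to 1/(1 - q); at q = 1 it is the exponential of Shannon entropy.

```python
import math


def hill_number(abundances, q):
    """Effective number of types of order q for a list of abundances."""
    if any(a < 0 for a in abundances):
        raise ValueError("abundances must be non-negative")
    total = sum(abundances)
    if total <= 0:
        raise ValueError("at least one abundance must be positive")
    # Relative abundances; absent types (zero counts) do not contribute.
    p = [a / total for a in abundances if a > 0]
    if math.isclose(q, 1.0):
        # Limit as q -> 1: exponential of Shannon entropy.
        return math.exp(-sum(pi * math.log(pi) for pi in p))
    return sum(pi ** q for pi in p) ** (1.0 / (1.0 - q))


if __name__ == "__main__":
    community = [50, 30, 15, 5]  # made-up counts for illustration
    for order in (0, 1, 2):
        print(order, hill_number(community, order))
```

A perfectly even community of four types yields 4 at every order, which is what makes these numbers interpretable as an "effective" count.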


Niche-Based Modeling Of Japanese Stiltgrass (Microstegium Vimineum) Using Presence-Only Information, Nathan Bush Nov 2015

Masters Theses

The Connecticut River watershed is experiencing a rapid invasion of aggressive non-native plant species, which threaten watershed function and structure. Volunteer-based monitoring programs such as the University of Massachusetts’ OutSmart Invasives Species Project, Early Detection Distribution Mapping System (EDDMapS) and the Invasive Plant Atlas of New England (IPANE) have gathered valuable invasive plant data. These programs provide a unique opportunity for researchers to model invasive plant species utilizing citizen-sourced data. This study took advantage of these large data sources to model invasive plant distribution and to determine environmental and biophysical predictors that are most influential in dispersion, and to identify …


Assessing The Probability That A Finding Is Genuine For Large-Scale Genetic Association Studies, Chia-Ling Kuo, Olga A. Vsevolozhskaya, Dmitri V. Zaykin May 2015

Olga A. Vsevolozhskaya

Genetic association studies routinely involve massive numbers of statistical tests accompanied by P-values. Whole genome sequencing technologies increased the potential number of tested variants to tens of millions. The more tests are performed, the smaller the P-value required to be deemed significant. However, a small P-value is not equivalent to small chances of a spurious finding and significance thresholds may fail to serve as efficient filters against false results. While the Bayesian approach can provide a direct assessment of the probability that a finding is spurious, its adoption in association studies has been slow, due in part to the ubiquity …
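
The gap between a significance threshold and the chance that a finding is spurious can be illustrated with a simple Bayes-rule calculation, often called the false positive report probability. This is a generic sketch and not necessarily the approach developed in the paper; the function name and the example values for threshold, power, and prior are assumptions chosen for illustration.

```python
def false_positive_report_probability(alpha, power, prior):
    """Probability that a significant result is a false positive.

    alpha: significance threshold (chance of rejecting a true null)
    power: chance of rejecting when the association is real
    prior: prior probability that the association is real
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    if not 0.0 < power <= 1.0:
        raise ValueError("power must be in (0, 1]")
    if not 0.0 <= prior <= 1.0:
        raise ValueError("prior must be in [0, 1]")
    false_positives = alpha * (1.0 - prior)  # significant, but null is true
    true_positives = power * prior           # significant, and effect is real
    return false_positives / (false_positives + true_positives)


if __name__ == "__main__":
    # Same threshold and power, different priors (illustrative values only).
    for prior in (0.5, 0.01, 0.0001):
        print(prior, false_positive_report_probability(0.05, 0.8, prior))
```

Holding the threshold fixed, the probability of a spurious finding rises sharply as the prior probability of a real association falls, which is the situation when millions of variants are tested.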


Using Capture-Mark-Recapture Techniques To Estimate Detection Probabilities & Fidelity Of Expression For The Critically Endangered James Spinymussel (Pleurobema Collina), Alaina C. Esposito May 2015

Masters Theses, 2010-2019

The critically endangered James Spinymussel (Pleurobema collina) is a species of freshwater mussel endemic to Virginia’s James and Dan River basins. In the last 20 years, P. collina has experienced a substantial decline in numbers and currently occupies approximately 10% of its original habitat; however, little is known about this species that could assist in its conservation. A 230-meter reach of transitional habitat in Swift Run was selected for repeat observations to estimate detection probabilities using a Capture-Mark-Recapture framework. In June 2014, visual scouting began to locate and tag P. collina (including other mussels in the community) with PIT …


Childhood Body Mass Index Trajectories: Modeling, Characterizing, Pairwise Correlations And Socio-Demographic Predictors Of Trajectory Characteristics, Xiaozhong Wen, Ken Kleinman, Matthew W. Gillman, Sherly L. Rifas-Shiman, Elsie M. Taveras Jan 2012

Public Health Department Faculty Publication Series

BACKGROUND:

Modeling childhood body mass index (BMI) trajectories, versus estimating change in BMI between specific ages, may improve prediction of later body-size-related outcomes. Prior studies of BMI trajectories are limited by restricted age periods and insufficient use of trajectory information.

METHODS:

Among 3,289 children seen at 81,550 pediatric well-child visits from infancy to 18 years between 1980 and 2008, we fit individual BMI trajectories using mixed effect models with fractional polynomial functions. From each child's fitted trajectory, we estimated age and BMI at infancy peak and adiposity rebound, and velocity and area under curve between 1 week, infancy peak, adiposity …
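
Reading the infancy peak and adiposity rebound off a fitted trajectory can be sketched as a search for the first local maximum and the following local minimum of the curve. The sketch below is an illustration only: the curve is a hypothetical fractional-polynomial-style function with made-up coefficients, not a fit from the study, and the grid search, function names, and age range are assumptions.

```python
import math


def toy_bmi_curve(age_years):
    """Hypothetical fitted BMI trajectory (made-up coefficients)."""
    return (17.0 + 1.3044 * math.sqrt(age_years)
            - 0.8281 * age_years + 0.05 * age_years ** 2)


def find_peak_and_rebound(curve, start=0.02, stop=18.0, step=0.01):
    """Return ((age, bmi) at infancy peak, (age, bmi) at adiposity rebound).

    The peak is taken as the first local maximum on the age grid and the
    rebound as the first local minimum after it; either is None if absent.
    """
    n = int(round((stop - start) / step)) + 1
    ages = [start + i * step for i in range(n)]
    values = [curve(a) for a in ages]
    peak = None
    rebound = None
    for i in range(1, n - 1):
        if peak is None:
            if values[i] > values[i - 1] and values[i] >= values[i + 1]:
                peak = (ages[i], values[i])
        elif values[i] < values[i - 1] and values[i] <= values[i + 1]:
            rebound = (ages[i], values[i])
            break
    return peak, rebound


if __name__ == "__main__":
    peak, rebound = find_peak_and_rebound(toy_bmi_curve)
    print("infancy peak (age, BMI):", peak)
    print("adiposity rebound (age, BMI):", rebound)
```

A grid search is used here because it works for any fitted curve without needing its derivative; with a closed-form trajectory the two ages could instead be found by solving for zeros of the derivative.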