Open Access. Powered by Scholars. Published by Universities.®

Statistical Models Commons

Open Access. Powered by Scholars. Published by Universities.®

PDF

Series

Life Sciences

Institution
Keyword
Publication Year
Publication

Articles 1 - 30 of 53

Full-Text Articles in Statistical Models

Modeling Biphasic, Non-Sigmoidal Dose-Response Relationships: Comparison Of Brain- Cousens And Cedergreen Models For A Biochemical Dataset, Venkat D. Abbaraju, Tamaraty L. Robinson, Brian P. Weiser Aug 2023

Modeling Biphasic, Non-Sigmoidal Dose-Response Relationships: Comparison Of Brain- Cousens And Cedergreen Models For A Biochemical Dataset, Venkat D. Abbaraju, Tamaraty L. Robinson, Brian P. Weiser

Rowan-Virtua School of Osteopathic Medicine Faculty Scholarship

Biphasic, non-sigmoidal dose-response relationships are frequently observed in biochemistry and pharmacology, but they are not always analyzed with appropriate statistical methods. Here, we examine curve fitting methods for “hormetic” dose-response relationships where low and high doses of an effector produce opposite responses. We provide the full dataset used for modeling, and we provide the code for analyzing the dataset in SAS using two established mathematical models of hormesis, the Brain-Cousens model and the Cedergreen model. We show how to obtain and interpret curve parameters such as the ED50 that arise from modeling, and we discuss how curve parameters might change …


Utilizing Markov Chains To Estimate Allele Progression Through Generations, Ronit Gandhi Jan 2023

Utilizing Markov Chains To Estimate Allele Progression Through Generations, Ronit Gandhi

Honors Theses

All populations display patterns in allele frequencies over time. Some alleles cease to exist, while some grow to become the norm. These frequencies can shift or stay constant based on the conditions the population lives in. If in Hardy-Weinberg equilibrium, the allele frequencies stay constant. Most populations, however, have bias from environmental factors, sexual preferences, other organisms, etc. We propose a stochastic Markov chain model to study allele progression across generations. In such a model, the allele frequencies in the next generation depend only on the frequencies in the current one.

We use this model to track a recessive allele …


Comparing Machine Learning Techniques With State-Of-The-Art Parametric Prediction Models For Predicting Soybean Traits, Susweta Ray Dec 2021

Comparing Machine Learning Techniques With State-Of-The-Art Parametric Prediction Models For Predicting Soybean Traits, Susweta Ray

Department of Statistics: Dissertations, Theses, and Student Work

Soybean is a significant source of protein and oil, and also widely used as animal feed. Thus, developing lines that are superior in terms of yield, protein and oil content is important to feed the ever-growing population. As opposed to the high-cost phenotyping, genotyping is both cost and time efficient for breeders while evaluating new lines in different environments (location-year combinations) can be costly. Several Genomic prediction (GP) methods have been developed to use the marker and environment data effectively to predict the yield or other relevant phenotypic traits of crops. Our study compares a conventional GP method (GBLUP), a …


Species In Vernal Pools: Anova, Lisa Manne May 2021

Species In Vernal Pools: Anova, Lisa Manne

Open Educational Resources

A one-way analysis of variance exercise using data on species diversities from vernal pools.Data are from vernal pools in Willowbrook Park (adjacent to College of Staten Island's campus) in spring.

The typical ANOVA gives a straightforward result (significant anova, easily-interpreted Tukey-Kramer analysis). This data set requires more nuanced interpretation, as the ANOVA is marginally significant, and Tukey-Kramer yields one significant pairwise comparison between groups. Relative lack of variation within groups explains this apparent enigma.


Do Metabolic Networks Follow A Power Law? A Psamm Analysis, Ryan Geib, Lubos Thoma, Ying Zhang May 2019

Do Metabolic Networks Follow A Power Law? A Psamm Analysis, Ryan Geib, Lubos Thoma, Ying Zhang

Senior Honors Projects

Inspired by the landmark paper “Emergence of Scaling in Random Networks” by Barabási and Albert, the field of network science has focused heavily on the power law distribution in recent years. This distribution has been used to model everything from the popularity of sites on the World Wide Web to the number of citations received on a scientific paper. The feature of this distribution is highlighted by the fact that many nodes (websites or papers) have few connections (internet links or citations) while few “hubs” are connected to many nodes. These properties lead to two very important observed effects: the …


Computational Analysis Of Large-Scale Trends And Dynamics In Eukaryotic Protein Family Evolution, Joseph Boehm Ahrens Mar 2019

Computational Analysis Of Large-Scale Trends And Dynamics In Eukaryotic Protein Family Evolution, Joseph Boehm Ahrens

FIU Electronic Theses and Dissertations

The myriad protein-coding genes found in present-day eukaryotes arose from a combination of speciation and gene duplication events, spanning more than one billion years of evolution. Notably, as these proteins evolved, the individual residues at each site in their amino acid sequences were replaced at markedly different rates. The relationship between protein structure, protein function, and site-specific rates of amino acid replacement is a topic of ongoing research. Additionally, there is much interest in the different evolutionary constraints imposed on sequences related by speciation (orthologs) versus sequences related by gene duplication (paralogs). A principal aim of this dissertation is to …


Unified Methods For Feature Selection In Large-Scale Genomic Studies With Censored Survival Outcomes, Lauren Spirko-Burns, Karthik Devarajan Mar 2019

Unified Methods For Feature Selection In Large-Scale Genomic Studies With Censored Survival Outcomes, Lauren Spirko-Burns, Karthik Devarajan

COBRA Preprint Series

One of the major goals in large-scale genomic studies is to identify genes with a prognostic impact on time-to-event outcomes which provide insight into the disease's process. With rapid developments in high-throughput genomic technologies in the past two decades, the scientific community is able to monitor the expression levels of tens of thousands of genes and proteins resulting in enormous data sets where the number of genomic features is far greater than the number of subjects. Methods based on univariate Cox regression are often used to select genomic features related to survival outcome; however, the Cox model assumes proportional hazards …


Season-Ahead Forecasting Of Water Storage And Irrigation Requirements – An Application To The Southwest Monsoon In India, Arun Ravindranath, Naresh Devineni, Upmanu Lall, Paulina Concha Larrauri Oct 2018

Season-Ahead Forecasting Of Water Storage And Irrigation Requirements – An Application To The Southwest Monsoon In India, Arun Ravindranath, Naresh Devineni, Upmanu Lall, Paulina Concha Larrauri

Publications and Research

Water risk management is a ubiquitous challenge faced by stakeholders in the water or agricultural sector. We present a methodological framework for forecasting water storage requirements and present an application of this methodology to risk assessment in India. The application focused on forecasting crop water stress for potatoes grown during the monsoon season in the Satara district of Maharashtra. Pre-season large-scale climate predictors used to forecast water stress were selected based on an exhaustive search method that evaluates for highest ranked probability skill score and lowest root-mean-squared error in a leave-one-out cross-validation mode. Adaptive forecasts were made in the years …


The Impact Of Truncating Data On The Predictive Ability For Single-Step Genomic Best Linear Unbiased Prediction, Jeremy T. Howard, Thomas A. Rathje, Caitlyn E. Bruns, Danielle F. Wilson-Wells, Stephen D. Kachman, Matthew L. Spangler Jan 2018

The Impact Of Truncating Data On The Predictive Ability For Single-Step Genomic Best Linear Unbiased Prediction, Jeremy T. Howard, Thomas A. Rathje, Caitlyn E. Bruns, Danielle F. Wilson-Wells, Stephen D. Kachman, Matthew L. Spangler

Department of Animal Science: Faculty Publications

Simulated and swine industry data sets were utilized to assess the impact of removing older data on the predictive ability of selection candidate estimated breeding values (EBV) when using single-step genomic best linear unbiased prediction (ssGBLUP). Simulated data included thirty replicates designed to mimic the structure of swine data sets. For the simulated data, varying amounts of data were truncated based on the number of ancestral generations back from the selection candidates. The swine data sets consisted of phenotypic and genotypic records for three traits across two breeds on animals born from 2003 to 2017. Phenotypes and genotypes were iteratively …


Testing The Independence Hypothesis Of Accepted Mutations For Pairs Of Adjacent Amino Acids In Protein Sequences, Jyotsna Ramanan, Peter Revesz Jul 2017

Testing The Independence Hypothesis Of Accepted Mutations For Pairs Of Adjacent Amino Acids In Protein Sequences, Jyotsna Ramanan, Peter Revesz

School of Computing: Faculty Publications

Evolutionary studies usually assume that the genetic mutations are independent of each other. However, that does not imply that the observed mutations are independent of each other because it is possible that when a nucleotide is mutated, then it may be biologically beneficial if an adjacent nucleotide mutates too. With a number of decoded genes currently available in various genome libraries and online databases, it is now possible to have a large-scale computer-based study to test whether the independence assumption holds for pairs of adjacent amino acids. Hence the independence question also arises for pairs of adjacent amino acids within …


Novel Models Of Visual Topographic Map Alignment In The Superior Colliculus., Ruben A Tikidji-Hamburyan, Tarek A El-Ghazawi, Jason W. Triplett Dec 2016

Novel Models Of Visual Topographic Map Alignment In The Superior Colliculus., Ruben A Tikidji-Hamburyan, Tarek A El-Ghazawi, Jason W. Triplett

Pediatrics Faculty Publications

The establishment of precise neuronal connectivity during development is critical for sensing the external environment and informing appropriate behavioral responses. In the visual system, many connections are organized topographically, which preserves the spatial order of the visual scene. The superior colliculus (SC) is a midbrain nucleus that integrates visual inputs from the retina and primary visual cortex (V1) to regulate goal-directed eye movements. In the SC, topographically organized inputs from the retina and V1 must be aligned to facilitate integration. Previously, we showed that retinal input instructs the alignment of V1 inputs in the SC in a manner dependent on …


Hpcnmf: A High-Performance Toolbox For Non-Negative Matrix Factorization, Karthik Devarajan, Guoli Wang Feb 2016

Hpcnmf: A High-Performance Toolbox For Non-Negative Matrix Factorization, Karthik Devarajan, Guoli Wang

COBRA Preprint Series

Non-negative matrix factorization (NMF) is a widely used machine learning algorithm for dimension reduction of large-scale data. It has found successful applications in a variety of fields such as computational biology, neuroscience, natural language processing, information retrieval, image processing and speech recognition. In bioinformatics, for example, it has been used to extract patterns and profiles from genomic and text-mining data as well as in protein sequence and structure analysis. While the scientific performance of NMF is very promising in dealing with high dimensional data sets and complex data structures, its computational cost is high and sometimes could be critical for …


Binomial Regression With A Misclassified Covariate And Outcome., Sheng Luo, Wenyaw Chan, Michelle A Detry, Paul J Massman, R S. Doody Feb 2016

Binomial Regression With A Misclassified Covariate And Outcome., Sheng Luo, Wenyaw Chan, Michelle A Detry, Paul J Massman, R S. Doody

Faculty Publications

Misclassification occurring in either outcome variables or categorical covariates or both is a common issue in medical science. It leads to biased results and distorted disease-exposure relationships. Moreover, it is often of clinical interest to obtain the estimates of sensitivity and specificity of some diagnostic methods even when neither gold standard nor prior knowledge about the parameters exists. We present a novel Bayesian approach in binomial regression when both the outcome variable and one binary covariate are subject to misclassification. Extensive simulation results under various scenarios and a real clinical example are given to illustrate the proposed approach. This approach …


Models For Hsv Shedding Must Account For Two Levels Of Overdispersion, Amalia Magaret Jan 2016

Models For Hsv Shedding Must Account For Two Levels Of Overdispersion, Amalia Magaret

UW Biostatistics Working Paper Series

We have frequently implemented crossover studies to evaluate new therapeutic interventions for genital herpes simplex virus infection. The outcome measured to assess the efficacy of interventions on herpes disease severity is the viral shedding rate, defined as the frequency of detection of HSV on the genital skin and mucosa. We performed a simulation study to ascertain whether our standard model, which we have used previously, was appropriately considering all the necessary features of the shedding data to provide correct inference. We simulated shedding data under our standard, validated assumptions and assessed the ability of 5 different models to reproduce the …


An Evolutionary Vaccination Game In The Modified Activity Driven Network By Considering The Closeness, Dun Han, Mei Sun Sep 2015

An Evolutionary Vaccination Game In The Modified Activity Driven Network By Considering The Closeness, Dun Han, Mei Sun

Publications and Research

In this paper, we explore an evolutionary vaccination game in the modified activity driven network by considering the closeness. We set a closeness parameter p which is used to describe the way of connection between two individuals. The simulation results show that the closeness p may have an active role in weakening both the spreading of epidemic and the vaccination. Besides, when vaccination is not allowed, the final recovered density increases with the value of the ratio of the infection rate to the recovery rate λ/μ. However, when vaccination is allowed the final density of recovered individual first increases and …


A Gene-Based Association Method For Mapping Traits Using Reference Transcriptome Data, Eric R. Gamazon, Heather Wheeler, Kaanan P. Shah, Sahar V. Mozaffari, Keston Aquino-Michaels, Robert J. Carroll, Anne E. Eyler, Joshua C. Denny, Gtex Consortium, Dan L. Nicolae, Nancy J. Cox, Hae Kyung Im Sep 2015

A Gene-Based Association Method For Mapping Traits Using Reference Transcriptome Data, Eric R. Gamazon, Heather Wheeler, Kaanan P. Shah, Sahar V. Mozaffari, Keston Aquino-Michaels, Robert J. Carroll, Anne E. Eyler, Joshua C. Denny, Gtex Consortium, Dan L. Nicolae, Nancy J. Cox, Hae Kyung Im

Bioinformatics Faculty Publications

Genome-wide association studies (GWAS) have identified thousands of variants robustly associated with complex traits. However, the biological mechanisms underlying these associations are, in general, not well understood. We propose a gene-based association method called PrediXcan that directly tests the molecular mechanisms through which genetic variation affects phenotype. The approach estimates the component of gene expression determined by an individual’s genetic profile and correlates ‘imputed’ gene expression with the phenotype under investigation to identify genes involved in the etiology of the phenotype. Genetically regulated gene expression is estimated using whole-genome tissue-dependent prediction models trained with reference transcriptome data sets. PrediXcan enjoys …


Light Pollution Research Through Citizen Science, John Kanemoto Aug 2014

Light Pollution Research Through Citizen Science, John Kanemoto

STAR Program Research Presentations

Light pollution (LP) can disrupt and/or degrade the health of all living things, as well as, their environments. The goal of my research at the NOAO was to check the accuracy of the citizen science LP reporting systems entitled: Globe at Night (GaN), Dark Sky Meter (DSM), and Loss of the Night (LoN). On the GaN webpage, the darkness of the night sky (DotNS) is reported by selecting a magnitude chart. Each magnitude chart has a different density/number of stars around a specific constellation. The greater number of stars implies a darker night sky. Within the DSM iPhone application, a …


Relationships Between Elements Of Leslie Matrices And Future Growth Of The Population, Lorisha Lynn Riley Mar 2014

Relationships Between Elements Of Leslie Matrices And Future Growth Of The Population, Lorisha Lynn Riley

Honors Program Projects

Leslie matrices have been used for years to model and predict the growth of animal populations. Recently, general rules have been applied that can relatively easily determine whether an animal population will grow or decline. My mentor, Dr. Justin Brown and I examine, more specifically, whether there are relationships between certain elements of a population and the dominant eigenvalue, which determines growth. Not only do we consider the general 3x3 Leslie matrix, but also we looked into modified versions for incomplete data and migration models of Leslie matrices. We successfully found several connections within these cases; however, there is much …


On The Joys Of Missing Data, Todd D. Little, Terrence D. Jorgensen, Kyle M. Lang, E. Whitney G. Moore Jan 2014

On The Joys Of Missing Data, Todd D. Little, Terrence D. Jorgensen, Kyle M. Lang, E. Whitney G. Moore

Kinesiology, Health and Sport Studies

We provide conceptual introductions to missingness mechanisms—missing completely at random (MCAR), missing at random (MAR), and missing not at random (MNAR)—and state-of-the-art methods of handling missing data—full-information maximum likelihood (FIML) and multiple imputation (MI)—followed by a discussion of planned missing designs: multiform questionnaire protocols, two-method measurement models, and wave-missing longitudinal designs. We reviewed 80 articles of empirical studies published in the 2012 issues of the Journal of Pediatric Psychology to present a picture of how adequately missing data are currently handled in this field. To illustrate the benefits of utilizing MI or FIML and incorporating planned missingness into study designs, …


Planned Missing Data Designs & Small Sample Size: How Small Is Too Small?, Fan Jia, E. Whitney G. Moore, Richard Kinai, Kelly S. Crowe, Alexander M. Schoemann, Todd D. Little Jan 2014

Planned Missing Data Designs & Small Sample Size: How Small Is Too Small?, Fan Jia, E. Whitney G. Moore, Richard Kinai, Kelly S. Crowe, Alexander M. Schoemann, Todd D. Little

Kinesiology, Health and Sport Studies

Utilizing planned missing data (PMD) designs (ex. 3-form surveys) enables researchers to ask participants fewer questions during the data collection process. An important question, however, is just how few participants are needed to effectively employ planned missing data designs in research studies. This paper explores this question by using simulated three-form planned missing data to assess analytic model convergence, parameter estimate bias, standard error bias, mean squared error (MSE), and relative efficiency (RE).Three models were examined: a one-time point, cross-sectional model with 3 constructs; a two-time point model with 3 constructs at each time point; and a three-time point, mediation …


The Initial Phases Of A Consistent Pricing System That Reflects The Online Sale Value Of A Horse, Curran A. Prettyman Jan 2014

The Initial Phases Of A Consistent Pricing System That Reflects The Online Sale Value Of A Horse, Curran A. Prettyman

Lewis Honors College Capstone Collection

Horses are one of the most uniquely priced commodities. This document provides a solution to an industry-wide weakness of inconsistent pricing and confusion. In the following report, an evaluation of the industry flaw is presented, an econometric approach is described in full, and a solution is proposed using insight gained from a regression analysis. This report uses an econometric approach to determine the impact of hunter jumper horse qualities on internet sale prices. Data is compiled from bigeq.com for seventy-eight horses in the states of Illinois, Indiana, Kentucky, Michigan, and Ohio. A linear regression analysis for twelve variables establishes that …


Gulf-Wide Decreases In The Size Of Large Coastal Sharks Documented By Generations Of Fishermen, Sean P. Powers, F. Joel Frodrie, Steven B. Scyphers, J. Marcus Drymon, Robert L. Shipp, Gregory W. Stunz Jan 2013

Gulf-Wide Decreases In The Size Of Large Coastal Sharks Documented By Generations Of Fishermen, Sean P. Powers, F. Joel Frodrie, Steven B. Scyphers, J. Marcus Drymon, Robert L. Shipp, Gregory W. Stunz

University Faculty and Staff Publications

Large sharks are top predators in most coastal and marine ecosystems throughout the world, and evidence of their reduced prominence in marine ecosystems has been a serious concern for fisheries and ecosystem management. Unfortunately, quantitative data to document the extent, timing, and consequences of changes in shark populations are scarce, thwarting examination of long-term (decadal, century) trends, and reconstructions based on incomplete data sets have been the subject of debate. Absence of quantitative descriptors of past ecological conditions is a generic problem facing many fields of science but is particularly troublesome for fisheries scientists who must develop specific targets for …


Grts And Graphs: Monitoring Natural Resources In Urban Landscapes, Todd R. Lookingbill, John Paul Schmit, Shawn L. Carter Jan 2012

Grts And Graphs: Monitoring Natural Resources In Urban Landscapes, Todd R. Lookingbill, John Paul Schmit, Shawn L. Carter

Geography and the Environment Faculty Publications

Environmental monitoring programs are an important tool for providing land managers with a scientific basis for management decisions. However, many ecological processes operate on spatial scales that transcend management boundaries (Schonewald-Cox 1988). For example, adjacent lands may influence protected-area resources via edge effects, source-sink dynamics, or invasion processes (Jones et al. 2009). Hydrologic alterations outside management units also may have profound effects on the integrity of resources being managed (Pringle 2000). The impacts of climate change are presenting challenges to resource management at local-to-global scales (Karl et al. 2009). This potential disparity between ecological and political boundaries presents an interesting …


Habitat Use And Abundance Patterns Of Sandhill Cranes In The Central Platte River Valley, Nebraska, 2003–2010, Todd Joseph Buckley Nov 2011

Habitat Use And Abundance Patterns Of Sandhill Cranes In The Central Platte River Valley, Nebraska, 2003–2010, Todd Joseph Buckley

School of Natural Resources: Dissertations, Theses, and Student Research

The Central Platte River Valley (CPRV) in Nebraska is an important spring stopover area for the midcontinent population of sandhill cranes. Alterations to crop rotation and loss habitat in the CPRV pose a risk to the population. Personnel drove designated routes in the CPRV from 2003–2010 to record the presence of cranes in agricultural fields and estimate abundance. I developed and evaluated models to predict habitat use and flock sizes. Alfalfa was predicted to receive the highest use followed by corn, soybeans, winter wheat, grassland, and shrubland. Use of all habitats and flock size increased as field area increased. Flock …


A Unified Approach To Non-Negative Matrix Factorization And Probabilistic Latent Semantic Indexing, Karthik Devarajan, Guoli Wang, Nader Ebrahimi Jul 2011

A Unified Approach To Non-Negative Matrix Factorization And Probabilistic Latent Semantic Indexing, Karthik Devarajan, Guoli Wang, Nader Ebrahimi

COBRA Preprint Series

Non-negative matrix factorization (NMF) by the multiplicative updates algorithm is a powerful machine learning method for decomposing a high-dimensional nonnegative matrix V into two matrices, W and H, each with nonnegative entries, V ~ WH. NMF has been shown to have a unique parts-based, sparse representation of the data. The nonnegativity constraints in NMF allow only additive combinations of the data which enables it to learn parts that have distinct physical representations in reality. In the last few years, NMF has been successfully applied in a variety of areas such as natural language processing, information retrieval, image processing, speech recognition …


A Bayesian Shared Component Model For Genetic Association Studies, Juan J. Abellan, Carlos Abellan, Juan R. Gonzalez Nov 2010

A Bayesian Shared Component Model For Genetic Association Studies, Juan J. Abellan, Carlos Abellan, Juan R. Gonzalez

COBRA Preprint Series

We present a novel approach to address genome association studies between single nucleotide polymorphisms (SNPs) and disease. We propose a Bayesian shared component model to tease out the genotype information that is common to cases and controls from the one that is specific to cases only. This allows to detect the SNPs that show the strongest association with the disease. The model can be applied to case-control studies with more than one disease. In fact, we illustrate the use of this model with a dataset of 23,418 SNPs from a case-control study by The Welcome Trust Case Control Consortium (2007) …


Minimum Description Length And Empirical Bayes Methods Of Identifying Snps Associated With Disease, Ye Yang, David R. Bickel Nov 2010

Minimum Description Length And Empirical Bayes Methods Of Identifying Snps Associated With Disease, Ye Yang, David R. Bickel

COBRA Preprint Series

The goal of determining which of hundreds of thousands of SNPs are associated with disease poses one of the most challenging multiple testing problems. Using the empirical Bayes approach, the local false discovery rate (LFDR) estimated using popular semiparametric models has enjoyed success in simultaneous inference. However, the estimated LFDR can be biased because the semiparametric approach tends to overestimate the proportion of the non-associated single nucleotide polymorphisms (SNPs). One of the negative consequences is that, like conventional p-values, such LFDR estimates cannot quantify the amount of information in the data that favors the null hypothesis of no disease-association.

We …


Conservation Implications Of A Marbled Salamander, Ambystoma Opacum, Metapopulation Model, Ethan B. Plunkett Jan 2009

Conservation Implications Of A Marbled Salamander, Ambystoma Opacum, Metapopulation Model, Ethan B. Plunkett

Masters Theses 1911 - February 2014

Amphibians are in decline globally and a significantly greater percentage of ambystomatid salamander species are in decline relative to other species; habitat loss contributes significantly to this decline. The goals of this thesis is to better understand extinction risk in a marbled salamander (ambystoma opacum) population and how forestry effects extinction risk. To achieve this goal we first estimated an important life history parameter (Chapter 1) then used a metapopulation model to estimate population viability and determine what aspects of their life history put them most at risk (Chapter 2) and finally predicted extinction risk in response to hypothetical forestry …


Multiple Imputation To Correct For Measurement Error In Admixture Estimates In Genetic Structured Association Testing, Miguel A. Padilla, Jamin Divers, Laura K. Vaughan, David B. Allison, Hemant K. Tiwari Jan 2009

Multiple Imputation To Correct For Measurement Error In Admixture Estimates In Genetic Structured Association Testing, Miguel A. Padilla, Jamin Divers, Laura K. Vaughan, David B. Allison, Hemant K. Tiwari

Psychology Faculty Publications

Objectives: Structured association tests ( SAT), like any statistical model, assumes that all variables are measured without error. Measurement error can bias parameter estimates and confound residual variance in linear models. It has been shown that admixture estimates can be contaminated with measurement error causing SAT models to suffer from the same afflictions. Multiple imputation (MI) is presented as a viable tool for correcting measurement error problems in SAT linear models with emphasis on correcting measurement error contaminated admixture estimates. Methods: Several MI methods are presented and compared, via simulation, in terms of controlling Type I error rates for both …


The Strength Of Statistical Evidence For Composite Hypotheses With An Application To Multiple Comparisons, David R. Bickel Nov 2008

The Strength Of Statistical Evidence For Composite Hypotheses With An Application To Multiple Comparisons, David R. Bickel

COBRA Preprint Series

The strength of the statistical evidence in a sample of data that favors one composite hypothesis over another may be quantified by the likelihood ratio using the parameter value consistent with each hypothesis that maximizes the likelihood function. Unlike the p-value and the Bayes factor, this measure of evidence is coherent in the sense that it cannot support a hypothesis over any hypothesis that it entails. Further, when comparing the hypothesis that the parameter lies outside a non-trivial interval to the hypotheses that it lies within the interval, the proposed measure of evidence almost always asymptotically favors the correct hypothesis …