Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Statistics and Probability

2015

Articles 571 - 597 of 597

Full-Text Articles in Physical Sciences and Mathematics

New Results In Ell_1 Penalized Regression, Edward A. Roualdes Jan 2015

Theses and Dissertations--Statistics

Here we consider penalized regression methods, and extend the results surrounding the l1 norm penalty. We address a more recent development that generalizes previous methods by penalizing a linear transformation of the coefficients of interest instead of penalizing just the coefficients themselves. We introduce an approximate algorithm to fit this generalization and a fully Bayesian hierarchical model that is a direct analogue of the frequentist version. A number of benefits are derived from the Bayesian perspective; most notably choice of the tuning parameter and natural means to estimate the variation of estimates – a notoriously difficult task for the …


The Sensitivity Of A Test Based On Spearman's Rho In Cross-Correlation Change Point Problems, Congjian Liu Jan 2015

Electronic Theses and Dissertations

In change point problems, there are three main questions that researchers are interested in. First of all, is there a change point or not? Second, when does the change point occur in a time series? Third, how quickly can we detect the change point? In this thesis, we first explain what a change point is, and what a cross-correlation is. We then discuss prior research in this area. Then we discuss and examine a test based on Spearman's rho, introduced by Wied and Dehling (2011), which tests the null hypothesis of no change point, and compare the change point we …


Statistical Modeling Of Microrna Expression With Human Cancers, Ke-Sheng Wang, Yue Pan, Chun Xu Jan 2015

Health & Biomedical Sciences Faculty Publications and Presentations

MicroRNAs (miRNAs) are small non-coding RNAs (containing about 22 nucleotides) that regulate gene expression. MiRNAs are involved in many different biological processes such as cell proliferation, differentiation, apoptosis, fat metabolism, and human cancer genes; miRNAs may also function as candidate diagnostic and prognostic biomarkers and predictors of drug response. This paper emphasizes the statistical methods in the analysis of the associations of miRNA gene expression with human cancers and related clinical phenotypes: 1) simple statistical methods include the chi-square test, correlation analysis, t-test and one-way ANOVA; 2) regression models include linear and logistic regression; 3) survival analysis approaches such as …


Controlling For Confounding When Association Is Quantified By Area Under The Roc Curve, Hadiza I. Galadima Jan 2015

Theses and Dissertations

In the medical literature, there has been an increased interest in evaluating association between exposure and outcomes using nonrandomized observational studies. However, because assignments to exposure are not done randomly in observational studies, comparisons of outcomes between exposed and non-exposed subjects must account for the effect of confounders. Propensity score methods have been widely used to control for confounding when estimating exposure effects. Previous studies have shown that conditioning on the propensity score results in biased estimation of the odds ratio and hazard ratio. However, there is a lack of research into the performance of propensity score methods for estimating the …


High-Throughput Data Analysis: Application To Micronuclei Frequency And T-Cell Receptor Sequencing, Mateusz Makowski Jan 2015

Theses and Dissertations

The advent of high-throughput sequencing has brought about the creation of an unprecedented amount of research data. Analytical methodology has not been able to keep pace with the plethora of data being produced. Two assays, ImmunoSEQ and the cytokinesis-block micronucleus (CBMN) assay, that both produce count data and have few methods available to analyze them are considered.

ImmunoSEQ is a sequencing assay that measures the beta T-cell receptor (TCR) repertoire. The ImmunoSEQ assay was used to describe the TCR repertoires of patients that have undergone hematopoietic stem cell transplantation (HSCT). Several different methods for spectratype analysis were extended to the TCR …


Considerations For Screening Designs And Follow-Up Experimentation, Robert D. Leonard Jan 2015

Theses and Dissertations

The success of screening experiments hinges on the effect sparsity assumption, which states that only a few of the factorial effects of interest actually have an impact on the system being investigated. The development of a screening methodology to harness this assumption requires careful consideration of the strengths and weaknesses of a proposed experimental design in addition to the ability of an analysis procedure to properly detect the major influences on the response. However, for the most part, screening designs and their complementing analysis procedures have been proposed separately in the literature without clear consideration of their ability to perform …


Proof-Of-Concept Of Environmental Dna Tools For Atlantic Sturgeon Management, Jameson Hinkle Jan 2015

Theses and Dissertations

The Atlantic Sturgeon (Acipenser oxyrinchus oxyrinchus, Mitchell) is an anadromous species that spawns in tidal freshwater rivers from Canada to Florida. Overfishing, river sedimentation and alteration of the river bottom have decreased Atlantic Sturgeon populations, and NOAA lists the species as endangered. Ecologists sometimes find it difficult to locate individuals of a species that is rare, endangered or invasive. Less invasive methods that can better resolve the presence of cryptic species are needed. Environmental DNA (eDNA) is a non-invasive means of detecting rare, endangered, or invasive species by isolating nuclear or mitochondrial DNA (mtDNA) from the …


Bayesian Inference Of The Weibull-Pareto Distribution, James Dow Jan 2015

Electronic Theses and Dissertations

The Weibull distribution has applications in many fields, including survival analysis, reliability engineering, general insurance, electrical engineering, and industrial engineering. The Weibull distribution was further extended by the Weibull-Pareto distribution. A desirable property of this distribution is that its shape can skew, allowing it to better model left- or right-skewed data. Examples of skewed data include human longevity and actuarial data. In this work a hierarchical Bayesian model was developed using the Weibull-Pareto distribution.


Comparing Welch's Anova, A Kruskal-Wallis Test And Traditional Anova In Case Of Heterogeneity Of Variance, Hangcheng Liu Jan 2015

Theses and Dissertations

Analysis of variance (ANOVA) is robust to violations of the normality assumption, but it may be inappropriate when the assumption of homogeneity of variance has been violated. Welch's ANOVA and the Kruskal-Wallis test (a non-parametric method) can be applied in this case. In this study we compare the three methods in terms of empirical type I error rate and power when heterogeneity of variance occurs, and determine which method is most suitable in which cases, including balanced/unbalanced designs, small/large sample sizes, and normal/non-normal distributions.


Key Factors Driving Personnel Downsizing In Multinational Military Organizations, Ilksen Gorkem, Resit Unal, Pilar Pazos Jan 2015

Engineering Management & Systems Engineering Faculty Publications

Although downsizing has long been a topic of research in traditional organizations, there are very few studies of this phenomenon in military contexts. As a result, we have little understanding of the key factors that drive personnel downsizing in military settings. This study contributes to our understanding of key factors that drive personnel downsizing in military organizations and whether those factors may differ across NATO nations’ cultural clusters. The theoretical framework for this study was built from studies in non-military contexts and adapted to fit the military environment.

This research relies on historical data from one of the largest multinational …


Statistical Engineering: A Causal-Stochastic Modeling Research Update, Teddy Steven Cotter Jan 2015

Engineering Management & Systems Engineering Faculty Publications

In the ASEM-IAC 2012, Cotter (2012) summarized prior works that led to the proposal for statistical engineering, identified the gaps in knowledge that statistical engineering needs to address, explored additional gaps in knowledge not addressed in the prior works, set forth a working definition of and body of knowledge for statistical engineering, and set forth proposals of potential systems contributions the Engineering Management profession could make toward the development of statistical engineering. In 2014, the ASQ Statistics Division, DOT&E, NASA, and IDA co-sponsored a Statistical Engineering Agreement to jointly research development of the discipline of statistical engineering. The statistics community …


Matching Methods For Biomarker Evaluation: A Mapping With Causal Inference, Debashis Ghosh, Michael Sabel Dec 2014

Debashis Ghosh

In many medical settings, there is interest in evaluating the predictive ability of a candidate biomarker while adjusting appropriately for confounding factors. Recently, Janes and Pepe (2008, Biometrics 64: 1-9) evaluated the effects of matching on classification accuracy for biomarkers. In this article, we note an analogy between the use of matching in causal inference and its role in the biomarker evaluation problem. This allows us to import much of the literature on matching from causal inferential settings to the biomarker evaluation problem. This leads to a theoretical characterization of the bias properties of …


A Review Of Frequentist Tests For The 2x2 Binomial Trial, Chris Lloyd Dec 2014

Chris J. Lloyd

The 2x2 binomial trial is the simplest of data structures, yet its statistical analysis and the issues it raises have been debated and revisited for over 70 years. Which analysis should biomedical researchers use in applications? In this review, we consider frequentist tests only, specifically tests that control size either exactly or very nearly exactly. These procedures can be classified as conditional and unconditional. Amongst tests motivated by a conditional model, Lancaster’s mid-p and Liebermeister’s test are less conservative than Fisher’s classical test, but do not control type 1 error. Within the conditional framework, only Fisher’s test can be …


Prognostic And Predictive Directions For Clinical Trials, Debashis Ghosh Dec 2014

Debashis Ghosh

In many clinical trials, treatment effects can be quite heterogeneous across subgroups so that individuals in different subgroups can receive different benefits of the treatment. This can be quite important for clinical decision-making. In this article, we introduce a general concept of risk score that is motivated by potential outcomes considerations. The concepts of prognostic and predictive directions for outcome data are defined. Their basis is in the dimension reduction (DR) literature and can also be motivated by conditional independence assumptions. Under some conditions, one can use existing methods from the DR literature to …


2015_Balzer_Adaptive.Pdf, Laura Balzer Dec 2014

Laura B. Balzer

In randomized trials, pair-matching is an intuitive design strategy to protect study validity and to potentially increase study power. In a common design, candidate units are identified, and their baseline characteristics used to create the best n/2 matched pairs. Within the resulting pairs, the intervention is randomized, and the outcomes measured at the end of follow-up. We consider this design to be adaptive, because the construction of the matched pairs depends on the baseline covariates of all candidate units. As a consequence, the observed data cannot be considered as n/2 independent, identically distributed pairs of units, as common practice assumes. Instead, the observed …


Integrated Survival Analysis Using An Event Time Approach In A Bayesian Framework, Daniel Walsh, Victoria Dreitz, Dennis Heisey Dec 2014

Victoria Dreitz

No abstract provided.


Case Studies In Evaluating Time Series Prediction Models Using The Relative Mean Absolute Error, Nicholas G. Reich, Justin Lessler, Krzysztof Sakrejda, Stephen A. Lauer, Sopon Iamsirithaworn, Derek A T Cummings Dec 2014

Nicholas G Reich

Statistical prediction models inform decision-making processes in many real-world settings. Prior to using predictions in practice, one must rigorously test and validate candidate models to ensure that the proposed predictions have sufficient accuracy to be used in practice. In this paper, we present a framework for evaluating time series predictions that emphasizes computational simplicity and an intuitive interpretation using the relative mean absolute error metric. For a single time series, this metric enables comparisons of candidate model predictions against naive reference models, a method that can provide useful and standardized performance benchmarks. Additionally, in applications with multiple time series, this …
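As an illustration of the metric named in this abstract, the sketch below computes a relative mean absolute error for a single time series. It assumes the common definition, the candidate model's MAE divided by the MAE of a naive reference model; the truncated abstract does not spell out the exact formula, so this is an assumption. The function names and the "last observed value" reference forecast are hypothetical choices made for the example, not taken from the paper.

```python
from typing import List, Sequence


def mean_absolute_error(observed: Sequence[float], predicted: Sequence[float]) -> float:
    """Average absolute difference between observations and predictions."""
    if len(observed) != len(predicted) or len(observed) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / len(observed)


def relative_mae(
    observed: Sequence[float],
    candidate: Sequence[float],
    reference: Sequence[float],
) -> float:
    """Assumed definition: MAE(candidate) / MAE(reference).

    Values below 1 mean the candidate beats the reference on this series.
    """
    reference_mae = mean_absolute_error(observed, reference)
    if reference_mae == 0:
        raise ValueError("reference model has zero error; ratio is undefined")
    return mean_absolute_error(observed, candidate) / reference_mae


def last_value_forecast(series: Sequence[float]) -> List[float]:
    """Hypothetical naive reference: predict each point with the previous one.

    Returns forecasts aligned with series[1:].
    """
    return list(series[:-1])


if __name__ == "__main__":
    series = [10.0, 12.0, 15.0, 11.0, 14.0]
    observed = series[1:]                      # targets for one-step-ahead forecasts
    naive = last_value_forecast(series)        # reference predictions
    candidate = [11.0, 14.0, 13.0, 13.0]       # made-up candidate predictions
    print(relative_mae(observed, candidate, naive))
```

Because the ratio is unitless, it can be compared across series measured on different scales, which is presumably what makes it useful as a standardized benchmark.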


Studying The Effects Of Non Oil Exports On Targeted Economic Growth In Iranian 5th Development Plan: A Computable General Equilibrium Approach, Rasoul Bakhsi Dastjerdi Dr., Reza Moosavi Mohseni Dr., Somayye Jafari Dec 2014

Reza Moosavi Mohseni

We investigate the effects of non-oil exports on Iran’s economic growth using a computable general equilibrium (CGE) model and study which tradable sectors have a larger share in reaching the targeted growth rate of 8% in the 5th socio-economic development plan. We calibrate the model in GAMS (with emphasis on the foreign trade sector). The numerical solution to the model is based on Iran’s social accounting matrix (SAM). Results show that 2.03% of the targeted economic growth rate is achieved by encouraging a 6% growth in exports. It should also be mentioned that the industry and mining sector in Iran has more influence on growth than …


Copula Modelling Of Dependence In Multivariate Time Series, Michael S. Smith Dec 2014

Michael Stanley Smith

Almost all existing nonlinear multivariate time series models remain linear, conditional on a point in time or latent regime. Here, an alternative is proposed, where nonlinear serial and cross-sectional dependence is captured by a copula model. The copula defines a multivariate time series on the unit cube. A drawable vine copula is employed, along with a factorization which allows the marginal and transitional densities of the time series to be expressed analytically. The factorization also provides for simple conditions under which the series is stationary and/or Markov, as well as being parsimonious. A parallel algorithm for computing the likelihood is …


Adaptive Pair-Matching In Randomized Trials With Unbiased And Efficient Effect Estimation, Laura Balzer, M Petersen, M Van Der Laan, The Search Consortium Dec 2014

Laura B. Balzer

In randomized trials, pair-matching is an intuitive design strategy to protect study validity and to potentially increase study power. In a common design, candidate units are identified, and their baseline characteristics used to create the best n/2 matched pairs. Within the resulting pairs, the intervention is randomized, and the outcomes measured at the end of follow-up. We consider this design to be adaptive, because the construction of the matched pairs depends on the baseline covariates of all candidate units. As a consequence, the observed data cannot be considered as n/2 independent, identically distributed pairs of units, as common practice assumes. Instead, the observed …


Simulating Univariate And Multivariate Nonnormal Distributions Through The Method Of Percentiles, Jennifer Koran, Todd C. Headrick, Tzu Chun Kuo Dec 2014

Todd Christopher Headrick

This article derives a standard normal-based power method polynomial transformation for Monte Carlo simulation studies, approximating distributions, and fitting distributions to data based on the method of percentiles. The proposed method is used primarily when (1) conventional (or L) moment-based estimators such as skew (or L-skew) and kurtosis (or L-kurtosis) are unknown or (2) data are unavailable but percentiles are known (e.g., standardized test score reports). The proposed transformation also has the advantage that solutions to polynomial coefficients are available in simple closed form, which obviates numerical equation solving. A procedure is also described for simulating power method …


Optimal Full Matching For Survival Outcomes: A Method That Merits More Widespread Use, Peter Austin, Elizabeth Stuart Dec 2014

Peter Austin

Matching on the propensity score is a commonly used analytic method for estimating the effects of treatments on outcomes. Commonly used propensity score matching methods include nearest neighbor matching and nearest neighbor caliper matching. Rosenbaum (1991) proposed an optimal full matching approach, in which matched strata are formed consisting of either one treated subject and at least one control subject or one control subject and at least one treated subject. Full matching has been used rarely in the applied literature. Furthermore, its performance for use with survival outcomes has not been rigorously evaluated. We propose a method to use full …


Quantile Rank Maps: A New Tool For Understanding Individual Brain Development, Huaihou Chen, Clare Kelly, F. Xavier Castellanos, Ye He, Xi-Nian Zuo, Philip T. Reiss Dec 2014

Philip T. Reiss

We propose a novel method for neurodevelopmental brain mapping that displays how an individual’s values for a quantity of interest compare with age-specific norms. By estimating smoothly age-varying distributions at a set of brain regions of interest, we derive age-dependent region-wise quantile ranks for a given individual, which can be presented in the form of a brain map. Such quantile rank maps could potentially be used for clinical screening. Bootstrap-based confidence intervals are proposed for the quantile rank estimates. We also propose a recalibrated Kolmogorov-Smirnov test for detecting group differences in the age-varying distribution. This test is shown to be …


Cross-Validation And Hypothesis Testing In Neuroimaging: An Irenic Comment On The Exchange Between Friston And Lindquist Et Al., Philip T. Reiss Dec 2014

Philip T. Reiss

The “ten ironic rules for statistical reviewers” presented by Friston (2012) prompted a rebuttal by Lindquist et al. (2013), which was followed by a rejoinder by Friston (2013). A key issue left unresolved in this discussion is the use of cross-validation to test the significance of predictive analyses. This note discusses the role that cross-validation-based and related hypothesis tests have come to play in modern data analyses, in neuroimaging and other fields. It is shown that such tests need not be suboptimal and can fill otherwise-unmet inferential needs.


Epidemiology Of Latency And Relapse In Plasmodium Vivax Malaria, Andrew A. Lover Dec 2014

Andrew Lover

Malaria is a major contributor to health burdens throughout the regions where it is endemic. Historically, it was believed that there was limited morbidity and essentially no mortality associated with Plasmodium vivax; however, evidence from diverse settings now suggests that infections with P. vivax can be both severe and fatal. This awareness has highlighted a critical gap: the vast majority of research has been directed towards P. falciparum, leading to a decades-long neglect of epidemiological and clinical studies of P. vivax. There exists a large body of historical data on human experimental infections with P. vivax; these studies in controlled …


Considerations For Comprehensive Analyses Of Sporozoite-Based Controlled Human Malaria Infection Studies, Andrew A. Lover Dec 2014

Andrew Lover

There has been renewed interest in the use of sporozoite-based approaches for controlled human malaria infections (CHMIs), and several sets of human challenge studies have recently been completed. A study undertaken in Tanzania and published in 2014 found dose dependence between 10,000 and 25,000 sporozoite doses, as well as divergent times-to-parasitemia relative to earlier studies in European volunteers, with important implications for planning future studies. Analysis of time-to-event data has had extensive development in recent years, but these methods have had limited exposure outside biostatistics. Expansion of the published analyses to include recent methodological approaches optimized for the types of data …


Short Report: Study Variability In Recent Human Challenge Experiments With Plasmodium Falciparum Sporozoites (Pfspz Challenge), Andrew A. Lover Dec 2014

Andrew Lover

There has been renewed interest in the use of sporozoite-based approaches for malaria vaccination and controlled human infections, and several sets of human challenge studies have recently been completed. A study undertaken in Tanzania and published in 2014 found dose-dependence between 10,000 and 25,000 sporozoite doses, as well as divergent times-to-parasitemia relative to earlier studies in European volunteers. However, this analysis shows that these conclusions are based upon suboptimal analytical methods. With more appropriate analysis, there is no evidence for dose-dependence within this dose range and, more importantly, no evidence for differences in event times between Dutch and Tanzanian study sites. …