Physical Sciences and Mathematics | Open Access Articles

Developing An Alternative Way To Analyze Nanostring Data, Shu Shen Jan 2016

Developing An Alternative Way To Analyze Nanostring Data, Shu Shen

Theses and Dissertations--Statistics

Nanostring technology provides a new method to measure gene expressions. It's more sensitive than microarrays and able to do more gene measurements than RT-PCR with similar sensitivity. This system produces counts for each target gene and tabulates them. Counts can be normalized by using an Excel macro or nSolver before analysis. Both methods rely on data normalization prior to statistical analysis to identify differentially expressed genes. Alternatively, we propose to model gene expressions as a function of positive controls and reference gene measurements. Simulations and examples are used to compare this model with Nanostring normalization methods. The results show that …

Go to article

Statistical Inference On Trimmed Means, Lorenz Curves, And Partial Area Under Roc Curves By Empirical Likelihood Method, Yumin Zhao Jan 2016

Statistical Inference On Trimmed Means, Lorenz Curves, And Partial Area Under Roc Curves By Empirical Likelihood Method, Yumin Zhao

Theses and Dissertations--Statistics

Traditionally the inference on trimmed means, Lorenz Curves, and partial AUC (pAUC) under ROC curves have been done based on the asymptotic normality of the statistics. Based on the theory of empirical likelihood, in this dissertation we developed novel methods to do statistical inferences on trimmed means, Lorenz curves, and pAUC. A common characteristic among trimmed means, Lorenz curves, and pAUC is that their inferences are not based on the whole set of samples. Qin and Tsao (2002), Qin et al. (2013), and Qin et al. (2011) recently published their re- searches on the inferences of trimmed means, Lorenz curves, …

Go to article

Evaluating A Bystander Intervention Program On Reproductive Coercion: Using Quasi-Experimental Design Strategies To Address Methodologic Issues In Randomized Community Prevention Trials, Catherine P. Starnes Jan 2016

Evaluating A Bystander Intervention Program On Reproductive Coercion: Using Quasi-Experimental Design Strategies To Address Methodologic Issues In Randomized Community Prevention Trials, Catherine P. Starnes

Theses and Dissertations--Epidemiology and Biostatistics

Community (or cluster) randomized trials are trials in which communities or groups of individuals (clusters) are randomized to receive the intervention of interest. Community randomized trials frequently more closely resemble a natural experiment than a randomized controlled trial (RCT) following intervention allocation. In particular, the effects of non-compliance can pose methodologic challenges in estimating the intervention effect which may require a quasiexperimental approach in order to minimize bias.

The motivating example to illustrate these issues is the Green Dot High School (GDHS) study. The GDHS study was a longitudinal, cluster-randomized controlled trial designed to assess the effectiveness of a bystander …

Go to article

Development In Normal Mixture And Mixture Of Experts Modeling, Meng Qi Jan 2016

Development In Normal Mixture And Mixture Of Experts Modeling, Meng Qi

Theses and Dissertations--Statistics

In this dissertation, first we consider the problem of testing homogeneity and order in a contaminated normal model, when the data is correlated under some known covariance structure. To address this problem, we developed a moment based homogeneity and order test, and design weights for test statistics to increase power for homogeneity test. We applied our test to microarray about Down’s syndrome. This dissertation also studies a singular Bayesian information criterion (sBIC) for a bivariate hierarchical mixture model with varying weights, and develops a new data dependent information criterion (sFLIC).We apply our model and criteria to birth- weight and gestational …

Go to article

Multi-State Models With Missing Covariates, Wenjie Lou Jan 2016

Multi-State Models With Missing Covariates, Wenjie Lou

Theses and Dissertations--Statistics

Multi-state models have been widely used to analyze longitudinal event history data obtained in medical studies. The tools and methods developed recently in this area require the complete observed datasets. While, in many applications measurements on certain components of the covariate vector are missing on some study subjects. In this dissertation, several likelihood-based methodologies were proposed to deal with datasets with different types of missing covariates efficiently when applying multi-state models.

Firstly, a maximum observed data likelihood method was proposed when the data has a univariate missing pattern and the missing covariate is a categorical variable. The construction of the …

Go to article

Topics In Logistic Regression Analysis, Zhiheng Xie Jan 2016

Topics In Logistic Regression Analysis, Zhiheng Xie

Theses and Dissertations--Statistics

Discrete-time Markov chains have been used to analyze the transition of subjects from intact cognition to dementia with mild cognitive impairment and global impairment as intervening transient states, and death as competing risk. A multinomial logistic regression model is used to estimate the probability distribution in each row of the one-step transition matrix that correspond to the transient states. We investigate some goodness of fit tests for a multinomial distribution with covariates to assess the fit of this model to the data. We propose a modified chi-square test statistic and a score test statistic for the multinomial assumption in each …

Go to article

Continuous Time Multi-State Models For Interval Censored Data, Lijie Wan Jan 2016

Continuous Time Multi-State Models For Interval Censored Data, Lijie Wan

Theses and Dissertations--Statistics

Continuous-time multi-state models are widely used in modeling longitudinal data of disease processes with multiple transient states, yet the analysis is complex when subjects are observed periodically, resulting in interval censored data. Recently, most studies focused on modeling the true disease progression as a discrete time stationary Markov chain, and only a few studies have been carried out regarding non-homogenous multi-state models in the presence of interval-censored data. In this dissertation, several likelihood-based methodologies were proposed to deal with interval censored data in multi-state models.

Firstly, a continuous time version of a homogenous Markov multi-state model with backward transitions was …

Go to article

Statistical Methods For Environmental Exposure Data Subject To Detection Limits, Yuchen Yang Jan 2016

Statistical Methods For Environmental Exposure Data Subject To Detection Limits, Yuchen Yang

Theses and Dissertations--Statistics

In this dissertation, we develop unified and efficient nonparametric statistical methods for estimating and comparing environmental exposure distributions in presence of detection limits. In the first part, we propose a kernel-smoothed nonparametric estimator for the exposure distribution without imposing any independence assumption between the exposure level and detection limit. We show that the proposed estimator is consistent and asymptotically normal. Simulation studies demonstrate that the proposed estimator performs well in practical situations. A colon cancer study is provided for illustration. In the second part, we develop a class of test statistics to compare exposure distributions between two groups by using …

Go to article

Empirical Likelihood And Differentiable Functionals, Zhiyuan Shen Jan 2016

Empirical Likelihood And Differentiable Functionals, Zhiyuan Shen

Theses and Dissertations--Statistics

Empirical likelihood (EL) is a recently developed nonparametric method of statistical inference. It has been shown by Owen (1988,1990) and many others that empirical likelihood ratio (ELR) method can be used to produce nice confidence intervals or regions. Owen (1988) shows that -2logELR converges to a chi-square distribution with one degree of freedom subject to a linear statistical functional in terms of distribution functions. However, a generalization of Owen's result to the right censored data setting is difficult since no explicit maximization can be obtained under constraint in terms of distribution functions. Pan and Zhou (2002), instead, study the …

Go to article

Statistical Methods For Handling Intentional Inaccurate Responders, Kristen J. Mcquerry Jan 2016

Statistical Methods For Handling Intentional Inaccurate Responders, Kristen J. Mcquerry

Theses and Dissertations--Statistics

In self-report data, participants who provide incorrect responses are known as intentional inaccurate responders. This dissertation provides statistical analyses for address intentional inaccurate responses in the data.

Previous work with adolescent self-report, labeled survey participants who intentionally provide inaccurate answers as mischievous responders. This phenomenon also occurs in clinical research. For example, pregnant women who smoke may report that they are nonsmokers. Our advantage is that we do not solely have self-report answers and can verify responses with lab values. Currently, there is no clear method for handling these intentional inaccurate respondents when it comes to making statistical inferences.

We …

Go to article

Statistical Inference On Dynamical Systems, Hongyuan Wang Jan 2016

Statistical Inference On Dynamical Systems, Hongyuan Wang

Theses and Dissertations--Statistics

The ordinary differential equation (ODE) is one representative and popular tool in modeling dynamical systems, which are widely implemented in physics, biology, economics, chemistry and biomedical sciences, etc. Because of the importance of dynamical systems in scientific studies, they are the main focuses of my dissertation.

The first chapter of the dissertation is introduction and literature review, which mainly focuses on numerical integration algorithms of ODEs that are difficult to solve analytically, as well as derivative-free optimization algorithms for the so-called inverse problem.

The second chapter is on the estimation method based on numerical solvers of differential equations. We start …

Go to article

Aggregated Quantitative Multifactor Dimensionality Reduction, Rebecca E. Crouch Jan 2016

Aggregated Quantitative Multifactor Dimensionality Reduction, Rebecca E. Crouch

Theses and Dissertations--Statistics

We consider the problem of making predictions for quantitative phenotypes based on gene-to-gene interactions among selected Single Nucleotide Polymorphisms (SNPs). Previously, Quantitative Multifactor Dimensionality Reduction (QMDR) has been applied to detect gene-to-gene interactions associated with elevated quantitative phenotypes, by creating a dichotomous predictor from one interaction which has been deemed optimal. We propose an Aggregated Quantitative Multifactor Dimensionality Reduction (AQMDR), which exhaustively considers all k-way interactions among a set of SNPs and replaces the dichotomous predictor from QMDR with a continuous aggregated score. We evaluate this new AQMDR method in a series of simulations for two-way and three-way interactions, …

Go to article

Improved Models For Differential Analysis For Genomic Data, Hong Wang Jan 2016

Improved Models For Differential Analysis For Genomic Data, Hong Wang

Theses and Dissertations--Statistics

This paper intend to develop novel statistical methods to improve genomic data analysis, especially for differential analysis. We considered two different data type: NanoString nCounter data and somatic mutation data. For NanoString nCounter data, we develop a novel differential expression detection method. The method considers a generalized linear model of the negative binomial family to characterize count data and allows for multi-factor design. Data normalization is incorporated in the model framework through data normalization parameters, which are estimated from control genes embedded in the nCounter system. For somatic mutation data, we develop beta-binomial model-based approaches to identify highly or lowly …

Go to article

Physical Sciences and Mathematics Commons^™

Full-Text Articles in Physical Sciences and Mathematics

Developing An Alternative Way To Analyze Nanostring Data, Shu Shen

Theses and Dissertations--Statistics

Statistical Inference On Trimmed Means, Lorenz Curves, And Partial Area Under Roc Curves By Empirical Likelihood Method, Yumin Zhao

Theses and Dissertations--Statistics

Evaluating A Bystander Intervention Program On Reproductive Coercion: Using Quasi-Experimental Design Strategies To Address Methodologic Issues In Randomized Community Prevention Trials, Catherine P. Starnes

Theses and Dissertations--Epidemiology and Biostatistics

Development In Normal Mixture And Mixture Of Experts Modeling, Meng Qi

Theses and Dissertations--Statistics

Multi-State Models With Missing Covariates, Wenjie Lou

Theses and Dissertations--Statistics

Topics In Logistic Regression Analysis, Zhiheng Xie

Theses and Dissertations--Statistics

Continuous Time Multi-State Models For Interval Censored Data, Lijie Wan

Theses and Dissertations--Statistics

Statistical Methods For Environmental Exposure Data Subject To Detection Limits, Yuchen Yang

Theses and Dissertations--Statistics

Empirical Likelihood And Differentiable Functionals, Zhiyuan Shen

Theses and Dissertations--Statistics

Statistical Methods For Handling Intentional Inaccurate Responders, Kristen J. Mcquerry

Theses and Dissertations--Statistics

Statistical Inference On Dynamical Systems, Hongyuan Wang

Theses and Dissertations--Statistics

Aggregated Quantitative Multifactor Dimensionality Reduction, Rebecca E. Crouch

Theses and Dissertations--Statistics

Improved Models For Differential Analysis For Genomic Data, Hong Wang

Theses and Dissertations--Statistics