Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability

Theses and Dissertations

2007

Statistics

Articles 1 - 2 of 2

Full-Text Articles in Physical Sciences and Mathematics

Sensitivity To Distributional Assumptions In Estimation Of The Odp Thresholding Function, Wendy Jill Bunn Jul 2007

Sensitivity To Distributional Assumptions In Estimation Of The Odp Thresholding Function, Wendy Jill Bunn

Theses and Dissertations

Recent technological advances in fields like medicine and genomics have produced high-dimensional data sets and a challenge to correctly interpret experimental results. The Optimal Discovery Procedure (ODP) (Storey 2005) builds on the framework of Neyman-Pearson hypothesis testing to optimally test thousands of hypotheses simultaneously. The method relies on the assumption of normally distributed data; however, many applications of this method will violate this assumption. This thesis investigates the sensitivity of this method to detection of significant but nonnormal data. Overall, estimation of the ODP with the method described in this thesis is satisfactory, except when the nonnormal alternative distribution has …


A Simulation-Based Approach For Evaluating Gene Expression Analyses, Carly Ruth Pendleton Mar 2007

A Simulation-Based Approach For Evaluating Gene Expression Analyses, Carly Ruth Pendleton

Theses and Dissertations

Microarrays enable biologists to measure differences in gene expression in thousands of genes simultaneously. The data produced by microarrays present a statistical challenge, one which has been met both by new modifications of existing methods and by completely new approaches. One of the difficulties with a new approach to microarray analysis is validating the method's power and sensitivity. A simulation study could provide such validation by simulating gene expression data and investigating the method's response to changes in the data; however, due to the complex dependencies and interactions found in gene expression data, such a simulation would be complicated and …