Microarrays | Open Access Articles | Digital Commons Network™

Loss-Based Estimation With Cross-Validation: Applications To Microarray Data Analysis And Motif Finding, Sandrine Dudoit, Mark J. Van Der Laan, Sunduz Keles, Annette M. Molinaro, Sandra E. Sinisi, Siew Leng Teng Dec 2003

Loss-Based Estimation With Cross-Validation: Applications To Microarray Data Analysis And Motif Finding, Sandrine Dudoit, Mark J. Van Der Laan, Sunduz Keles, Annette M. Molinaro, Sandra E. Sinisi, Siew Leng Teng

U.C. Berkeley Division of Biostatistics Working Paper Series

Current statistical inference problems in genomic data analysis involve parameter estimation for high-dimensional multivariate distributions, with typically unknown and intricate correlation patterns among variables. Addressing these inference questions satisfactorily requires: (i) an intensive and thorough search of the parameter space to generate good candidate estimators, (ii) an approach for selecting an optimal estimator among these candidates, and (iii) a method for reliably assessing the performance of the resulting estimator. We propose a unified loss-based methodology for estimator construction, selection, and performance assessment with cross-validation. In this approach, the parameter of interest is defined as the risk minimizer for a suitable …

Go to article

Stochastic Models Based On Molecular Hybridization Theory For Short Oligonucleotide Microarrays, Zhijin Wu, Richard Leblanc, Rafael A. Irizarry Sep 2003

Stochastic Models Based On Molecular Hybridization Theory For Short Oligonucleotide Microarrays, Zhijin Wu, Richard Leblanc, Rafael A. Irizarry

Johns Hopkins University, Dept. of Biostatistics Working Papers

High density oligonucleotide expression arrays are a widely used tool for the measurement of gene expression on a large scale. Affymetrix GeneChip arrays appear to dominate this market. These arrays use short oligonucleotides to probe for genes in an RNA sample. Due to optical noise, non-specific hybridization, probe-specific effects, and measurement error, ad-hoc measures of expression, that summarize probe intensities, can lead to imprecise and inaccurate results. Various researchers have demonstrated that expression measures based on simple statistical models can provide great improvements over the ad-hoc procedure offered by Affymetrix. Recently, physical models based on molecular hybridization theory, have been …

Go to article

Design Considerations For Efficient And Effective Microarray Studies, M. Kathleen Kerr Jun 2003

Design Considerations For Efficient And Effective Microarray Studies, M. Kathleen Kerr

UW Biostatistics Working Paper Series

This paper describes the theoretical and practical issues in experimental design for gene expression microarrays. Specifically, this paper (1) discusses the basic principles of design (randomization, replication, and blocking) as they pertain to microarrays, and (2) provides some general guidelines for statisticians designing microarray studies.

Go to article

Cluster Stability Scores For Microarray Data In Cancer Studies, Mark Smolkin, Debashis Ghosh Jun 2003

Cluster Stability Scores For Microarray Data In Cancer Studies, Mark Smolkin, Debashis Ghosh

The University of Michigan Department of Biostatistics Working Paper Series

A potential benefit of profiling of tissue samples using microarrays is the generation of molecular fingerprints that will define subtypes of disease. Hierarchical clustering has been the primary analytical tool used to define disease subtypes from microarray experiments in cancer settings. Assessing cluster reliability poses a major complication in analyzing output from these procedures. While much work has been done on assessing the global question of number of clusters in a dataset, relatively little research exists on assessing stability of individual clusters. A potential benefit of profiling of tissue samples using microarrays is the generation of molecular fingerprints that will …

Go to article

Linear Models For Microarray Data Analysis: Hidden Similarities And Differences, M. Kathleen Kerr May 2003

Linear Models For Microarray Data Analysis: Hidden Similarities And Differences, M. Kathleen Kerr

UW Biostatistics Working Paper Series

In the past several years many linear models have been proposed for analyzing two-color microarray data. As presented in the literature, many of these models appear dramatically different. However, many of these models are reformulations of the same basic approach to analyzing microarray data. This paper demonstrates the equivalence of some of these models. Attention is directed at choices in microarray data analysis that have a larger impact on the results than the choice of linear model.

Go to article

Selecting Differentially Expressed Genes From Microarray Experiments, Margaret S. Pepe, Gary M. Longton, Garnet L. Anderson, Michel Schummer Jan 2003

Selecting Differentially Expressed Genes From Microarray Experiments, Margaret S. Pepe, Gary M. Longton, Garnet L. Anderson, Michel Schummer

UW Biostatistics Working Paper Series

High throughput technologies, such as gene expression arrays and protein mass spectrometry, allow one to simultaneously evaluate thousands of potential biomarkers that distinguish different tissue types. Of particular interest here is cancer versus normal organ tissues. We consider statistical methods to rank genes (or proteins) in regards to differential expression between tissues. Various statistical measures are considered and we argue that two measures related to the Receiver Operating Characteristic Curve are particularly suitable for this purpose. We also propose that sampling variability in the gene rankings be quantified and suggest using the “selection probability function”, the probability distribution of rankings …

Go to article

Microarrays Commons^™

Full-Text Articles in Microarrays

Loss-Based Estimation With Cross-Validation: Applications To Microarray Data Analysis And Motif Finding, Sandrine Dudoit, Mark J. Van Der Laan, Sunduz Keles, Annette M. Molinaro, Sandra E. Sinisi, Siew Leng Teng

U.C. Berkeley Division of Biostatistics Working Paper Series

Stochastic Models Based On Molecular Hybridization Theory For Short Oligonucleotide Microarrays, Zhijin Wu, Richard Leblanc, Rafael A. Irizarry

Johns Hopkins University, Dept. of Biostatistics Working Papers

Design Considerations For Efficient And Effective Microarray Studies, M. Kathleen Kerr

UW Biostatistics Working Paper Series

Cluster Stability Scores For Microarray Data In Cancer Studies, Mark Smolkin, Debashis Ghosh

The University of Michigan Department of Biostatistics Working Paper Series

Linear Models For Microarray Data Analysis: Hidden Similarities And Differences, M. Kathleen Kerr

UW Biostatistics Working Paper Series

Selecting Differentially Expressed Genes From Microarray Experiments, Margaret S. Pepe, Gary M. Longton, Garnet L. Anderson, Michel Schummer

UW Biostatistics Working Paper Series