Full-Text Articles in Microarrays
Loss-Based Estimation With Cross-Validation: Applications To Microarray Data Analysis And Motif Finding, Sandrine Dudoit, Mark J. Van Der Laan, Sunduz Keles, Annette M. Molinaro, Sandra E. Sinisi, Siew Leng Teng
U.C. Berkeley Division of Biostatistics Working Paper Series
Current statistical inference problems in genomic data analysis involve parameter estimation for high-dimensional multivariate distributions, with typically unknown and intricate correlation patterns among variables. Addressing these inference questions satisfactorily requires: (i) an intensive and thorough search of the parameter space to generate good candidate estimators, (ii) an approach for selecting an optimal estimator among these candidates, and (iii) a method for reliably assessing the performance of the resulting estimator. We propose a unified loss-based methodology for estimator construction, selection, and performance assessment with cross-validation. In this approach, the parameter of interest is defined as the risk minimizer for a suitable …
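As an illustration of steps (ii) and (iii) in the abstract above, the following is a minimal sketch of choosing among candidate estimators by cross-validated risk. It is not the authors' implementation: the squared-error loss, the polynomial candidates, the simulated data, and the names `cv_risk` and `select_degree` are all assumptions made for this example.

```python
import numpy as np


def cv_risk(fit, predict, x, y, n_folds=5, seed=0):
    """Estimate risk under squared-error loss by V-fold cross-validation."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), n_folds)
    fold_losses = []
    for val_idx in folds:
        # Train on everything outside the current validation fold.
        train_mask = np.ones(len(y), dtype=bool)
        train_mask[val_idx] = False
        model = fit(x[train_mask], y[train_mask])
        residuals = y[val_idx] - predict(model, x[val_idx])
        fold_losses.append(np.mean(residuals ** 2))
    return float(np.mean(fold_losses))


def select_degree(x, y, degrees=(0, 1, 2, 3, 4, 5), n_folds=5, seed=0):
    """Return the candidate (polynomial degree) with the smallest CV risk."""
    risks = {}
    for d in degrees:
        risks[d] = cv_risk(
            lambda xs, ys, d=d: np.polyfit(xs, ys, d),   # candidate estimator
            lambda coef, xs: np.polyval(coef, xs),       # its predictions
            x, y, n_folds, seed,
        )
    best = min(risks, key=risks.get)
    return best, risks


# Hypothetical data: a quadratic signal plus small Gaussian noise.
rng = np.random.default_rng(1)
x = rng.uniform(-2.0, 2.0, size=200)
y = 1.0 + 2.0 * x - 3.0 * x ** 2 + rng.normal(scale=0.1, size=200)

best, risks = select_degree(x, y)
print("selected degree:", best)
```

The same folds are reused for every candidate (fixed `seed`), so the risk estimates are compared on identical splits. The cross-validated risk of the selected candidate is optimistically biased as a performance estimate; an honest assessment would use a further held-out sample or an outer cross-validation loop.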
Stochastic Models Based On Molecular Hybridization Theory For Short Oligonucleotide Microarrays, Zhijin Wu, Richard Leblanc, Rafael A. Irizarry
Johns Hopkins University, Dept. of Biostatistics Working Papers
High-density oligonucleotide expression arrays are a widely used tool for the measurement of gene expression on a large scale. Affymetrix GeneChip arrays appear to dominate this market. These arrays use short oligonucleotides to probe for genes in an RNA sample. Due to optical noise, non-specific hybridization, probe-specific effects, and measurement error, ad hoc measures of expression that summarize probe intensities can lead to imprecise and inaccurate results. Various researchers have demonstrated that expression measures based on simple statistical models can provide great improvements over the ad hoc procedure offered by Affymetrix. Recently, physical models based on molecular hybridization theory have been …
Design Considerations For Efficient And Effective Microarray Studies, M. Kathleen Kerr
UW Biostatistics Working Paper Series
This paper describes the theoretical and practical issues in experimental design for gene expression microarrays. Specifically, this paper (1) discusses the basic principles of design (randomization, replication, and blocking) as they pertain to microarrays, and (2) provides some general guidelines for statisticians designing microarray studies.
Cluster Stability Scores For Microarray Data In Cancer Studies, Mark Smolkin, Debashis Ghosh
The University of Michigan Department of Biostatistics Working Paper Series
A potential benefit of profiling of tissue samples using microarrays is the generation of molecular fingerprints that will define subtypes of disease. Hierarchical clustering has been the primary analytical tool used to define disease subtypes from microarray experiments in cancer settings. Assessing cluster reliability poses a major complication in analyzing output from these procedures. While much work has been done on assessing the global question of number of clusters in a dataset, relatively little research exists on assessing stability of individual clusters. …
Linear Models For Microarray Data Analysis: Hidden Similarities And Differences, M. Kathleen Kerr
UW Biostatistics Working Paper Series
In the past several years many linear models have been proposed for analyzing two-color microarray data. As presented in the literature, many of these models appear dramatically different. However, many of these models are reformulations of the same basic approach to analyzing microarray data. This paper demonstrates the equivalence of some of these models. Attention is directed at choices in microarray data analysis that have a larger impact on the results than the choice of linear model.
Selecting Differentially Expressed Genes From Microarray Experiments, Margaret S. Pepe, Gary M. Longton, Garnet L. Anderson, Michel Schummer
UW Biostatistics Working Paper Series
High-throughput technologies, such as gene expression arrays and protein mass spectrometry, allow one to simultaneously evaluate thousands of potential biomarkers that distinguish different tissue types. Of particular interest here are cancer versus normal organ tissues. We consider statistical methods to rank genes (or proteins) with regard to differential expression between tissues. Various statistical measures are considered, and we argue that two measures related to the Receiver Operating Characteristic curve are particularly suitable for this purpose. We also propose that sampling variability in the gene rankings be quantified and suggest using the “selection probability function”, the probability distribution of rankings …
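To make the idea of ROC-based ranking with quantified sampling variability concrete, here is a rough sketch under stated assumptions. The abstract above is truncated and does not say which two ROC-related measures are used or exactly how the selection probability is computed, so this example substitutes the empirical area under the ROC curve as the ranking statistic and a bootstrap estimate of the probability that each gene ranks in the top `k`. The function names and the simulated data are hypothetical.

```python
import numpy as np


def empirical_auc(cases, controls):
    """Per-gene empirical AUC: P(case > control) + 0.5 * P(tie).

    cases:    array of shape (n_genes, n_cases)
    controls: array of shape (n_genes, n_controls)
    """
    diff_gt = cases[:, :, None] > controls[:, None, :]
    diff_eq = cases[:, :, None] == controls[:, None, :]
    return diff_gt.mean(axis=(1, 2)) + 0.5 * diff_eq.mean(axis=(1, 2))


def top_k_probability(cases, controls, k, n_boot=200, seed=0):
    """Bootstrap estimate of P(gene ranks among the top k by AUC).

    Assumption: samples (columns) are resampled with replacement within
    each tissue group, keeping all genes of a sample together.
    """
    rng = np.random.default_rng(seed)
    n_genes, n_cases = cases.shape
    n_controls = controls.shape[1]
    hits = np.zeros(n_genes)
    for _ in range(n_boot):
        case_idx = rng.integers(0, n_cases, size=n_cases)
        control_idx = rng.integers(0, n_controls, size=n_controls)
        auc = empirical_auc(cases[:, case_idx], controls[:, control_idx])
        hits[np.argsort(-auc)[:k]] += 1   # genes ranked in the top k this round
    return hits / n_boot


# Hypothetical data: 50 genes, 20 samples per group, gene 0 over-expressed in cases.
rng = np.random.default_rng(2)
controls = rng.normal(size=(50, 20))
cases = rng.normal(size=(50, 20))
cases[0] += 3.0

k = 5
auc = empirical_auc(cases, controls)
prob = top_k_probability(cases, controls, k)
print("AUC of gene 0:", round(float(auc[0]), 3))
print("estimated P(gene 0 in top 5):", float(prob[0]))
```

Resampling whole samples rather than individual measurements preserves the correlation between genes within a sample, which matters because the rank of one gene depends on the statistics of all the others.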