Open Access. Powered by Scholars. Published by Universities.®
![Digital Commons Network](http://assets.bepress.com/20200205/img/dcn/DCsunburst.png)
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Discipline
-
- Statistics and Probability (43)
- Life Sciences (20)
- Microarrays (16)
- Biostatistics (13)
- Computer Sciences (9)
-
- Genetics and Genomics (8)
- Medicine and Health Sciences (8)
- Statistical Methodology (7)
- Statistical Theory (7)
- Applied Statistics (6)
- Mathematics (6)
- Medical Sciences (6)
- Bioinformatics (5)
- Statistical Models (5)
- Biochemistry, Biophysics, and Structural Biology (4)
- Chemistry (4)
- Medical Biomathematics and Biometrics (4)
- Agriculture (3)
- Computational Biology (3)
- Engineering (3)
- Multivariate Analysis (3)
- Survival Analysis (3)
- Biochemistry (2)
- Biomedical Engineering and Bioengineering (2)
- Design of Experiments and Sample Surveys (2)
- Genetics (2)
- Geriatrics (2)
- Medical Specialties (2)
- Molecular Biology (2)
- Institution
-
- COBRA (8)
- University of Kentucky (5)
- Selected Works (4)
- Brigham Young University (3)
- Clemson University (3)
-
- SelectedWorks (3)
- Claremont Colleges (2)
- Kansas State University Libraries (2)
- Louisiana Tech University (2)
- The University of Southern Mississippi (2)
- University of Nebraska - Lincoln (2)
- Virginia Commonwealth University (2)
- Wayne State University (2)
- City University of New York (CUNY) (1)
- Cleveland State University (1)
- Florida International University (1)
- Marquette University (1)
- Missouri University of Science and Technology (1)
- New Jersey Institute of Technology (1)
- Old Dominion University (1)
- Southern Methodist University (1)
- The Texas Medical Center Library (1)
- The University of Akron (1)
- University at Albany, State University of New York (1)
- University of Nebraska Medical Center (1)
- University of South Carolina (1)
- University of Tennessee, Knoxville (1)
- Utah State University (1)
- Western University (1)
- Wright State University (1)
- Publication Year
- Publication
-
- Theses and Dissertations (5)
- U.C. Berkeley Division of Biostatistics Working Paper Series (4)
- Dan Nettleton (3)
- Shuo Jiao (3)
- UW Biostatistics Working Paper Series (3)
-
- All Theses (2)
- Conference on Applied Statistics in Agriculture (2)
- Dissertations (2)
- Doctoral Dissertations (2)
- Faculty Publications (2)
- Journal of Modern Applied Statistical Methods (2)
- Pomona Faculty Publications and Research (2)
- Sanders-Brown Center on Aging Faculty Publications (2)
- All Dissertations (1)
- Browse all Theses and Dissertations (1)
- Debashis Ghosh (1)
- Department of Statistics: Dissertations, Theses, and Student Work (1)
- Dissertations & Theses (Open Access) (1)
- Dissertations, Theses, and Capstone Projects (1)
- ETD Archive (1)
- Electronic Thesis and Dissertation Repository (1)
- FIU Electronic Theses and Dissertations (1)
- IGC Proceedings (1997-2023) (1)
- Journal Articles: Biostatistics (1)
- Legacy Theses & Dissertations (2009 - 2024) (1)
- Masters Theses (1)
- Mathematics & Statistics Theses & Dissertations (1)
- Mathematics and Statistics Faculty Publications (1)
- Mathematics and Statistics Faculty Research & Creative Works (1)
- Mathematics, Statistics and Computer Science Faculty Research and Publications (1)
- Publication Type
Articles 31 - 57 of 57
Full-Text Articles in Physical Sciences and Mathematics
A Locus-Based Paradigm For Generating Systems Biological Inferences From Large Scale Functional Genomics Datasets, Ajish Dominic George
A Locus-Based Paradigm For Generating Systems Biological Inferences From Large Scale Functional Genomics Datasets, Ajish Dominic George
Legacy Theses & Dissertations (2009 - 2024)
Genomics data is growing at a exponential rate. The ability to integrate new results with existing knowledge about genomic biology is rapidly becoming the limiting factor as there no universal language with which to describe genomic functional elements. To integrate and compare new and existing genomic data, we define our basic functional unit of a genome to be a locus -- a set of positional coordinates along any genome with an arbitrary amount of functional annotations attached. The locus concept enables addressing genomic elements and annotations at any level of granularity from entire swaths of chromosomes to single base-positions. We …
Focus On Rna Isolation: Obtaining Rna For Microrna (Mirna) Expression Profiling Analyses Of Neural Tissue, Wang-Xia Wang, Bernard R. Wilfred, Donald A. Baldwin, R. Benjamin Isett, Na Ren, Arnold J. Stromberg, Peter T. Nelson
Focus On Rna Isolation: Obtaining Rna For Microrna (Mirna) Expression Profiling Analyses Of Neural Tissue, Wang-Xia Wang, Bernard R. Wilfred, Donald A. Baldwin, R. Benjamin Isett, Na Ren, Arnold J. Stromberg, Peter T. Nelson
Sanders-Brown Center on Aging Faculty Publications
MicroRNAs (miRNAs) are present in all known plant and animal tissues and appear to be somewhat concentrated in the mammalian nervous system. Many different miRNA expression profiling platforms have been described. However, relatively little research has been published to establish the importance of 'upstream' variables in RNA isolation for neural miRNA expression profiling. We tested whether apparent changes in miRNA expression profiles may be associated with tissue processing, RNA isolation techniques, or different cell types in the sample. RNA isolation was performed on a single brain sample using eight different RNA isolation methods, and results were correlated using a conventional …
On Correcting The Overestimation Of The Permutation Based False Discovery Rate Estimator., Shuo Jiao, Shunpu Zhang
On Correcting The Overestimation Of The Permutation Based False Discovery Rate Estimator., Shuo Jiao, Shunpu Zhang
Shuo Jiao
Motivation: Recent attempts to account for multiple testing in the analysis of microarray data have focused on controlling the false discovery rate (FDR), which is defined as the expected percentage of the number of false positive genes among the claimed significant genes. As a consequence, the accuracy of the FDR estimators will
be important for correctly controlling FDR. Xie et al. found that the standard permutation method of estimating FDR is biased and proposed to delete the predicted differentially expressed (DE) genes in the estimation of FDR for one-sample comparison. However, we notice that the formula of the FDR used …
Statistical Issues In The Normalizationof Multi-Species Microarray Data, John R. Stevens, Balasubramanian Ganesan, Prerak Desai, Sweta Rajan, Bart C. Weimer
Statistical Issues In The Normalizationof Multi-Species Microarray Data, John R. Stevens, Balasubramanian Ganesan, Prerak Desai, Sweta Rajan, Bart C. Weimer
Conference on Applied Statistics in Agriculture
Several species of bacteria are involved in the production of cheese, including Lactobacillus brevis and Lactococcus lactis. A custom-designed Affymetrix microarray was recently developed to study gene expression in three organisms on a single chip. This array contains only perfect match features for the coding and non-coding regions in the genomes of all three sequences. The multi-species nature of this array version raises interesting questions regarding the preprocessing or normalization strategies for the analysis of gene expression data. We present and evaluate several possible strategies using both cDNA dilution data and experimental expression data from a repeated measures design. The …
The Expression Of Microrna Mir-107 Decreases Early In Alzheimer's Disease And May Accelerate Disease Progression Through Regulation Of Β-Site Amyloid Precursor Protein-Cleaving Enzyme 1, Wang-Xia Wang, Bernard W. Rajeev, Arnold J. Stromberg, Na Ren, Guiliang Tang, Qingwei Huang, Isidore Rigoutsos, Peter T. Nelson
The Expression Of Microrna Mir-107 Decreases Early In Alzheimer's Disease And May Accelerate Disease Progression Through Regulation Of Β-Site Amyloid Precursor Protein-Cleaving Enzyme 1, Wang-Xia Wang, Bernard W. Rajeev, Arnold J. Stromberg, Na Ren, Guiliang Tang, Qingwei Huang, Isidore Rigoutsos, Peter T. Nelson
Sanders-Brown Center on Aging Faculty Publications
MicroRNAs (miRNAs) are small regulatory RNAs that participate in posttranscriptional gene regulation in a sequence-specific manner. However, little is understood about the role(s) of miRNAs in Alzheimer's disease (AD). We used miRNA expression microarrays on RNA extracted from human brain tissue from the University of Kentucky Alzheimer's Disease Center Brain Bank with near-optimal clinicopathological correlation. Cases were separated into four groups: elderly nondemented with negligible AD-type pathology, nondemented with incipient AD pathology, mild cognitive impairment (MCI) with moderate AD pathology, and AD. Among the AD-related miRNA expression changes, miR-107 was exceptional because miR-107 levels decreased significantly even in patients with …
The T-Mixture Model Approach For Detecting Differentially Expressed Genes In Microarrays, Shuo Jiao, Shunpu Zhang
The T-Mixture Model Approach For Detecting Differentially Expressed Genes In Microarrays, Shuo Jiao, Shunpu Zhang
Shuo Jiao
The finite mixture model approach has attracted much attention in analyzing microarray data due to its robustness to the excessive variability which is common in the microarray data. Pan (2003) proposed to use the normal mixture model method (MMM) to estimate the distribution of a test statistic and its null distribution. However, considering the fact that the test statistic is often of t-type, our studies find that the rejection region from MMM is often significantly larger than the correct rejection region, resulting an inflated type I error. This motivates us to propose the t-mixture model (TMM) approach. In this paper, …
Molecular Targets Of 2,3,7,8-Tetrachlorodibenzo-P-Dioxin (Tcdd) Within The Zebrafish Ovary: Insights Into Tcdd-Induced Endocrine Disruption And Reproductive Toxicity, Tisha C. King Heiden, Craig Struble, Matthew L. Rise, Martin J. Hessner, Reinhold J. Hutz, Michael J. Carvan Iii
Molecular Targets Of 2,3,7,8-Tetrachlorodibenzo-P-Dioxin (Tcdd) Within The Zebrafish Ovary: Insights Into Tcdd-Induced Endocrine Disruption And Reproductive Toxicity, Tisha C. King Heiden, Craig Struble, Matthew L. Rise, Martin J. Hessner, Reinhold J. Hutz, Michael J. Carvan Iii
Mathematics, Statistics and Computer Science Faculty Research and Publications
TCDD is a reproductive toxicant and endocrine disruptor, yet the mechanisms by which it causes these reproductive alterations are not fully understood. In order to provide additional insight into the molecular mechanisms that underlie TCDD's reproductive toxicity, we assessed TCDD-induced transcriptional changes in the ovary as they relate to previously described impacts on serum estradiol concentrations and altered follicular development in zebrafish. In silico computational approaches were used to correlate candidate regulatory motifs with observed changes in gene expression. Our data suggest that TCDD inhibits follicle maturation via attenuated gonadotropin responsiveness and/or depressed estradiol biosynthesis, and that interference of estrogen-regulated …
Development Of Oligonucleotide Microarray For High Throughput Dna Methylation Analysis, Xiaopeng Li
Development Of Oligonucleotide Microarray For High Throughput Dna Methylation Analysis, Xiaopeng Li
ETD Archive
DNA methylation is a key event regulating gene expression. DNA methylation analysis plays a pivotal role in unlocking association of epigenetic events with cancer. However, simultaneous evaluation of the methylation status of multiple genes is still a technical challenge. Microarray is a promising approach for high-throughput analysis of the methylation status at numerous CpG sites within multiple genes of interest. In this dissertation study, we conducted a systematic study to examine the use of microarray methods for methylation analysis. First, a robust universal microarray was established with more flexible in design and content, and potential cost saving over commercial arrays. …
Probe Level Analysis Of Affymetrix Microarray Data, Richard Ellis Kennedy
Probe Level Analysis Of Affymetrix Microarray Data, Richard Ellis Kennedy
Theses and Dissertations
The analysis of Affymetrix GeneChip® data is a complex, multistep process. Most often, methodscondense the multiple probe level intensities into single probeset level measures (such as RobustMulti-chip Average (RMA), dChip and Microarray Suite version 5.0 (MAS5)), which are thenfollowed by application of statistical tests to determine which genes are differentially expressed. An alternative approach is a probe-level analysis, which tests for differential expression directly using the probe-level data. Probe-level models offer the potential advantage of more accurately capturing sources of variation in microarray experiments. However, this has not been thoroughly investigated, since current research efforts have largely focused on the …
Development Of Informative Priors In Microarray Studies, Kassandra M. Fronczyk
Development Of Informative Priors In Microarray Studies, Kassandra M. Fronczyk
Theses and Dissertations
Microarrays measure the abundance of DNA transcripts for thousands of gene sequences, simultaneously facilitating genomic comparisons across tissue types or disease status. These experiments are used to understand fundamental aspects of growth and development and to explore the underlying genetic causes of many diseases. The data from most microarray studies are found in open-access online databases. Bayesian models are ideal for the analysis of microarray data because of their ability to integrate prior information; however, most current Bayesian analyses use empirical or flat priors. We present a Perl script to build an informative prior by mining online databases for similar …
Data Mining And Analysis On Multiple Time Series Object Data, Chunyu Jiang
Data Mining And Analysis On Multiple Time Series Object Data, Chunyu Jiang
Browse all Theses and Dissertations
Huge amount of data is available in our society and the need for turning such data into useful information and knowledge is urgent. Data mining is an important field addressing that need and significant progress has been achieved in the last decade. In several important application areas, data arises in the format of Multiple Time Series Object (MTSO) data, where each data object is an array of time series over a large set of features and each has an associated class or state. Very little research has been conducted towards this kind of data. Examples include computational toxicology, where each …
A Comparison Of Microarray Analyses: A Mixed Models Approach Versus The Significance Analysis Of Microarrays, Nathan Wallace Stephens
A Comparison Of Microarray Analyses: A Mixed Models Approach Versus The Significance Analysis Of Microarrays, Nathan Wallace Stephens
Theses and Dissertations
DNA microarrays are a relatively new technology for assessing the expression levels of thousands of genes simultaneously. Researchers hope to find genes that are differentially expressed by hybridizing cDNA from known treatment sources with various genes spotted on the microarrays. The large number of tests involved in analyzing microarrays has raised new questions in multiple testing. Several approaches for identifying differentially expressed genes have been proposed. This paper considers two: (1) a mixed models approach, and (2) the Signiffcance Analysis of Microarrays.
A Novel Computational Framework For Fast, Distributed Computing And Knowledge Integration For Microarray Gene Expression Data Analysis, Prerna Sethi
Doctoral Dissertations
The healthcare burden and suffering due to life-threatening diseases such as cancer would be significantly reduced by the design and refinement of computational interpretation of micro-molecular data collected by bioinformaticians. Rapid technological advancements in the field of microarray analysis, an important component in the design of in-silico molecular medicine methods, have generated enormous amounts of such data, a trend that has been increasing exponentially over the last few years. However, the analysis and handling of these data has become one of the major bottlenecks in the utilization of the technology. The rate of collection of these data has far surpassed …
2^K Factorials In Blocks Of Size 2, With Application To Two-Color Microarray Experiments, Kathleen F. Kerr
2^K Factorials In Blocks Of Size 2, With Application To Two-Color Microarray Experiments, Kathleen F. Kerr
UW Biostatistics Working Paper Series
When a two-level design must be run in blocks of size two, there is a unique blocking scheme that enables estimation of all the main effects. Unfortunately this design does not enable estimation of any two-factor interactions. When the experimental goal is to estimate all main effects and two-factor interactions, it is necessary to combine replicates of the experiment that use different blocking schemes. In this paper we identify such designs for up to eight factors that enable estimation of all main effects and two-factor interactions with the fewest number of replications. In addition, we give a construction for general …
Bioinformatics Framework For Genotyping Microarray Data Analysis, Kai Zhang
Bioinformatics Framework For Genotyping Microarray Data Analysis, Kai Zhang
Dissertations
Functional genomics is a flourishing science enabled by recent technological breakthroughs in high-throughput instrumentation and microarray data analysis. Genotyping microarrays establish the genotypes of DNA sequences containing single nucleotide polymorphisms (SNPs), and can help biologists probe the functions of different genes and/or construct complex gene interaction networks. The enormous amount of data from these experiments makes it infeasible to perform manual processing to obtain accurate and reliable results in daily routines. Advanced algorithms as well as an integrated software toolkit are needed to help perform reliable and fast data analysis.
The author developed a MatlabTM based software package, called …
Yeast Through The Ages: A Statistical Analysis Of Genetic Changes In Aging Yeast, Alison Wise '05, Johanna S. Hardin, Laura Hoopes
Yeast Through The Ages: A Statistical Analysis Of Genetic Changes In Aging Yeast, Alison Wise '05, Johanna S. Hardin, Laura Hoopes
Pomona Faculty Publications and Research
Microarray technology allows for the expression levels of thousands of genes in a cell to be measured simultaneously. The technology provides great potential in the fields of biology and medicine, as the analysis of data obtained from microarray experiments gives insight into the roles of specific genes and the associated changes across experimental conditions (e.g., aging, mutation, radiation therapy, drug dosage). The application of statistical tools to microarray data can help make sense of the experiment and thereby advance genetic, biological, and medical research. Likewise, microarrays provide an exciting means through which to explore statistical techniques.
Optimal Feature Selection For Nearest Centroid Classifiers, With Applications To Gene Expression Microarrays, Alan R. Dabney, John D. Storey
Optimal Feature Selection For Nearest Centroid Classifiers, With Applications To Gene Expression Microarrays, Alan R. Dabney, John D. Storey
UW Biostatistics Working Paper Series
Nearest centroid classifiers have recently been successfully employed in high-dimensional applications. A necessary step when building a classifier for high-dimensional data is feature selection. Feature selection is typically carried out by computing univariate statistics for each feature individually, without consideration for how a subset of features performs as a whole. For subsets of a given size, we characterize the optimal choice of features, corresponding to those yielding the smallest misclassification rate. Furthermore, we propose an algorithm for estimating this optimal subset in practice. Finally, we investigate the applicability of shrinkage ideas to nearest centroid classifiers. We use gene-expression microarrays for …
A Bayesian And Covariate Approach To Combine Results From Multiple Microarray Studies, John R. Stevens, R. W. Doerge
A Bayesian And Covariate Approach To Combine Results From Multiple Microarray Studies, John R. Stevens, R. W. Doerge
Conference on Applied Statistics in Agriculture
The growing popularity of microarray technology for testing changes in gene expression has resulted in multiple laboratories independently seeking to identify genes related to the same disease in the same organism. Despite the uniform nature of the technology, chance variation and fundamental differences between laboratories can result in considerable disagreement between the lists of significant candidate genes from each laboratory. By adjusting for known differences between laboratories through the use of covariates and employing a Bayesian framework to effectively account for between-laboratory variability, the results of multiple similar studies can be systematically combined via a meta-analysis. Meta-analyses yield additional information …
A Platform-Independent Software Suite For Statistical Analysis Of High Dimensional Biology Data, David B. Allison, Jacob P. L. Brand, Jode W. Edwards, Gary L. Gadbury, Kyoungmi Kim, Tapan Mehta, Grier P. Page, Amit Patki, Vinodh Srinivasasainagendra, Prinal Trivedi, Jelai Wang, Stanislav O. Zakharkin
A Platform-Independent Software Suite For Statistical Analysis Of High Dimensional Biology Data, David B. Allison, Jacob P. L. Brand, Jode W. Edwards, Gary L. Gadbury, Kyoungmi Kim, Tapan Mehta, Grier P. Page, Amit Patki, Vinodh Srinivasasainagendra, Prinal Trivedi, Jelai Wang, Stanislav O. Zakharkin
Mathematics and Statistics Faculty Research & Creative Works
Many efforts in microarray data analysis are focused on providing tools and methods for the qualitative analysis of microarray data. HDBStat! (High-Dimensional Biology-Statistics) is a software package designed for analysis of high dimensional biology data such as microarray data. It was initially developed for the analysis of microarray gene expression data, but it can also be used for some applications in proteomics and other aspects of genomics. HDBStat! provides statisticians and biologists a flexible and easy-to-use interface to analyze complex microarray data using a variety of methods for data preprocessing, quality control analysis and hypothesis testing.
Multiple Testing Procedures: R Multtest Package And Applications To Genomics, Katherine S. Pollard, Sandrine Dudoit, Mark J. Van Der Laan
Multiple Testing Procedures: R Multtest Package And Applications To Genomics, Katherine S. Pollard, Sandrine Dudoit, Mark J. Van Der Laan
U.C. Berkeley Division of Biostatistics Working Paper Series
The Bioconductor R package multtest implements widely applicable resampling-based single-step and stepwise multiple testing procedures (MTP) for controlling a broad class of Type I error rates, in testing problems involving general data generating distributions (with arbitrary dependence structures among variables), null hypotheses, and test statistics. The current version of multtest provides MTPs for tests concerning means, differences in means, and regression parameters in linear and Cox proportional hazards models. Procedures are provided to control Type I error rates defined as tail probabilities for arbitrary functions of the numbers of false positives and rejected hypotheses. These error rates include tail probabilities …
Nonparametric Methods For Analyzing Replication Origins In Genomewide Data, Debashis Ghosh
Nonparametric Methods For Analyzing Replication Origins In Genomewide Data, Debashis Ghosh
The University of Michigan Department of Biostatistics Working Paper Series
Due to the advent of high-throughput genomic technology, it has become possible to globally monitor cellular activities on a genomewide basis. With these new methods, scientists can begin to address important biological questions. One such question involves the identification of replication origins, which are regions in chromosomes where DNA replication is initiated. In addition, one hypothesis regarding replication origins is that their locations are non-random throughout the genome. In this article, we develop methods for identification of and cluster inference regarding replication origins involving genomewide expression data. We compare several nonparametric regression methods for the identification of replication origin locations. …
A Statistical Method For Constructing Transcriptional Regulatory Networks Using Gene Expression And Sequence Data , Biao Xing, Mark J. Van Der Laan
A Statistical Method For Constructing Transcriptional Regulatory Networks Using Gene Expression And Sequence Data , Biao Xing, Mark J. Van Der Laan
U.C. Berkeley Division of Biostatistics Working Paper Series
Transcriptional regulation is one of the most important means of gene regulation. Uncovering transcriptional regulatory network helps us to understand the complex cellular process. In this paper, we describe a comprehensive statistical approach for constructing the transcriptional regulatory network using data of gene expression, promoter sequence, and transcription factor binding sites. Our simulation studies show that the overall and false positive error rates in the estimated transcriptional regulatory network are expected to be small if the systematic noise in the constructed feature matrix is small. Our analysis based on 658 microarray experiments on yeast gene expression programs and 46 transcription …
Evaluation Of Multiple Models To Distinguish Closely Related Forms Of Disease Using Dna Microarray Data: An Application To Multiple Myeloma, Johanna S. Hardin, Michael Waddell, C. David Page, Fenghuang Zhan, Bart Barlogie, John Shaughnessy, John J. Crowley
Evaluation Of Multiple Models To Distinguish Closely Related Forms Of Disease Using Dna Microarray Data: An Application To Multiple Myeloma, Johanna S. Hardin, Michael Waddell, C. David Page, Fenghuang Zhan, Bart Barlogie, John Shaughnessy, John J. Crowley
Pomona Faculty Publications and Research
Motivation: Standard laboratory classification of the plasma cell dyscrasia monoclonal gammopathy of undetermined significance (MGUS) and the overt plasma cell neoplasm multiple myeloma (MM) is quite accurate, yet, for the most part, biologically uninformative. Most, if not all, cancers are caused by inherited or acquired genetic mutations that manifest themselves in altered gene expression patterns in the clonally related cancer cells. Microarray technology allows for qualitative and quantitative measurements of the expression levels of thousands of genes simultaneously, and it has now been used both to classify cancers that are morphologically indistinguishable and to predict response to therapy. It is …
Loss-Based Estimation With Cross-Validation: Applications To Microarray Data Analysis And Motif Finding, Sandrine Dudoit, Mark J. Van Der Laan, Sunduz Keles, Annette M. Molinaro, Sandra E. Sinisi, Siew Leng Teng
Loss-Based Estimation With Cross-Validation: Applications To Microarray Data Analysis And Motif Finding, Sandrine Dudoit, Mark J. Van Der Laan, Sunduz Keles, Annette M. Molinaro, Sandra E. Sinisi, Siew Leng Teng
U.C. Berkeley Division of Biostatistics Working Paper Series
Current statistical inference problems in genomic data analysis involve parameter estimation for high-dimensional multivariate distributions, with typically unknown and intricate correlation patterns among variables. Addressing these inference questions satisfactorily requires: (i) an intensive and thorough search of the parameter space to generate good candidate estimators, (ii) an approach for selecting an optimal estimator among these candidates, and (iii) a method for reliably assessing the performance of the resulting estimator. We propose a unified loss-based methodology for estimator construction, selection, and performance assessment with cross-validation. In this approach, the parameter of interest is defined as the risk minimizer for a suitable …
Design Considerations For Efficient And Effective Microarray Studies, M. Kathleen Kerr
Design Considerations For Efficient And Effective Microarray Studies, M. Kathleen Kerr
UW Biostatistics Working Paper Series
This paper describes the theoretical and practical issues in experimental design for gene expression microarrays. Specifically, this paper (1) discusses the basic principles of design (randomization, replication, and blocking) as they pertain to microarrays, and (2) provides some general guidelines for statisticians designing microarray studies.
A Semiparametric Regression Model For Oligonucleotide Arrays, Jianhua Hu, Guosheng Yin
A Semiparametric Regression Model For Oligonucleotide Arrays, Jianhua Hu, Guosheng Yin
Journal of Modern Applied Statistical Methods
A semiparametric model incorporating the spline smoothing technique is proposed to study oligonucleotide gene expression data. No specific parametric functional form is assumed for mismatch probe intensities, which allows much more flexibility in the fitted model. The new approach improves the model fitting, hence the estimation of expression indexes. The method is applied to a data set of 18 HuGeneFL arrays.
Multiple Hypothesis Testing In Microarray Experiments, Sandrine Dudoit, Juliet Popper Shaffer, Jennifer C. Boldrick
Multiple Hypothesis Testing In Microarray Experiments, Sandrine Dudoit, Juliet Popper Shaffer, Jennifer C. Boldrick
U.C. Berkeley Division of Biostatistics Working Paper Series
DNA microarrays are a new and promising biotechnology which allows the monitoring of expression levels in cells for thousands of genes simultaneously. An important and common question in microarray experiments is the identification of differentially expressed genes, i.e., genes whose expression levels are associated with a response or covariate of interest. The biological question of differential expression can be restated as a problem in multiple hypothesis testing: the simultaneous test for each gene of the null hypothesis of no association between the expression levels and the responses or covariates. As a typical microarray experiment measures expression levels for thousands of …