Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

2003

Statistics and Probability

Selected Works

Bayesian methods; Bioinformatics; Mixture distributions; Multinomial distribution; SAGE;

Articles 1 - 1 of 1

Full-Text Articles in Physical Sciences and Mathematics

Bayesian Shrinkage Estimation Of The Relative Abundance Of Mrna Transcripts Using Sage, Jeffrey S. Morris, Keith A. Baggerly, Kevin R. Coombes Mar 2003

Bayesian Shrinkage Estimation Of The Relative Abundance Of Mrna Transcripts Using Sage, Jeffrey S. Morris, Keith A. Baggerly, Kevin R. Coombes

Jeffrey S. Morris

Serial analysis of gene expression (SAGE) is a technology for quantifying gene expression in biological tissue that yields count data that can be modeled by a multinomial distribution with two characteristics: skewness in the relative frequencies and small sample size relative to the dimension. As a result of these characteristics, a given SAGE sample may fail to capture a large number of expressed mRNA species present in the tissue. Empirical estimators of mRNA species’ relative abundance effectively ignore these missing species, and as a result tend to overestimate the abundance of the scarce observed species comprising a vast majority of …