Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Agriculture

Kansas State University Libraries

Poisson distribution

Publication Year

Articles 1 - 3 of 3

Full-Text Articles in Physical Sciences and Mathematics

The Nuances Of Statistically Analyzing Next-Generation Sequencing Data, Sanvesh Srivastava, R. W. Doerge Apr 2012

The Nuances Of Statistically Analyzing Next-Generation Sequencing Data, Sanvesh Srivastava, R. W. Doerge

Conference on Applied Statistics in Agriculture

High-throughput sequencing technologies, in particular next-generation sequencing (NGS) technologies, have emerged as the preferred approach for exploring both gene function and pathway organization. Data from NGS technologies pose new computational and statistical challenges because of their massive size, limited replicate information, large number of genes (high-dimensionality), and discrete form. They are more complex than data from previous high-throughput technologies such as microarrays. In this work we focus on the statistical issues in analyzing and modeling NGS data for selecting genes suitable for further exploration and present a brief review of the relevant statistical methods. We discuss visualization methods to assess …


A Hierarchical Bayesian Approach For Detecting Differential Gene Expression In Unreplicated Rna-Sequencing Data, Sanvesh Srivastava, R. W. Doerge May 2011

A Hierarchical Bayesian Approach For Detecting Differential Gene Expression In Unreplicated Rna-Sequencing Data, Sanvesh Srivastava, R. W. Doerge

Conference on Applied Statistics in Agriculture

Next-generation sequencing technologies have emerged as a promising technology in a variety of fields, including genomics, epigenomics, and transcriptomics. These technologies play an important role in understanding cell organization and functionality. Unlike data from earlier technologies (e.g., microarrays), data from next-generation sequencing technologies are highly replicable with little technical variation. One application of next-generation sequencing technologies is RNA-Sequencing (RNA-Seq). It is used for detecting differential gene expression between different biological conditions. While statistical methods for detecting differential expression in RNA-Seq data exist, one serious limitation to these methods is the absence of biological replication. At present, the high cost of …


A Non-Parametric Empirical Bayes Approach For Estimating Transcript Abundance In Un-Replicated Next-Generation Sequencing Data, Sanvesh Srivastava, R. W. Doerge Apr 2010

A Non-Parametric Empirical Bayes Approach For Estimating Transcript Abundance In Un-Replicated Next-Generation Sequencing Data, Sanvesh Srivastava, R. W. Doerge

Conference on Applied Statistics in Agriculture

Empirical Bayes approaches have been widely used to analyze data from high throughput sequencing devices. These approaches rely on borrowing information available for all the genes across samples to get better estimates of gene level expression. To date, transcript abundance in data from next generation sequencing (NGS) technologies has been estimated using parametric approaches for analyzing count data, namely – gamma-Poisson model, negative binomial model, and over-dispersed logistic model. One serious limitation of these approaches is they cannot be applied in absence of replication. The high cost of NGS technologies imposes a serious restriction on the number of biological replicates …