Open Access. Powered by Scholars. Published by Universities.®

Life Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Selected Works

Genetics and Genomics

2009

Computational Biology/Bioinformatics

Articles 1 - 5 of 5

Full-Text Articles in Life Sciences

Uniqueprimer - A Web Utility For Design Of Specific Pcr Primers And Probes, Torstein Tengs Jan 2009

Uniqueprimer - A Web Utility For Design Of Specific Pcr Primers And Probes, Torstein Tengs

Dr. Torstein Tengs

We have developed a web-based tool for design of specific PCR primers and probes. The program allows you to enter primer sequence information as well as an optional probe, and sequence similarity searches (MegaBLAST) will be performed to see if the sequences match the same sequence entry in the specified database. If primers (and probe) match, this will be reported. The program can handle overlapping amplicons, amplification from a single primer, ambiguous bases and other problematic cases.


Hierarchical Hidden Markov Model With Application To Joint Analysis Of Chip-Chip And Chip-Seq Data, Hyungwon Choi, Debashis Ghosh, Zhaohui S. Qin Jan 2009

Hierarchical Hidden Markov Model With Application To Joint Analysis Of Chip-Chip And Chip-Seq Data, Hyungwon Choi, Debashis Ghosh, Zhaohui S. Qin

Debashis Ghosh

Motivation: Identication of transcription factor binding sites (TFBS) is a fundamental problem in understanding the mechanism of gene regulation. The ChIP-chip technology has accelerated this eort by providing a simultaneous genome-wide map of TFBS in a high-throughput fashion. Recently, a sequencing-based ChIP-seq has appeared as a promising alternative that can identify targets with an improved sensitivity/specicity in high resolution. However, studies have suggested that distinct experimental platforms can be complementary in TFBS identication. The availability of data obtained from multiple platforms motivates a meta-analysis for improved identication of candidate motifs.

Results: In this work, we propose a hierarchical hidden Markov …


A Double-Layered Mixture Model For The Joint Analysis Of Dna Copy Number And Gene Expression Data, Debashis Ghosh Jan 2009

A Double-Layered Mixture Model For The Joint Analysis Of Dna Copy Number And Gene Expression Data, Debashis Ghosh

Debashis Ghosh

Copy number aberration is a common form of genomic instability in cancer. Gene expression is closely tied to cytogenetic events by the central dogma of molecular biology, and serves as a mediator of copy number changes in disease phenotypes. Accordingly, it is of interest to develop proper statistical methods for jointly analyzing copy number and gene expression data. This work describes a novel Bayesian inferential approach for a double-layered mixture model (DLMM) which directly models the stochastic nature of copy number data and identifies abnormally expressed genes due to aberrant copy number. Simulation studies were conducted to illustrate the robustness …


Discrete Nonparametric Algorithms For Outlier Detection With Genomic Data, Debashis Ghosh Jan 2009

Discrete Nonparametric Algorithms For Outlier Detection With Genomic Data, Debashis Ghosh

Debashis Ghosh

In high-throughput studies involving genetic data such as from gene expression microarrays, differential expression analysis between two or more experimental conditions has been a very common analytical task. Much of the resulting literature on multiple comparisons has paid relatively little attention to the choice of test statistic. In this article, we focus on the issue of choice of test statistic based on a special pattern of differential expression. The approach here is based on recasting multiple comparisons procedures for assessing outlying expression values. A major complication is that the resulting p-values are discrete; some theoretical properties of sequential testing procedures …


Identification Of Yeast Transcriptional Regulation Networks Using Multivariate Random Forests, Yuanyuan Xiao, Mark Segal Dec 2008

Identification Of Yeast Transcriptional Regulation Networks Using Multivariate Random Forests, Yuanyuan Xiao, Mark Segal

Mark R Segal

The recent availability of whole-genome scale data sets that investigate complementary and diverse aspects of transcriptional regulation has spawned an increased need for new and effective computational approaches to analyze and integrate these large scale assays. Here, we propose a novel algorithm, based on random forest methodology, to relate gene expression (as derived from expression microarrays) to sequence features residing in gene promoters (as derived from DNA motif data) and transcription factor binding to gene promoters (as derived from tiling microarrays). We extend the random forest approach to model a multivariate response as represented, for example, by time-course gene expression …