Open Access. Powered by Scholars. Published by Universities.®

Molecular Biology Commons

Open Access. Powered by Scholars. Published by Universities.®

Theses/Dissertations

Physical Sciences and Mathematics

Bioinformatics and computational biology

Publication Year

Articles 1 - 3 of 3

Full-Text Articles in Molecular Biology

Bayesian Recombination Detection Modeling And Application, Fang Fang Jan 2006

Bayesian Recombination Detection Modeling And Application, Fang Fang

Retrospective Theses and Dissertations

As a key evolutionary process, recombination shapes the genetic structure of virus populations. The increased availability of virus sequences provides a chance to study virus recombination through molecular data. Many statistical methods have been developed, and a lot of the methods are phylogenetic-based. My research focuses on recombination modeling and data analysis;I first apply an existing phylogenetic-base method, Bayesian dual change-point model (DMCP), to investigate the role of representative data types for recombination study. We conclude that consensus sequences are an all-around robust representative of virus genotypes. Using consensus data we study recombination of all full-length hepatitis B virus ...


A Modular Data Analysis Pipeline For The Discovery Of Novel Rna Motifs , Justin Schonfeld Jan 2006

A Modular Data Analysis Pipeline For The Discovery Of Novel Rna Motifs , Justin Schonfeld

Retrospective Theses and Dissertations

This dissertation presents a modular software pipeline that searches collections of RNA sequences for novel RNA motifs. In this case the motifs incorporate elements of primary and secondary structure. The motif search pipeline breaks up sets of RNA sequences into shortened segments of RNA primary sequence. The shortened segments are then folded to obtain low energy secondary structures. The distance estimation module of the pipeline then calculates distances between the folded bricks, and then analyzes the resulting distance matrices for patterns;An initial implementation of the pipeline is applied to synthetic and biological data sets. This implementation introduces a new ...


Identification Of Interface Residues Involved In Protein-Protein And Protein-Dna Interactions From Sequence Using Machine Learning Approaches , Changhui Yan Jan 2005

Identification Of Interface Residues Involved In Protein-Protein And Protein-Dna Interactions From Sequence Using Machine Learning Approaches , Changhui Yan

Retrospective Theses and Dissertations

Identification of interface residues involved in protein-protein and protein-DNA interactions is critical for understanding the functions of biological systems. Because identifying interface residues using experimental methods cannot catch up with the pace at which protein sequences are determined, computational methods that can identify interface residues are urgently needed. In this study, we apply machine-learning methods to identify interface residues with the focus on the methods using amino acid sequence information alone. We have developed classifiers for identification of the residues involved in protein-protein and protein-DNA interactions using a window of primary sequence as input. The classifiers were evaluated using both ...