Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Bioinformatics

2008

Biology Faculty Publications

Full-Text Articles in Entire DC Network

An Improved String Composition Method For Sequence Comparison, Guoquing Lu, Shunpu Zhang, Xiang Fang May 2008

Biology Faculty Publications

Background: Historically, two categories of computational algorithms (alignment-based and alignment-free) have been applied to sequence comparison, one of the most fundamental issues in bioinformatics. Multiple sequence alignment, although the dominant approach among biologists, has both fundamental and computational limitations. Consequently, alignment-free methods have been explored as important alternatives for estimating sequence similarity. Of the alignment-free methods, the string composition vector (CV) methods, which use the frequencies of nucleotide or amino acid strings to represent sequence information, show promising results in genome sequence comparison of prokaryotes. The existing CV-based methods, however, suffer from certain statistical problems, thereby underestimating the amount of evolutionary …
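
To illustrate the general idea behind composition vectors, the following is a minimal Python sketch that represents a sequence by the relative frequencies of its length-k strings and compares two sequences with a cosine-based distance. It is not the improved method proposed in the article, whose details are not given in this abstract. The function names `composition_vector` and `cosine_distance`, the default `k=3`, the DNA alphabet, and the choice of cosine distance are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product
import math


def composition_vector(seq, k=3, alphabet="ACGT"):
    """Return relative frequencies of all k-strings over `alphabet`.

    Hypothetical helper: a plain frequency vector, without any of the
    statistical corrections that CV-based methods may apply.
    """
    seq = seq.upper()
    # Count every overlapping window of length k in the sequence.
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # Fix the vector layout: all possible k-strings in lexicographic order.
    kmers = ["".join(p) for p in product(alphabet, repeat=k)]
    # Windows containing symbols outside the alphabet (e.g. N) are ignored.
    total = sum(counts[km] for km in kmers)
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[km] / total for km in kmers]


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two equal-length vectors (assumed metric)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 1.0  # treat an empty vector as maximally distant
    return 1.0 - dot / (norm_u * norm_v)


if __name__ == "__main__":
    a = composition_vector("ACGTACGTACGGTTAC", k=2)
    b = composition_vector("ACGTTCGTACGGATAC", k=2)
    print(cosine_distance(a, b))
```

For amino acid sequences, the same sketch applies with `alphabet` set to the 20 standard residue letters, although the vector length grows as 20^k, so small values of k are more practical.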