Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Life Sciences

Theses

Theses/Dissertations

2012

Data processing

Articles 1 - 1 of 1

Full-Text Articles in Entire DC Network

Design Of A Distributed Hadoop Solution For The Multiple Sequence Alignment Algorithm: Clustal Omega, Jurate Daugelaite Jan 2012

Design Of A Distributed Hadoop Solution For The Multiple Sequence Alignment Algorithm: Clustal Omega, Jurate Daugelaite

Theses

Multiple Sequence Alignment (MSA) of DNA and protein sequences is one of the most essential techniques in the fields of molecular biology, computational biology and bioinformatics. It aids the identification and prediction of three dimensional structures, primary functions and evolutionary relatedness amongst groups of species, organisms, and genes. Since as the completion of the Human Genome Project and with the advent of sequencing initiatives such as the Genome 10K project, the rate of genome sequencing has increased exponentially, producing vast amounts of DNA and protein sequences. MSA algorithms, when applied to such sequence data, can identify common homology, structure and …