Open Access. Powered by Scholars. Published by Universities.®

OS and Networks Commons

Open Access. Powered by Scholars. Published by Universities.®

Masters Theses & Specialist Projects

Series

PSM

Articles 1 - 1 of 1

Full-Text Articles in OS and Networks

An Apache Hadoop Framework For Large-Scale Peptide Identification, Harinivesh Donepudi Jul 2015

An Apache Hadoop Framework For Large-Scale Peptide Identification, Harinivesh Donepudi

Masters Theses & Specialist Projects

Peptide identification is an essential step in protein identification, and Peptide Spectrum Match (PSM) data set is huge, which is a time consuming process to work on a single machine. In a typical run of the peptide identification method, PSMs are positioned by a cross correlation, a statistical score, or a likelihood that the match between the trial and hypothetical is correct and unique. This process takes a long time to execute, and there is a demand for an increase in performance to handle large peptide data sets. Development of distributed frameworks are needed to reduce the processing time, but …