Open Access. Powered by Scholars. Published by Universities.®

Biochemistry, Biophysics, and Structural Biology Commons

Open Access. Powered by Scholars. Published by Universities.®

Proteins

Life Sciences Faculty Research

Discipline

Articles 1 - 2 of 2

Full-Text Articles in Biochemistry, Biophysics, and Structural Biology

Secondary Structure, A Missing Component Of Sequence- Based Minimotif Definitions, David P. Sargeant, Michael R. Gryk, Mark W. Maciejewsk, Vishal Thapar, Vamsi Kundeti, Sanguthevar Rajasekaran, Pedro Romero, Keith Dunker, Shun-Cheng Li, Tomonori Kaneko, Martin Schiller Dec 2012

Secondary Structure, A Missing Component Of Sequence- Based Minimotif Definitions, David P. Sargeant, Michael R. Gryk, Mark W. Maciejewsk, Vishal Thapar, Vamsi Kundeti, Sanguthevar Rajasekaran, Pedro Romero, Keith Dunker, Shun-Cheng Li, Tomonori Kaneko, Martin Schiller

Life Sciences Faculty Research

Minimotifs are short contiguous segments of proteins that have a known biological function. The hundreds of thousands of minimotifs discovered thus far are an important part of the theoretical understanding of the specificity of protein-protein interactions, posttranslational modifications, and signal transduction that occur in cells. However, a longstanding problem is that the different abstractions of the sequence definitions do not accurately capture the specificity, despite decades of effort by many labs. We present evidence that structure is an essential component of minimotif specificity, yet is not used in minimotif definitions. Our analysis of several known minimotifs as case studies, analysis …


Achieving High Accuracy Prediction Of Minimotifs, Tian Mi, Sanguthevar Rajasekaran, Jerlin Camilus Merlin, Michael R. Gryk, Martin Schiller Sep 2012

Achieving High Accuracy Prediction Of Minimotifs, Tian Mi, Sanguthevar Rajasekaran, Jerlin Camilus Merlin, Michael R. Gryk, Martin Schiller

Life Sciences Faculty Research

The low complexity of minimotif patterns results in a high false-positive prediction rate, hampering protein function prediction. A multi-filter algorithm, trained and tested on a linear regression model, support vector machine model, and neural network model, using a large dataset of verified minimotifs, vastly improves minimotif prediction accuracy while generating few false positives. An optimal threshold for the best accuracy reaches an overall accuracy above 90%, while a stringent threshold for the best specificity generates less than 1% false positives or even no false positives and still produces more than 90% true positives for the linear regression and neural network …