Open Access. Powered by Scholars. Published by Universities.®

Molecular Biology Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 3 of 3

Full-Text Articles in Molecular Biology

Secondary Structure, A Missing Component Of Sequence- Based Minimotif Definitions, David P. Sargeant, Michael R. Gryk, Mark W. Maciejewsk, Vishal Thapar, Vamsi Kundeti, Sanguthevar Rajasekaran, Pedro Romero, Keith Dunker, Shun-Cheng Li, Tomonori Kaneko, Martin Schiller Dec 2012

Secondary Structure, A Missing Component Of Sequence- Based Minimotif Definitions, David P. Sargeant, Michael R. Gryk, Mark W. Maciejewsk, Vishal Thapar, Vamsi Kundeti, Sanguthevar Rajasekaran, Pedro Romero, Keith Dunker, Shun-Cheng Li, Tomonori Kaneko, Martin Schiller

Life Sciences Faculty Research

Minimotifs are short contiguous segments of proteins that have a known biological function. The hundreds of thousands of minimotifs discovered thus far are an important part of the theoretical understanding of the specificity of protein-protein interactions, posttranslational modifications, and signal transduction that occur in cells. However, a longstanding problem is that the different abstractions of the sequence definitions do not accurately capture the specificity, despite decades of effort by many labs. We present evidence that structure is an essential component of minimotif specificity, yet is not used in minimotif definitions. Our analysis of several known minimotifs as case studies, analysis …


Achieving High Accuracy Prediction Of Minimotifs, Tian Mi, Sanguthevar Rajasekaran, Jerlin Camilus Merlin, Michael R. Gryk, Martin Schiller Sep 2012

Achieving High Accuracy Prediction Of Minimotifs, Tian Mi, Sanguthevar Rajasekaran, Jerlin Camilus Merlin, Michael R. Gryk, Martin Schiller

Life Sciences Faculty Research

The low complexity of minimotif patterns results in a high false-positive prediction rate, hampering protein function prediction. A multi-filter algorithm, trained and tested on a linear regression model, support vector machine model, and neural network model, using a large dataset of verified minimotifs, vastly improves minimotif prediction accuracy while generating few false positives. An optimal threshold for the best accuracy reaches an overall accuracy above 90%, while a stringent threshold for the best specificity generates less than 1% false positives or even no false positives and still produces more than 90% true positives for the linear regression and neural network …


Mutation And Complementation Of A Cellulose Synthase (Cesa) Gene, Ahmed Y. El-Araby May 2012

Mutation And Complementation Of A Cellulose Synthase (Cesa) Gene, Ahmed Y. El-Araby

Senior Honors Projects

Cellulose is a carbohydrate polymer that is composed of repeating glucose subunits. Being the most abundant organic compound in the biosphere and comprising a large percentage of all plant biomass, cellulose is extremely plentiful and has a significant role in nature. Cellulose is present in plant cell walls, in commercial products such as those made from wood or cotton, and is of interest to the biofuel industry as a potential alternative fuel source. Although indigestible by humans, cellulose is nutritionally valuable, serving as a dietary fiber. Because of its ubiquity and importance in many areas, studying cellulose will prove to …