Open Access. Powered by Scholars. Published by Universities.®

Biochemistry, Biophysics, and Structural Biology Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 20 of 20

Full-Text Articles in Biochemistry, Biophysics, and Structural Biology

Outlier Profiles Of Atomic Structures Derived From X-Ray Crystallography And From Cryo-Electron Microscopy, Lin Chen, Jing He, Angelo Facchiano Jan 2020

Outlier Profiles Of Atomic Structures Derived From X-Ray Crystallography And From Cryo-Electron Microscopy, Lin Chen, Jing He, Angelo Facchiano

Computer Science Faculty Publications

Background: As more protein atomic structures are determined from cryo-electron microscopy (cryo-EM) density maps, validation of such structures is an important task. Methods: We applied a histogram-based outlier score (HBOS) to six sets of cryo-EM atomic structures and five sets of X-ray atomic structures, including one derived from X-ray data with better than 1.5 Å resolution. Cryo-EM data sets contain structures released by December 2016 and those released between 2017 and 2019, derived from resolution ranges 0–4 Å and 4–6 Å respectively. Results: The distribution of HBOS values in five sets of X-ray structures show that HBOS is sensitive distinguishing …


Deepep: A Deep Learning Framework For Identifying Essential Proteins, Min Zeng, Min Li, Fang-Xiang Wu, Yaohang Li, Yi Pan Dec 2019

Deepep: A Deep Learning Framework For Identifying Essential Proteins, Min Zeng, Min Li, Fang-Xiang Wu, Yaohang Li, Yi Pan

Computer Science Faculty Publications

Background: Essential proteins are crucial for cellular life and thus, identification of essential proteins is an important topic and a challenging problem for researchers. Recently lots of computational approaches have been proposed to handle this problem. However, traditional centrality methods cannot fully represent the topological features of biological networks. In addition, identifying essential proteins is an imbalanced learning problem; but few current shallow machine learning-based methods are designed to handle the imbalanced characteristics. Results: We develop DeepEP based on a deep learning framework that uses the node2vec technique, multi-scale convolutional neural networks and a sampling technique to identify essential proteins. …


Prediction Of Lncrna-Disease Associations Based On Inductive Matrix Completion, Chengqian Lu, Mengyun Yang, Feng Luo, Fang-Xiang Wu, Min Li, Yi Pan, Yaohang Li, Jianxin Wang Apr 2018

Prediction Of Lncrna-Disease Associations Based On Inductive Matrix Completion, Chengqian Lu, Mengyun Yang, Feng Luo, Fang-Xiang Wu, Min Li, Yi Pan, Yaohang Li, Jianxin Wang

Computer Science Faculty Publications

Motivation: Accumulating evidences indicate that long non-coding RNAs (lncRNAs) play pivotal roles in various biological processes. Mutations and dysregulations of lncRNAs are implicated in miscellaneous human diseases. Predicting lncRNA–disease associations is beneficial to disease diagnosis as well as treatment. Although many computational methods have been developed, precisely identifying lncRNA–disease associations, especially for novel lncRNAs, remains challenging.

Results: In this study, we propose a method (named SIMCLDA) for predicting potential lncRNA– disease associations based on inductive matrix completion. We compute Gaussian interaction profile kernel of lncRNAs from known lncRNA–disease interactions and functional similarity of diseases based on disease–gene and gene–gene onotology …


Tracing Actin Filament Bundles In Three-Dimensional Electron Tomography Density Maps Of Hair Cell Stereocilia, Salim Sazzed, Junha Song, Julio Kovacs, Willi Wriggers, Manfred Auer, Jing He Apr 2018

Tracing Actin Filament Bundles In Three-Dimensional Electron Tomography Density Maps Of Hair Cell Stereocilia, Salim Sazzed, Junha Song, Julio Kovacs, Willi Wriggers, Manfred Auer, Jing He

Computer Science Faculty Publications

Cryo-electron tomography (cryo-ET) is a powerful method of visualizing the three-dimensional organization of supramolecular complexes, such as the cytoskeleton, in their native cell and tissue contexts. Due to its minimal electron dose and reconstruction artifacts arising from the missing wedge during data collection, cryo-ET typically results in noisy density maps that display anisotropic XY versus Z resolution. Molecular crowding further exacerbates the challenge of automatically detecting supramolecular complexes, such as the actin bundle in hair cell stereocilia. Stereocilia are pivotal to the mechanoelectrical transduction process in inner ear sensory epithelial hair cells. Given the complexity and dense arrangement of actin …


An Investigation Of Atomic Structures Derived From X-Ray Crystallography And Cryo-Electron Microscopy Using Distal Blocks Of Side-Chains, Lin Chen, Jing He, Salim Sazzed, Rayshawn Walker Jan 2018

An Investigation Of Atomic Structures Derived From X-Ray Crystallography And Cryo-Electron Microscopy Using Distal Blocks Of Side-Chains, Lin Chen, Jing He, Salim Sazzed, Rayshawn Walker

Computer Science Faculty Publications

Cryo-electron microscopy (cryo-EM) is a structure determination method for large molecular complexes. As more and more atomic structures are determined using this technique, it is becoming possible to perform statistical characterization of side-chain conformations. Two data sets were involved to characterize block lengths for each of the 18 types of amino acids. One set contains 9131 structures resolved using X-ray crystallography from density maps with better than or equal to 1.5 Å resolutions, and the other contains 237 protein structures derived from cryo-EM density maps with 2-4 Å resolutions. The results show that the normalized probability density function of block …


An Effective Computational Method Incorporating Multiple Secondary Structure Predictions In Topology Determination For Cryo-Em Images, Abhishek Biswas, Desh Ranjan, Mohammad Zubair, Stephanie Zeil, Kamal Al Nasr, Jing He Jan 2017

An Effective Computational Method Incorporating Multiple Secondary Structure Predictions In Topology Determination For Cryo-Em Images, Abhishek Biswas, Desh Ranjan, Mohammad Zubair, Stephanie Zeil, Kamal Al Nasr, Jing He

Computer Science Faculty Publications

A key idea in de novo modeling of a medium-resolution density image obtained from cryo-electron microscopy is to compute the optimal mapping between the secondary structure traces observed in the density image and those predicted on the protein sequence. When secondary structures are not determined precisely, either from the image or from the amino acid sequence of the protein, the computational problem becomes more complex. We present an efficient method that addresses the secondary structure placement problem in presence of multiple secondary structure predictions and computes the optimal mapping. We tested the method using 12 simulated images from alpha-proteins and …


Comparing An Atomic Model Or Structure To A Corresponding Cryo-Electron Microscopy Image At The Central Axis Of A Helix, Stephanie Zeil, Julio Kovacs, Willy Wriggers, Jing He Jan 2017

Comparing An Atomic Model Or Structure To A Corresponding Cryo-Electron Microscopy Image At The Central Axis Of A Helix, Stephanie Zeil, Julio Kovacs, Willy Wriggers, Jing He

Computer Science Faculty Publications

Three-dimensional density maps of biological specimens from cryo-electron microscopy (cryo-EM) can be interpreted in the form of atomic models that are modeled into the density, or they can be compared to known atomic structures. When the central axis of a helix is detectable in a cryo-EM density map, it is possible to quantify the agreement between this central axis and a central axis calculated from the atomic model or structure. We propose a novel arc-length association method to compare the two axes reliably. This method was applied to 79 helices in simulated density maps and six case studies using cryo-EM …


Deep Models For Brain Em Image Segmentation: Novel Insights And Improved Performance, Ahmed Fakhry, Hanchuan Peng, Shuiwang Ji Jan 2016

Deep Models For Brain Em Image Segmentation: Novel Insights And Improved Performance, Ahmed Fakhry, Hanchuan Peng, Shuiwang Ji

Computer Science Faculty Publications

Motivation: Accurate segmentation of brain electron microscopy (EM) images is a critical step in dense circuit reconstruction. Although deep neural networks (DNNs) have been widely used in a number of applications in computer vision, most of these models that proved to be effective on image classification tasks cannot be applied directly to EM image segmentation, due to the different objectives of these tasks. As a result, it is desirable to develop an optimized architecture that uses the full power of DNNs and tailored specifically for EM image segmentation.

Results: In this work, we proposed a novel design of DNNs for …


Isquest: Finding Insertion Sequences In Prokaryotic Sequence Fragment Data, Abhishek Biswas, David T. Gauthier, Desh Ranjan, Mohammad Zubair Jun 2015

Isquest: Finding Insertion Sequences In Prokaryotic Sequence Fragment Data, Abhishek Biswas, David T. Gauthier, Desh Ranjan, Mohammad Zubair

Computer Science Faculty Publications

Motivation: Insertion sequences (ISs) are transposable elements present in most bacterial and archaeal genomes that play an important role in genomic evolution. The increasing availability of sequenced prokaryotic genomes offers the opportunity to study ISs comprehensively, but development of efficient and accurate tools is required for discovery and annotation. Additionally, prokaryotic genomes are frequently deposited as incomplete, or draft stage because of the substantial cost and effort required to finish genome assembly projects. Development of methods to identify IS directly from raw sequence reads or draft genomes are therefore desirable. Software tools such as Optimized Annotation System for Insertion Sequences …


Template-Based C8-Scorpion: A Protein 8 State Secondary Structure Prediction Method Using Structural Information And Context-Based Features, Ashraf Yaseen, Yaohang Li Jan 2014

Template-Based C8-Scorpion: A Protein 8 State Secondary Structure Prediction Method Using Structural Information And Context-Based Features, Ashraf Yaseen, Yaohang Li

Computer Science Faculty Publications

Background: Secondary structures prediction of proteins is important to many protein structure modeling applications. Correct prediction of secondary structures can significantly reduce the degrees of freedom in protein tertiary structure modeling and therefore reduces the difficulty of obtaining high resolution 3D models.

Methods: In this work, we investigate a template-based approach to enhance 8-state secondary structure prediction accuracy. We construct structural templates from known protein structures with certain sequence similarity. The structural templates are then incorporated as features with sequence and evolutionary information to train two-stage neural networks. In case of structural templates absence, heuristic structural information is incorporated instead. …


Automated Identification Of Cell Type Specific Genes In The Mouse Brain By Image Computing Of Expression Patterns, Rongjian Li, Wenlu Zhang, Shuiwang Ji Jan 2014

Automated Identification Of Cell Type Specific Genes In The Mouse Brain By Image Computing Of Expression Patterns, Rongjian Li, Wenlu Zhang, Shuiwang Ji

Computer Science Faculty Publications

Background: Differential gene expression patterns in cells of the mammalian brain result in the morphological, connectional, and functional diversity of cells. A wide variety of studies have shown that certain genes are expressed only in specific cell-types. Analysis of cell-type-specific gene expression patterns can provide insights into the relationship between genes, connectivity, brain regions, and cell-types. However, automated methods for identifying cell-type-specific genes are lacking to date.

Results: Here, we describe a set of computational methods for identifying cell-type-specific genes in the mouse brain by automated image computing of in situ hybridization (ISH) expression patterns. We applied invariant image feature …


Tracing Beta Strands Using Strandtwister From Cryo-Em Density Maps At Medium Resolutions, Dong Si, Jing He Jan 2014

Tracing Beta Strands Using Strandtwister From Cryo-Em Density Maps At Medium Resolutions, Dong Si, Jing He

Computer Science Faculty Publications

Major secondary structure elements such as α helices and β sheets can be computationally detected from cryoelectron microscopy (cryo-EM) density maps with medium resolutions of 5–10 A˚ . However, a critical piece of information for modeling atomic structures is missing, because there are no tools to detect β strands from cryo-EM maps at medium resolutions. We propose a method, StrandTwister, to detect the traces of β strands through the analysis of twist, an intrinsic nature of a β sheet. StrandTwister has been tested using 100 β sheets simulated at 10 A˚ resolution and 39 β sheets computationally detected from cryo-EM …


Intensity-Based Skeletonization Of Cryoem Gray-Scale Images Using A True Segmentation-Free Algorithm, Kamal Al Nasr, Chunmei Liu, Mugizi Rwebangira, Legand Burge, Jing He Jan 2013

Intensity-Based Skeletonization Of Cryoem Gray-Scale Images Using A True Segmentation-Free Algorithm, Kamal Al Nasr, Chunmei Liu, Mugizi Rwebangira, Legand Burge, Jing He

Computer Science Faculty Publications

Cryo-electron microscopy is an experimental technique that is able to produce 3D gray-scale images of protein molecules. In contrast to other experimental techniques, cryo-electron microscopy is capable of visualizing large molecular complexes such as viruses and ribosomes. At medium resolution, the positions of the atoms are not visible and the process cannot proceed. The medium-resolution images produced by cryo-electron microscopy are used to derive the atomic structure of the proteins in de novo modeling. The skeletons of the 3D gray-scale images are used to interpret important information that is helpful in de novo modeling. Unfortunately, not all features of the …


Estimating Loop Length From Cryoem Images At Medium Resolutions, Andrew Mcknight, Dong Si, Kamal Al Nasr, Andrey Chemikov, Nikos Chrisochoides, Jing He Jan 2013

Estimating Loop Length From Cryoem Images At Medium Resolutions, Andrew Mcknight, Dong Si, Kamal Al Nasr, Andrey Chemikov, Nikos Chrisochoides, Jing He

Computer Science Faculty Publications

Background: De novo protein modeling approaches utilize 3-dimensional (3D) images derived from electron cryomicroscopy (CryoEM) experiments. The skeleton connecting two secondary structures such as α-helices represent the loop in the 3D image. The accuracy of the skeleton and of the detected secondary structures are critical in De novo modeling. It is important to measure the length along the skeleton accurately since the length can be used as a constraint in modeling the protein.

Results: We have developed a novel computational geometric approach to derive a simplified curve in order to estimate the loop length along the skeleton. The method was …


Computational Genetic Neuroanatomy Of The Developing Mouse Brain: Dimensionality Reduction, Visualization, And Clustering, Shuiwang Ji Jan 2013

Computational Genetic Neuroanatomy Of The Developing Mouse Brain: Dimensionality Reduction, Visualization, And Clustering, Shuiwang Ji

Computer Science Faculty Publications

Background: The structured organization of cells in the brain plays a key role in its functional efficiency. This delicate organization is the consequence of unique molecular identity of each cell gradually established by precise spatiotemporal gene expression control during development. Currently, studies on the molecular-structural association are beginning to reveal how the spatiotemporal gene expression patterns are related to cellular differentiation and structural development.

Results: In this article, we aim at a global, data-driven study of the relationship between gene expressions and neuroanatomy in the developing mouse brain. To enable visual explorations of the high-dimensional data, we map the in …


Conformational Sampling In Template Free Protein Loop Structure Modeling: An Overview, Yaohang Li Jan 2013

Conformational Sampling In Template Free Protein Loop Structure Modeling: An Overview, Yaohang Li

Computer Science Faculty Publications

Accurately modeling protein loops is an important step to predict three-dimensional structures as well as to understand functions of many proteins. Because of their high flexibility, modeling the three-dimensional structures of loops is difficult and is usually treated as a "mini protein folding problem" under geometric constraints. In the past decade, there has been remarkable progress in template-free loop structure modeling due to advances of computational methods as well as stably increasing number of known structures available in PDB. This mini review provides an overview on the recent computational approaches for loop structure modeling. In particular, we focus on the …


Estimation Of Alternative Splicing Isoform Frequencies From Rna-Seq Data, Marius Nicolae, Serghei Mangul, Ion I. Măndoiu, Alexander Zelikovskiy Jan 2011

Estimation Of Alternative Splicing Isoform Frequencies From Rna-Seq Data, Marius Nicolae, Serghei Mangul, Ion I. Măndoiu, Alexander Zelikovskiy

Computer Science Faculty Publications

Background: Massively parallel whole transcriptome sequencing, commonly referred as RNA-Seq, is quickly becoming the technology of choice for gene expression profiling. However, due to the short read length delivered by current sequencing technologies, estimation of expression levels for alternative splicing gene isoforms remains challenging.

Results: In this paper we present a novel expectation-maximization algorithm for inference of isoform- and genespecific expression levels from RNA-Seq data. Our algorithm, referred to as IsoEM, is based on disambiguating information provided by the distribution of insert sizes generated during sequencing library preparation, and takes advantage of base quality scores, strand and read pairing information …


Computational Network Analysis Of The Anatomical And Genetic Organizations In The Mouse Brain, Shuiwang Ji Jan 2011

Computational Network Analysis Of The Anatomical And Genetic Organizations In The Mouse Brain, Shuiwang Ji

Computer Science Faculty Publications

Motivation: The mammalian central nervous system (CNS) generates high-level behavior and cognitive functions. Elucidating the anatomical and genetic organizations in the CNS is a key step toward understanding the functional brain circuitry. The CNS contains an enormous number of cell types, each with unique gene expression patterns. Therefore, it is of central importance to capture the spatial expression patterns in the brain. Currently, genome-wide atlas of spatial expression patterns in the mouse brain has been made available, and the data are in the form of aligned 3D data arrays. The sheer volume and complexity of these data pose significant challenges …


The 3rd Computational Structural Bioinformatics Workshop, Jing He, Di Wu Jan 2010

The 3rd Computational Structural Bioinformatics Workshop, Jing He, Di Wu

Computer Science Faculty Publications

As many other domains in biology, molecular structures have proposed challenging but interesting computational problems. The unique challenge of the 3-dimensional molecular structures comes from the combination of the fundamental concepts of physics, chemistry, biology and geometry, and it is often computationally intensive to search for the correct structure. The Computational Structural Bioinformatics Workshop (CSBW) is a workshop that focuses on the fundamental computational work that is related to 3-dimensional molecular structures. This workshop aims to bring together researchers with expertise in bioinformatics, computational biology, structural biology, data mining, optimization and high performance computing to discuss recent results, new techniques, …


Improving Predicted Protein Loop Structure Ranking Using A Pareto-Optimality Consensus Method, Yaohang Li, Ionel Rata, See-Wing Chiu, Erik Jakobsson Jan 2010

Improving Predicted Protein Loop Structure Ranking Using A Pareto-Optimality Consensus Method, Yaohang Li, Ionel Rata, See-Wing Chiu, Erik Jakobsson

Computer Science Faculty Publications

Background

Accurate protein loop structure models are important to understand functions of many proteins. Identifying the native or near-native models by distinguishing them from the misfolded ones is a critical step in protein loop structure prediction.

Results

We have developed a Pareto Optimal Consensus (POC) method, which is a consensus model ranking approach to integrate multiple knowledge- or physics-based scoring functions. The procedure of identifying the models of best quality in a model set includes: 1) identifying the models at the Pareto optimal front with respect to a set of scoring functions, and 2) ranking them based on the fuzzy …