Life Sciences | Open Access Articles | Digital Commons Network™

Efficient Algorithms For Prokaryotic Whole Genome Assembly And Finishing, Abhishek Biswas Oct 2015

Efficient Algorithms For Prokaryotic Whole Genome Assembly And Finishing, Abhishek Biswas

Computer Science Theses & Dissertations

De-novo genome assembly from DNA fragments is primarily based on sequence overlap information. In addition, mate-pair reads or paired-end reads provide linking information for joining gaps and bridging repeat regions. Genome assemblers in general assemble long contiguous sequences (contigs) using both overlapping reads and linked reads until the assembly runs into an ambiguous repeat region. These contigs are further bridged into scaffolds using linked read information. However, errors can be made in both phases of assembly due to high error threshold of overlap acceptance and linking based on too few mate reads. Identical as well as similar repeat regions can …

Go to article

Computational Development For Secondary Structure Detection From Three-Dimensional Images Of Cryo-Electron Microscopy, Dong Si Apr 2015

Computational Development For Secondary Structure Detection From Three-Dimensional Images Of Cryo-Electron Microscopy, Dong Si

Computer Science Theses & Dissertations

Electron cryo-microscopy (cryo-EM) as a cutting edge technology has carved a niche for itself in the study of large-scale protein complex. Although the protein backbone of complexes cannot be derived directly from the medium resolution (5-10 Å) of amino acids from three-dimensional (3D) density images, secondary structure elements (SSEs) such as alpha-helices and beta-sheets can still be detected. The accuracy of SSE detection from the volumetric protein density images is critical for ab initio backbone structure derivation in cryo-EM. So far it is challenging to detect the SSEs automatically and accurately from the density images at these resolutions. This dissertation …

Go to article

De Novo Protein Structure Modeling And Energy Function Design, Lin Chen Jan 2015

De Novo Protein Structure Modeling And Energy Function Design, Lin Chen

Computer Science Theses & Dissertations

The two major challenges in protein structure prediction problems are (1) the lack of an accurate energy function and (2) the lack of an efficient search algorithm. A protein energy function accurately describing the interaction between residues is able to supervise the optimization of a protein conformation, as well as select native or native-like structures from numerous possible conformations. An efficient search algorithm must be able to reduce a conformational space to a reasonable size without missing the native conformation. My PhD research studies focused on these two directions.

A protein energy function—the distance and orientation dependent energy function of …

Go to article

Life Sciences Commons^™

Full-Text Articles in Life Sciences

Efficient Algorithms For Prokaryotic Whole Genome Assembly And Finishing, Abhishek Biswas

Computer Science Theses & Dissertations

Computational Development For Secondary Structure Detection From Three-Dimensional Images Of Cryo-Electron Microscopy, Dong Si

Computer Science Theses & Dissertations

De Novo Protein Structure Modeling And Energy Function Design, Lin Chen

Computer Science Theses & Dissertations