Open Access. Powered by Scholars. Published by Universities.®

Computational Biology Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 17 of 17

Full-Text Articles in Computational Biology

Towards More Complete Metagenomic Analyses Through Circularized Genomes And Conjugative Elements, Benjamin R. Joris Aug 2022

Towards More Complete Metagenomic Analyses Through Circularized Genomes And Conjugative Elements, Benjamin R. Joris

Electronic Thesis and Dissertation Repository

Advancements in sequencing technologies have revolutionized biological sciences and led to the emergence of a number of fields of research. One such field of research is metagenomics, which is the study of the genomic content of complex communities of bacteria. The goal of this thesis was to contribute computational methodology that can maximize the data generated in these studies and to apply these protocols human and environmental metagenomic samples.

Standard metagenomic analyses include a step for binning of assembled contigs, which has previously been shown to exclude mobile genetic elements, and I demonstrated that this phenomenon extends to all conjugative …


Modeling Electrostatics In Molecular Biology And Its Relevance With Molecular Mechanisms Of Diseases, Mahesh Koirala Aug 2022

Modeling Electrostatics In Molecular Biology And Its Relevance With Molecular Mechanisms Of Diseases, Mahesh Koirala

All Dissertations

Electrostatics plays an essential role in molecular biology. Modeling electrostatics in molecular biology is complicated due to the water phase, mobile ions, and irregularly shaped inhomogeneous biological macromolecules. This dissertation presents the popular DelPhi package that solves PBE and delivers the electrostatic potential distribution of biomolecules. We used the newly developed DelPhiForce steered Molecular Dynamics (DFMD) approach to model the binding of barstar to barnase and demonstrated that the first-principles method could also model the binding. This dissertation also reflects the use of existing computational approaches to model the effects of Single Amino Acid Variations (SAVs) to reveal molecular mechanisms …


In Silico Characterization Of Protein-Protein Interactions Mediated By Short Linear Motifs, Heidy Elkhaligy Jun 2022

In Silico Characterization Of Protein-Protein Interactions Mediated By Short Linear Motifs, Heidy Elkhaligy

FIU Electronic Theses and Dissertations

Short linear motifs (SLiMs), often found in intrinsically disordered regions (IDPs), can initiate protein-protein interactions in eukaryotes. Although pathogens tend to have less disorder than eukaryotes, their proteins alter host cellular function through molecular mimicry of SLiMs. The first objective was to study sequence-based structure properties of viral SLiMs in the ELM database and the conservation of selected viral motifs involved in the virus life cycle. The second objective was to compare the structural features for SliMs in pathogens and eukaryotes in the ELM database. Our analysis showed that many viral SliMs are not found in IDPs, particularly glycosylation motifs. …


An Investigation Of Epigenetic Mechanisms Driving The Biology Of Head And Neck Squamous Cell Carcinoma, Scot Carson Callahan May 2022

An Investigation Of Epigenetic Mechanisms Driving The Biology Of Head And Neck Squamous Cell Carcinoma, Scot Carson Callahan

Dissertations & Theses (Open Access)

Head and neck squamous cell carcinoma (HNSCC) is the 6th most common cancer worldwide and is associated with significant morbidity and mortality. To date, the majority of work in the field has focused on genomic alterations such as mutations and copy number alterations. However, the clinical success of targeted therapies that exploit known genomic alterations, such as EGFR mutations, has remained mixed. Over the past decade, the importance of epigenetic regulators has come to the forefront, with the realization that many of these genes are mutated in cancer. Despite this realization, the role of epigenetics in regulating tumorigenesis, progression and …


Unveiling Global Roles Of G-Quadruplexes And G4-22 In Human Genetics, Ruth Barros De Paula Aug 2021

Unveiling Global Roles Of G-Quadruplexes And G4-22 In Human Genetics, Ruth Barros De Paula

Dissertations & Theses (Open Access)

G-quadruplexes are non-B DNA structures formed by four or more runs of repeated guanines that confer unique features to living organism’s genomes. These sequences are enriched in regulatory regions, such as promoters and 5’ UTRs, and have distinct regulatory roles in both health and disease states. Even though previous studies showed the impact of G4 in gene expression, none of them summarized the location-specific effect of G4. Also, there is no broad understanding about the most common G4 repeat in the human genome, named here as G4-22, and how it links to the evolution of mammals and their biology. In …


Deciphering The Ck2-Dependent Phosphoproteome And Its Integration With Regulatory Ptm Networks, Teresa Nunez De Villavicencio Diaz Nov 2020

Deciphering The Ck2-Dependent Phosphoproteome And Its Integration With Regulatory Ptm Networks, Teresa Nunez De Villavicencio Diaz

Electronic Thesis and Dissertation Repository

Protein functions are regulated by the post-translational addition of covalent modifications on certain amino acids. Depending on their distance within the 3-dimensional structure, addition/removal of individual post translational modifications (PTMs) can be impacted by others. This PTM interplay constitutes an essential regulatory mechanism that interconnects the molecular networks in the cell. Protein CK2, a clinically relevant acidophilic Ser/Thr kinase, may be responsible for 10-20% of the human phosphoproteome. Such estimates agree with the number of known substrates, which continues to expand. Furthermore, the demonstration that CK2 participates in hierarchical phosphorylation and has similar sequence determinants to caspases suggest extensive PTM …


Microbial Ecology Of South Florida Surface Waters: Examining The Potential For Anthropogenic Influences, Chase P. Donnelly Aug 2018

Microbial Ecology Of South Florida Surface Waters: Examining The Potential For Anthropogenic Influences, Chase P. Donnelly

HCNSO Student Theses and Dissertations

South Florida contains one of the largest subtropical wetlands in the world, and yet not much is known about the microbes that live in these surface waters. These microbes play an important role in chemical cycling and maintaining good water quality for both human and ecosystem health. The hydrology of Florida’s surface waters is tightly regulated with the use of canal and levee systems run by the US Army Corps of Engineers and The South Florida Water Management District. These canals run through the Everglades, agriculture, and urban environments to control water levels in Lake Okeechobee, the Water Conservation Areas, …


Copy Number Variation In The Porcine Genome Detected From Whole-Genome Sequence, Rebecca Anderson Mar 2018

Copy Number Variation In The Porcine Genome Detected From Whole-Genome Sequence, Rebecca Anderson

Honors Theses

Copy number variations (CNVs) are large insertions, deletions, and duplications in the genome that vary between individuals in a species. These variations are known to impact a broad range of phenotypes from molecular-level traits to higher-order clinical phenotypes. CNVs have been linked to complex traits in humans such as autism, attention deficit hyperactivity disorder, nervous system disorders, and early-onset extreme obesity. In this study, whole-genome sequence was obtained from 72 founders of an intensely phenotyped experimental swine herd at the U.S. Meat Animal Research Center (USMARC) in Clay Center, Nebraska. This included 24 boars (12 Duroc and 12 Landrace) and …


Mrub_1325, Mrub_1326, Mrub_1327, And Mrub_1328 Are Orthologs Of B_3454, B_3455, B_3457, B_3458, Respectively Found In Escherichia Coli Coding For A Branched Chain Amino Acid Atp Binding Cassette (Abc) Transporter System, Bennett Tomlin, Adam Buric, Dr. Lori Scott Jan 2018

Mrub_1325, Mrub_1326, Mrub_1327, And Mrub_1328 Are Orthologs Of B_3454, B_3455, B_3457, B_3458, Respectively Found In Escherichia Coli Coding For A Branched Chain Amino Acid Atp Binding Cassette (Abc) Transporter System, Bennett Tomlin, Adam Buric, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

In this project we investigated the biological function of the genes Mrub_1325, Mrub_1326, Mrub_1327, and Mrub_1328 (KEGG map number 02010). We predict these genes encode components of a Branched Chain Amino Acid ATP Binding Cassette (ABC) transporter: 1) Mrub_1325 (DNA coordinates 1357399-1358130 on the reverse strand) encodes the ATP binding domain; 2) Mrub_1326 (DNA coordinates 1358127-1359899 on the reverse strand) encodes the ATP-binding domain and permease domain; 3) Mrub_1327 (DNA coordinates 1359899-1360930 on the reverse strand) encodes a permease domain; and 4)Mrub_1328 (DNA coordinates 1711022-1712185 on the reverse strand) encodes the substrate binding domain. This system is not predicted to …


Machine Learning Based Protein Sequence To (Un)Structure Mapping And Interaction Prediction, Sumaiya Iqbal Aug 2017

Machine Learning Based Protein Sequence To (Un)Structure Mapping And Interaction Prediction, Sumaiya Iqbal

University of New Orleans Theses and Dissertations

Proteins are the fundamental macromolecules within a cell that carry out most of the biological functions. The computational study of protein structure and its functions, using machine learning and data analytics, is elemental in advancing the life-science research due to the fast-growing biological data and the extensive complexities involved in their analyses towards discovering meaningful insights. Mapping of protein’s primary sequence is not only limited to its structure, we extend that to its disordered component known as Intrinsically Disordered Proteins or Regions in proteins (IDPs/IDRs), and hence the involved dynamics, which help us explain complex interaction within a cell that …


Annotation And Identification Of Several Glycerolipid Metabolic Related Ortholog Genes; Mrub_0437, Mrub_1813 And Mrub_2759 In The Organism Meithermus Ruber And Their Predicted Respective Orthologs B3926, B4042 And Bo514 Found In E.Coli., Abdul Rahman Abdul Kader, Dr. Lori R. Scott Jan 2017

Annotation And Identification Of Several Glycerolipid Metabolic Related Ortholog Genes; Mrub_0437, Mrub_1813 And Mrub_2759 In The Organism Meithermus Ruber And Their Predicted Respective Orthologs B3926, B4042 And Bo514 Found In E.Coli., Abdul Rahman Abdul Kader, Dr. Lori R. Scott

Meiothermus ruber Genome Analysis Project

We predict Mrub_0437 encodes the enzyme glycerol kinase (DNA coordinates [417621..419183), which is an intermediary step of the glycerolipid metabolic pathway (KEGG map00561), It catalyzes the conversion of glycerol to sn-Glycerol-3-phosphate. The E. coli K12 MG1655 ortholog is predicted to be b3926.

We predict Mrub_1813 encodes the enzyme diacylglycerol kinase (DNA coordinates [1864659..1865063), which is an intermediary step of the glycerolipid metabolic pathway (KEGG map00561), It catalyzes the conversion of 1,2-diacyl-sn-glycerol to 1,2-diacyl-sn-glycerol 3-phosphate. The E. coli K12 MG1655 ortholog is predicted to be b4042.

We predict Mrub_2759 encodes the enzyme glycerol kinase (DNA coordinates [2799712..2800665), which is an intermediary …


Punctuated Evolution Within A Eurythermic Genus (Mesenchytraeus) Of Segmented Worms: Genetic Modification Of The Glacier Ice Worm F1f0 Atp Synthase, Shirley A. Lang Dec 2016

Punctuated Evolution Within A Eurythermic Genus (Mesenchytraeus) Of Segmented Worms: Genetic Modification Of The Glacier Ice Worm F1f0 Atp Synthase, Shirley A. Lang

Graduate School of Biomedical Sciences Theses and Dissertations

Segmented worms (Annelida) are among the most successful animal inhabitants of extreme environments worldwide. An unusual group of Mesenchytraeus worms endemic to the Pacific Northwest of North America occupy geographically proximal ecozones ranging from low elevation temperate rainforests to high altitude glaciers. Along this altitudinal transect, Mesenchytraeus representatives from disparate habitat types were collected and subjected to deep mitochondrial and nuclear phylogenetic analyses. Evidence presented here employing modern bioinformatic analyses (i.e., maximum likelihood, Bayesian inference, multi-species coalescent) supports a Mesenchytraeus “explosion” in the upper Miocene (5-10 million years ago) that gave rise to ice, snow and terrestrial worms, derived from …


Ten Simple Rules For Taking Advantage Of Git And Github, Yasset Perez-Riverol, Laurent Gatto, Rui Wang, Timo Sachsenberg, Julian Uszkoreit, Felipe Da Veiga Leprevost, Christian Fufezan, Tobias Ternent, Stephen J. Eglen, Daniel S. Katz, Tom J. Pollard, Alexander Konovalov, Robert M. Flight, Kai Blin, Juan Antonio Vizcaíno Jul 2016

Ten Simple Rules For Taking Advantage Of Git And Github, Yasset Perez-Riverol, Laurent Gatto, Rui Wang, Timo Sachsenberg, Julian Uszkoreit, Felipe Da Veiga Leprevost, Christian Fufezan, Tobias Ternent, Stephen J. Eglen, Daniel S. Katz, Tom J. Pollard, Alexander Konovalov, Robert M. Flight, Kai Blin, Juan Antonio Vizcaíno

Molecular and Cellular Biochemistry Faculty Publications

No abstract provided.


Hpcnmf: A High-Performance Toolbox For Non-Negative Matrix Factorization, Karthik Devarajan, Guoli Wang Feb 2016

Hpcnmf: A High-Performance Toolbox For Non-Negative Matrix Factorization, Karthik Devarajan, Guoli Wang

COBRA Preprint Series

Non-negative matrix factorization (NMF) is a widely used machine learning algorithm for dimension reduction of large-scale data. It has found successful applications in a variety of fields such as computational biology, neuroscience, natural language processing, information retrieval, image processing and speech recognition. In bioinformatics, for example, it has been used to extract patterns and profiles from genomic and text-mining data as well as in protein sequence and structure analysis. While the scientific performance of NMF is very promising in dealing with high dimensional data sets and complex data structures, its computational cost is high and sometimes could be critical for …


A Pipeline For Creation Of Genome-Scale Metabolic Reconstructions, Shaun W. Norris Jan 2016

A Pipeline For Creation Of Genome-Scale Metabolic Reconstructions, Shaun W. Norris

Theses and Dissertations

The decreasing costs of next generation sequencing technologies and the increasing speeds at which they work have lead to an abundance of 'omic datasets. The need for tools and methods to analyze, annotate, and model these datasets to better understand biological systems is growing. Here we present a novel software pipeline to reconstruct the metabolic model of an organism in silico starting from its genome sequence and a novel compilation of biological databases to better serve the generation of metabolic models. We validate these methods using five Gardnerella vaginalis strains and compare the gene annotation results to NCBI and the …


An Exploration Of The Phylogenetic Placement Of Recently Discovered Ultrasmall Archaeal Lineages, Jeffrey M. O'Brien Aug 2015

An Exploration Of The Phylogenetic Placement Of Recently Discovered Ultrasmall Archaeal Lineages, Jeffrey M. O'Brien

Honors Scholar Theses

In recent years, several new clades within the domain Achaea have been discovered. This is due in part to microbiological sampling of novel environments, and the increasing ability to detect and sequence uncultivable organisms through metagenomic analysis. These organisms share certain features, such as small cell size and streamlined genomes. Reduction in genome size can present difficulties to phylogenetic reconstruction programs. Since there is less genetic data to work with, these organisms often have missing genes in concatenated multiple sequence alignments. Evolutionary Biologists have not reached a consensus on the placement of these lineages in the archaeal evolutionary tree. There …


Comparative Genomics Of Microbial Chemoreceptor Sequence, Structure, And Function, Aaron Daniel Fleetwood Dec 2014

Comparative Genomics Of Microbial Chemoreceptor Sequence, Structure, And Function, Aaron Daniel Fleetwood

Doctoral Dissertations

Microbial chemotaxis receptors (chemoreceptors) are complex proteins that sense the external environment and signal for flagella-mediated motility, serving as the GPS of the cell. In order to sense a myriad of physicochemical signals and adapt to diverse environmental niches, sensory regions of chemoreceptors are frenetically duplicated, mutated, or lost. Conversely, the chemoreceptor signaling region is a highly conserved protein domain. Extreme conservation of this domain is necessary because it determines very specific helical secondary, tertiary, and quaternary structures of the protein while simultaneously choreographing a network of interactions with the adaptor protein CheW and the histidine kinase CheA. This dichotomous …