Open Access. Powered by Scholars. Published by Universities.®

Computational Biology Commons

Articles 1 - 18 of 18

Full-Text Articles in Computational Biology

Alterations Of The Gut Mycobiome In Patients With Ms - A Bioinformatic Approach, Saumya Shah May 2022

Honors Scholar Theses

The mycobiome is the fungal component of the gut microbiome and is implicated in several autoimmune diseases. However, its role in multiple sclerosis (MS) has not been studied. We performed descriptive and formal statistical tests using the R language to characterize the gut mycobiome in people with MS (pwMS) and healthy controls. We found that the microbiome composition of multiple sclerosis patients differs from that of healthy people. The mycobiome had significantly higher alpha diversity and inter-subject variation in pwMS than in controls. Additionally, Saccharomyces and Aspergillus were over-represented in pwMS. Different mycobiome profiles, defined as mycotypes, were associated with different bacterial …
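
The abstract does not say which alpha diversity measure was used, and the original analysis was done in R. As a rough illustration only, the Shannon index is one common measure, and it can be sketched in Python as follows. The function name `shannon_index` and the example counts are hypothetical and are not taken from the thesis.

```python
import math

def shannon_index(counts):
    """Shannon diversity H = -sum(p_i * ln p_i) over taxa with nonzero counts.

    `counts` is a list of per-taxon read counts for one sample
    (hypothetical input format, not the thesis's data layout).
    """
    total = sum(counts)
    if total == 0:
        return 0.0
    # Taxa with zero reads contribute nothing to the sum.
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)

# Two made-up samples: evenly spread reads vs. one dominant taxon.
even_sample = [25, 25, 25, 25]
skewed_sample = [97, 1, 1, 1]
print(shannon_index(even_sample), shannon_index(skewed_sample))
```

A sample whose reads are spread evenly across taxa scores higher than one dominated by a single taxon, which is the sense in which one group can have "higher alpha diversity" than another.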


Unveiling Global Roles Of G-Quadruplexes And G4-22 In Human Genetics, Ruth Barros De Paula Aug 2021

Dissertations & Theses (Open Access)

G-quadruplexes are non-B DNA structures formed by four or more runs of repeated guanines that confer unique features to living organisms’ genomes. These sequences are enriched in regulatory regions, such as promoters and 5’ UTRs, and have distinct regulatory roles in both health and disease states. Even though previous studies showed the impact of G4 on gene expression, none of them summarized the location-specific effect of G4. Also, there is no broad understanding of the most common G4 repeat in the human genome, named here as G4-22, and how it links to the evolution of mammals and their biology. In …
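
To make the "four or more runs of repeated guanines" description concrete, here is a minimal Python sketch of a widely used sequence heuristic for putative G-quadruplex motifs: four runs of at least three guanines separated by loops of one to seven bases. This pattern, the loop-length limits, and the function name `find_putative_g4` are assumptions chosen for illustration; the abstract does not state which definition or tool the dissertation used.

```python
import re

# Heuristic motif (an assumption, not the dissertation's stated method):
# four runs of >= 3 guanines, each pair separated by a loop of 1-7 bases.
G4_PATTERN = re.compile(
    r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}"
)

def find_putative_g4(sequence):
    """Return (start, end, motif) for non-overlapping matches on the given strand."""
    seq = sequence.upper()
    return [(m.start(), m.end(), m.group()) for m in G4_PATTERN.finditer(seq)]

# Made-up example sequence with one candidate motif.
for start, end, motif in find_putative_g4("ttGGGAGGGTGGGAGGGtt"):
    print(start, end, motif)
```

The sketch scans only the strand it is given and reports non-overlapping matches; a fuller search would also scan the reverse complement (equivalently, look for C-runs).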


Composition And Homology In The Taxonomic Classification Of Escherichia Coli, Tanya Irani Jan 2021

Theses and Dissertations (Comprehensive)

As new techniques have been introduced, specifically the possibility of complete genome sequencing, better methods of defining bacterial species have also been proposed. One of the most recently proposed methods, using bioinformatic techniques, is to calculate the average nucleotide identity (ANI) between the homologous genome segments of different isolates. Another method for species discrimination that has been tested successfully is the similarity of DNA compositional signatures. However, in a recent update, DNA signatures split the available Escherichia coli complete genomes into three groups. To check if this result was consistent with such genomes belonging to different species, we tested methods …


Polerovirus Genomic Variation And Mechanisms Of Silencing Suppression By P0 Protein, Natalie Holste Nov 2020

School of Biological Sciences: Dissertations, Theses, and Student Research

The family Luteoviridae consists of three genera: Luteovirus, Enamovirus, and Polerovirus. The genus Polerovirus contains 32 virus species. All are transmitted by aphids and can infect a wide variety of crops from cereals and wheat to cucurbits and peppers. However, little is known about how this wide range of hosts and vectors developed. In poleroviruses, aphid transmission and virion formation are mediated by the coat protein read-through domain (CPRT), while silencing suppression and phloem limitation are mediated by Protein 0 (P0), a protein unique to poleroviruses. P0 gives poleroviruses a great advantage amongst plant viruses and diversifies polerovirus species, but the …


Mrub_3019 Casa Gene Is An Ortholog To E. Coli B2760, Kelsey Heiland, Dr. Lori Scott Feb 2019

Meiothermus ruber Genome Analysis Project

This research is part of the Meiothermus ruber genome annotation project, which aims to predict gene function with various bioinformatics tools. We investigated the function of Mrub_3019, which encodes the CasA protein involved in the multi-subunit effector complex for the CRISPR-Cas immunity system, and predicted it to be an ortholog of E. coli K12 MG1655 b2760 (casA). We predicted that Mrub_3019 encodes the protein CasA, which is involved in PAM recognition in the CRISPR interference pathway. Foreign DNA will bind to CasA, which signals Cas3 for helicase-mediated DNA degradation. Our hypothesis is supported by low E-values for pairwise alignment in NCBI …


Mrub_3015 Is Orthologous To The B2757 Gene Found In Escherichia Coli Coding For Casd, Ramona Collins, Dr. Lori Scott Feb 2019

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We investigated the biological function of the gene Mrub_3015, which we hypothesize is a component of the CRISPR-Cas prokaryotic defense system. We predict that Mrub_3015 (DNA coordinates 3055550...3056245) encodes the CRISPR-associated protein cas5, which is integral to maintaining the crRNA-DNA structure, keeping the complex from base pairing with the target phage DNA. Our hypothesis is supported by identical hits for Mrub_3015 and b2757 to the KEGG, Pfam, TIGRfam, CDD and PDB databases as well as a …


Mrub_3018 Is Orthologous To E. Coli B2759 (Casb), Kyle Parker, Dr. Lori Scott Feb 2019

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We studied the biological activity of the Mrub_3018 gene, which we hypothesize is orthologous to E. coli gene B2759. We predicted that Mrub_3018 (DNA coordinates 3057916…3058524) encodes the protein CasB. CasB is a protein in the CRISPR CASCADE that will function as a structural protein. When the rest of the proteins form an “S” formation, CasB will connect the front and back of the “S”, creating a backbone for the structure. It will help bind DNA …


Mrub_3014 Is Orthologous To B2756, Samir Abdelkarim, Dr. Lori Scott Jan 2019

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We investigated the biological function of the gene Mrub_3014, which we hypothesize is a component of the CRISPR-Cas prokaryotic defense system. We predict that Mrub_3014 (DNA coordinates 3054943..3055575) encodes the CRISPR-associated protein Cse3/CasE, which functions as an endonuclease. Our hypothesis is supported by identical hits for Mrub_3014 and b2756 to the KEGG, Pfam, TIGRfam, CDD and PDB databases, as well as a low E-value for a pairwise NCBI BLAST comparison. Both protein products are predicted to be localized …


M. Ruber Mrub_3013 Is Orthologous To E. Coli B2755, Laura Butcher, Dr. Lori Scott Jan 2019

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We investigated the biological function of the gene with the M. ruber locus tag Mrub_3013, which we hypothesize is orthologous to b2755 in E. coli K12 MG1655 (a.k.a. Cas1) and is a component of the CRISPR-Cas prokaryotic defense system in M. ruber. We predict that Mrub_3013 (DNA coordinates 3,053,978-3,054,940) encodes the protein Cas1, which, as part of the CRISPR-Cas system, selects and cuts the foreign …


Mrub_3020, A Paralog Of Mrub_1489, Is Orthologous To E. Coli Casc (Locus Tag B2761), Alfred Dei-Ampeh, Dr. Lori Scott Jan 2019

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We investigated the biological functions of two genes: mrub_3020 and mrub_1489. We make two hypotheses in this investigation: a) mrub_3020 is orthologous to the gene b2761 in E. coli K12 MG1655 (a.k.a. casC); b) mrub_1489 is a paralog of mrub_3020. We also predict that the two genes encode unique proteins: mrub_3020 with DNA coordinates 3060491…3063190 encodes a CRISPR-associated helicase (Cas3) that supports the Cascade complex of the CRISPR-Cas adaptive immune system …


Microbial Ecology Of South Florida Surface Waters: Examining The Potential For Anthropogenic Influences, Chase P. Donnelly Aug 2018

HCNSO Student Theses and Dissertations

South Florida contains one of the largest subtropical wetlands in the world, and yet not much is known about the microbes that live in these surface waters. These microbes play an important role in chemical cycling and maintaining good water quality for both human and ecosystem health. The hydrology of Florida’s surface waters is tightly regulated with the use of canal and levee systems run by the US Army Corps of Engineers and The South Florida Water Management District. These canals run through the Everglades, agriculture, and urban environments to control water levels in Lake Okeechobee, the Water Conservation Areas, …


Mrub_1325, Mrub_1326, Mrub_1327, And Mrub_1328 Are Orthologs Of B_3454, B_3455, B_3457, B_3458, Respectively Found In Escherichia Coli Coding For A Branched Chain Amino Acid Atp Binding Cassette (Abc) Transporter System, Bennett Tomlin, Adam Buric, Dr. Lori Scott Jan 2018

Meiothermus ruber Genome Analysis Project

In this project we investigated the biological function of the genes Mrub_1325, Mrub_1326, Mrub_1327, and Mrub_1328 (KEGG map number 02010). We predict these genes encode components of a Branched Chain Amino Acid ATP Binding Cassette (ABC) transporter: 1) Mrub_1325 (DNA coordinates 1357399-1358130 on the reverse strand) encodes the ATP binding domain; 2) Mrub_1326 (DNA coordinates 1358127-1359899 on the reverse strand) encodes the ATP-binding domain and permease domain; 3) Mrub_1327 (DNA coordinates 1359899-1360930 on the reverse strand) encodes a permease domain; and 4) Mrub_1328 (DNA coordinates 1711022-1712185 on the reverse strand) encodes the substrate binding domain. This system is not predicted to …


Annotation And Identification Of Several Glycerolipid Metabolic Related Ortholog Genes; Mrub_0437, Mrub_1813 And Mrub_2759 In The Organism Meithermus Ruber And Their Predicted Respective Orthologs B3926, B4042 And Bo514 Found In E.Coli., Abdul Rahman Abdul Kader, Dr. Lori R. Scott Jan 2017

Meiothermus ruber Genome Analysis Project

We predict Mrub_0437 encodes the enzyme glycerol kinase (DNA coordinates 417621..419183), which acts at an intermediary step of the glycerolipid metabolic pathway (KEGG map00561). It catalyzes the conversion of glycerol to sn-Glycerol-3-phosphate. The E. coli K12 MG1655 ortholog is predicted to be b3926.

We predict Mrub_1813 encodes the enzyme diacylglycerol kinase (DNA coordinates 1864659..1865063), which acts at an intermediary step of the glycerolipid metabolic pathway (KEGG map00561). It catalyzes the conversion of 1,2-diacyl-sn-glycerol to 1,2-diacyl-sn-glycerol 3-phosphate. The E. coli K12 MG1655 ortholog is predicted to be b4042.

We predict Mrub_2759 encodes the enzyme glycerol kinase (DNA coordinates 2799712..2800665), which is an intermediary …


Genomic Characterization Of Polyps In Familial Adenomatous Polyposis Patients And Identification Of Candidate Chemopreventive Drugs, Francis A. San Lucas Aug 2014

Dissertations & Theses (Open Access)

Familial adenomatous polyposis (FAP) is an autosomal dominant disease characterized by APC germline mutations and the development of hundreds to thousands of premalignant adenomas in the gastrointestinal tract at a young age. If left untreated, these patients inevitably develop colon cancer (CRC) and small bowel tumors. We performed exome sequencing of samples from 12 FAP patients to characterize adenomas and to identify candidate genes of adenoma development that may serve as potential targets for chemoprevention drug development. From each patient, a blood sample and at least one polyp were sequenced, with a total of 25 polyps analyzed. In some cases, normal …


Transcriptome Analysis Of Sea Lamprey Embryogenesis, Zakary Ilya Yermolenko May 2014

Seton Hall University Dissertations and Theses (ETDs)

The sea lamprey (Petromyzon marinus) has survived throughout evolution for hundreds of millions of years. It is considered an invasive species in the Great Lakes that has caused dramatic changes in the ecosystem for fish communities, resulting in the collapse of a fishing industry that was previously valued at billions of dollars. Successful management of the sea lamprey is essential to a sustainable fishing industry and biodiversity. Therefore, sea lamprey embryos were studied at various stages of development by growing them in a simulated habitat. RNAs from adult female ovaries and embryos at different time points during embryogenesis …


Small Rna Expression During Programmed Rearragement Of A Vertebrate Genome, Joseph R. Herdy Iii Jan 2014

Theses and Dissertations--Biology

The sea lamprey (Petromyzon marinus) undergoes programmed genome rearrangements (PGRs) during embryogenesis that result in the deletion of ~0.5 Gb of germline DNA from the somatic lineage. The underlying mechanism of these rearrangements remains largely unknown. miRNAs (microRNAs) and piRNAs (PIWI interacting RNAs) are two classes of small noncoding RNAs that play important roles in early vertebrate development, including differentiation of cell lineages, modulation of signaling pathways, and clearing of maternal transcripts. Here, I utilized next generation sequencing to determine the temporal expression of miRNAs, piRNAs, and other small noncoding RNAs during the first five days of lamprey …


Evolving Hard Problems: Generating Human Genetics Datasets With A Complex Etiology, Daniel S Himmelstein, Casey S Greene, Jason H Moore Jul 2011

Dartmouth Scholarship

Background: A goal of human genetics is to discover genetic factors that influence individuals' susceptibility to common diseases. Most common diseases are thought to result from the joint failure of two or more interacting components instead of single component failures. This greatly complicates both the task of selecting informative genetic variants and the task of modeling interactions between them. We and others have previously developed algorithms to detect and model the relationships between these genetic factors and disease. Previously these methods have been evaluated with datasets simulated according to pre-defined genetic models.


Multifactor Dimensionality Reduction Analysis Identifies Specific Nucleotide Patterns Promoting Genetic Polymorphisms, Eric Arehart, Scott Gleim, Bill White, John Hwa, Jason H. Moore Mar 2009

Dartmouth Scholarship

The fidelity of DNA replication serves as the nidus for both genetic evolution and genomic instability fostering disease. Single nucleotide polymorphisms (SNPs) constitute greater than 80% of the genetic variation between individuals. A new theory regarding DNA replication fidelity has emerged in which selectivity is governed by base-pair geometry through interactions between the selected nucleotide, the complementary strand, and the polymerase active site. We hypothesize that specific nucleotide combinations in the flanking regions of SNP fragments are associated with mutation.