Open Access. Powered by Scholars. Published by Universities.®

Computational Biology Commons

Open Access. Powered by Scholars. Published by Universities.®

Bioinformatics

Genomics

Institution
Publication Year
Publication
Publication Type

Articles 1 - 30 of 38

Full-Text Articles in Computational Biology

Convolutional Neural Network-Based Gene Prediction Using Buffalograss As A Model System, Michael Morikone Nov 2023

Convolutional Neural Network-Based Gene Prediction Using Buffalograss As A Model System, Michael Morikone

Complex Biosystems PhD Program: Dissertations

The task of gene prediction has been largely stagnant in algorithmic improvements compared to when algorithms were first developed for predicting genes thirty years ago. Rather than iteratively improving the underlying algorithms in gene prediction tools by utilizing better performing models, most current approaches update existing tools through incorporating increasing amounts of extrinsic data to improve gene prediction performance. The traditional method of predicting genes is done using Hidden Markov Models (HMMs). These HMMs are constrained by having strict assumptions made about the independence of genes that do not always hold true. To address this, a Convolutional Neural Network (CNN) …


Integrating Omim And Intact Data For The Analysis Of Gene-Phenotype Interactions In Complex Diseases: A Linux-Based Computational Tool For Network Analysis, Devin Keane May 2023

Integrating Omim And Intact Data For The Analysis Of Gene-Phenotype Interactions In Complex Diseases: A Linux-Based Computational Tool For Network Analysis, Devin Keane

All Theses

The field of genetics is constantly evolving. New advances in bioinformatics and computational approaches are leading to exciting new developments in our ability to treat and prevent diseases. Computational genetics provides valuable insights into the complex mechanisms and layers of biological communication that shape an organism's phenotype. Understanding these mechanisms is critical to advancing human health.

The study of diseases in genetics requires a comprehensive understanding of the interactions between various biological processes, including gene expression, protein synthesis, RNA, metabolism, and cell-cell communication. To effectively address the root causes of such diseases, multi-disciplinary approaches that integrate information from different levels …


The Genomics Of Autism-Related Genes Il1rapl1 And Il1rapl2: Insights Into Their Cortical Distribution, Cell-Type Specificity, And Developmental Trajectories, Jacob Weaver Apr 2023

The Genomics Of Autism-Related Genes Il1rapl1 And Il1rapl2: Insights Into Their Cortical Distribution, Cell-Type Specificity, And Developmental Trajectories, Jacob Weaver

MUSC Theses and Dissertations

Neuropsychiatric disorders have a significant impact on modern society. These disorders affect a large percentage of the population: schizophrenia has a world-wide prevalence of 1% and autism spectrum disorders (ASD) affects 1 in 59 school-aged children in the US. There is substantial evidence that most neuropsychiatric disorders have a genetic component. Thus, with the advent of high throughput sequencing much effort has gone into identifying genetic variants associated with these disorders. The emerging picture from these studies is a complex one where hundreds of genes with small effects interact with a varied landscape of common variants to result in disease. …


Towards More Complete Metagenomic Analyses Through Circularized Genomes And Conjugative Elements, Benjamin R. Joris Aug 2022

Towards More Complete Metagenomic Analyses Through Circularized Genomes And Conjugative Elements, Benjamin R. Joris

Electronic Thesis and Dissertation Repository

Advancements in sequencing technologies have revolutionized biological sciences and led to the emergence of a number of fields of research. One such field of research is metagenomics, which is the study of the genomic content of complex communities of bacteria. The goal of this thesis was to contribute computational methodology that can maximize the data generated in these studies and to apply these protocols human and environmental metagenomic samples.

Standard metagenomic analyses include a step for binning of assembled contigs, which has previously been shown to exclude mobile genetic elements, and I demonstrated that this phenomenon extends to all conjugative …


Methods And Tools To Improve Performance Of Plant Genome Analysis, Drew Ferrell Aug 2022

Methods And Tools To Improve Performance Of Plant Genome Analysis, Drew Ferrell

Theses and Dissertations

Multi -omics data analysis and integration facilitates hypothesis building toward an understanding of genes and pathway responses driven by environments. Methods designed to estimate and analyze gene expression, with regard to treatments or conditions, can be leveraged to understand gene-level responses in the cell. However, genes often interact and signal within larger structures such as pathways and networks. Complex studies guided toward describing dynamic genetic pathways and networks require algorithms or methods designed for inference based on gene interactions and related topologies. Classes of algorithms and methods may be integrated into generalized workflows for comparative genomics studies, as multi -omics …


Characterizing Endogenous Dicer Products To Unravel Novel Rnai Biogenesis Pathways, Jacob Oche Peter Jun 2022

Characterizing Endogenous Dicer Products To Unravel Novel Rnai Biogenesis Pathways, Jacob Oche Peter

Dissertations

ABSTRACT

RNA interference (RNAi) is a pervasive gene regulatory mechanism in eukaryotes based on the action of multiple classes of small RNA (sRNA). Exploiting RNAi pathways in non-model systems have great potential for creating potent RNAi technologies. Here, we accessed RNAi-mediated control of gene expression in the two-spotted spider mite, Tetranychus urticae (T. urticae) using engineered dsRNA designed to modulate the host RNAi pathway and increase RNAi efficacy. Analysis of Dicer (Dcr) generated fragments revealed how exogenous RNAs access the host RNAi pathway in this animal, opening avenues for designing RNAi technology for their control. Further, some organisms …


An Investigation Of Epigenetic Mechanisms Driving The Biology Of Head And Neck Squamous Cell Carcinoma, Scot Carson Callahan May 2022

An Investigation Of Epigenetic Mechanisms Driving The Biology Of Head And Neck Squamous Cell Carcinoma, Scot Carson Callahan

Dissertations & Theses (Open Access)

Head and neck squamous cell carcinoma (HNSCC) is the 6th most common cancer worldwide and is associated with significant morbidity and mortality. To date, the majority of work in the field has focused on genomic alterations such as mutations and copy number alterations. However, the clinical success of targeted therapies that exploit known genomic alterations, such as EGFR mutations, has remained mixed. Over the past decade, the importance of epigenetic regulators has come to the forefront, with the realization that many of these genes are mutated in cancer. Despite this realization, the role of epigenetics in regulating tumorigenesis, progression and …


Comparative Genomics Methods And Applications, Emily N. Alden Jul 2021

Comparative Genomics Methods And Applications, Emily N. Alden

Biomedical Sciences ETDs

Virtually all fields of biology have benefited from the advancements in comparative genomics technologies, specifically in the study of evolution. In this dissertation I develop and use comparative genomic technologies to investigate the novel SARS-CoV-2 virus, assembly the first genome of the black lace domestic angelfish and identify germline genetic variants associated with altered breast cancer-specific survival. Our genome tiling array for the novel coronavirus presents a rapid and cost-effective method to sequence the entire viral genome and can be used to track the rapid evolution of viral variants in the population. The domestic angelfish is a member of the …


Analysis Of Subtelomeric Rextal Assemblies Using Quast, Tunazzina Islam, Desh Ranjan, Mohammad Zubair, Eleanor Young, Ming Xiao, Harold Riethman Jan 2021

Analysis Of Subtelomeric Rextal Assemblies Using Quast, Tunazzina Islam, Desh Ranjan, Mohammad Zubair, Eleanor Young, Ming Xiao, Harold Riethman

Computer Science Faculty Publications

Genomic regions of high segmental duplication content and/or structural variation have led to gaps and misassemblies in the human reference sequence, and are refractory to assembly from whole-genome short-read datasets. Human subtelomere regions are highly enriched in both segmental duplication content and structural variations, and as a consequence are both impossible to assemble accurately and highly variable from individual to individual. Recently, we developed a pipeline for improved region-specific assembly called Regional Extension of Assemblies Using Linked-Reads (REXTAL). In this study, we evaluate REXTAL and genome-wide assembly (Supernova) approaches on 10X Genomics linked-reads data sets partitioned and barcoded using the …


Composition And Homology In The Taxonomic Classification Of Escherichia Coli, Tanya Irani Jan 2021

Composition And Homology In The Taxonomic Classification Of Escherichia Coli, Tanya Irani

Theses and Dissertations (Comprehensive)

As new techniques have been introduced, specifically the possibility of complete genome sequencing, better methods of defining bacterial species have also been proposed. One of the most recently proposed methods, using bioinformatic techniques, is to calculate the average nucleotide identity (ANI) between the homologous genome segments of different isolates. Another method for species discrimination that has been tested successfully is the similarity of DNA compositional signatures. However, in a recent update, DNA signatures split the available Escherichia coli complete genomes into three groups. To check if this result was consistent with such genomes belonging to different species, we tested methods …


Investigation Of Proliferation Suppressors In Genetic Fitness Screens, Walter Frank Lenoir Iv Dec 2020

Investigation Of Proliferation Suppressors In Genetic Fitness Screens, Walter Frank Lenoir Iv

Dissertations & Theses (Open Access)

Innovation of CRISPR gene-editing technology has provided scientists genome manipulation tools that allowed rapid advancement of scientific capabilities and thus improved our ability to systematically study mammalian genetic functional profiles. Genome-wide CRISPR knockout screens conducted in collections of human cell lines can knock out genes at multiple loci, and have provided new insights into functional roles for independent genes. This method has launched massive efforts in looking across genetic backgrounds for context specific genetic vulnerabilities within cancer. Much of the research effort thus far has been spent on optimizing phenotype distinctions between essential, genes required for cell fitness, and non-essential, …


Decoding The Evolutionary Response To Prostate Cancer Therapy Using Plasma Genome Sequencing, Naveen Ramesh Dec 2020

Decoding The Evolutionary Response To Prostate Cancer Therapy Using Plasma Genome Sequencing, Naveen Ramesh

Dissertations & Theses (Open Access)

Investigating genome evolution in response to therapy is difficult in human tissue samples due to the difficulty in accessing metastatic tumor sites and logistical challenges of collecting longitudinal samples. To overcome these issues, we developed an unbiased whole-genome plasma DNA sequencing approach called PEGASUS that concurrently measures genomic copy number and exome mutations from archival cryostored plasma samples. This approach was applied to study longitudinal blood plasma samples from prostate cancer patients. A molecular characterization of archival plasma DNA from 233 patients and genomic profiling of 101 patients identified clinical correlations of aneuploid plasma DNA profiles with poor survival, increased …


Machine Learning With Digital Signal Processing For Rapid And Accurate Alignment-Free Genome Analysis: From Methodological Design To A Covid-19 Case Study, Gurjit Singh Randhawa Jun 2020

Machine Learning With Digital Signal Processing For Rapid And Accurate Alignment-Free Genome Analysis: From Methodological Design To A Covid-19 Case Study, Gurjit Singh Randhawa

Electronic Thesis and Dissertation Repository

In the field of bioinformatics, taxonomic classification is the scientific practice of identifying, naming, and grouping of organisms based on their similarities and differences. The problem of taxonomic classification is of immense importance considering that nearly 86% of existing species on Earth and 91% of marine species remain unclassified. Due to the magnitude of the datasets, the need exists for an approach and software tool that is scalable enough to handle large datasets and can be used for rapid sequence comparison and analysis. We propose ML-DSP, a stand-alone alignment-free software tool that uses Machine Learning and Digital Signal Processing to …


Simplicity Diffexpress: A Bespoke Cloud-Based Interface For Rna-Seq Differential Expression Modeling And Analysis, Cintia C. Palu, Marcelo Ribeiro-Alves, Yanxin Wu, Brendan Lawlor, Pavel V. Baranov, Brian Kelly, Paul Walsh May 2019

Simplicity Diffexpress: A Bespoke Cloud-Based Interface For Rna-Seq Differential Expression Modeling And Analysis, Cintia C. Palu, Marcelo Ribeiro-Alves, Yanxin Wu, Brendan Lawlor, Pavel V. Baranov, Brian Kelly, Paul Walsh

Department of Computer Science Publications

One of the key challenges for transcriptomics-based research is not only the processing of large data but also modeling the complexity of features that are sources of variation across samples, which is required for an accurate statistical analysis. Therefore, our goal is to foster access for wet lab researchers to bioinformatics tools, in order to enhance their ability to explore biological aspects and validate hypotheses with robust analysis. In this context, user-friendly interfaces can enable researchers to apply computational biology methods without requiring bioinformatics expertise. Such bespoke platforms can improve the quality of the findings by allowing the researcher to …


Mrub_3019 Casa Gene Is An Ortholog To E. Coli B2760, Kelsey Heiland, Dr. Lori Scott Feb 2019

Mrub_3019 Casa Gene Is An Ortholog To E. Coli B2760, Kelsey Heiland, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

This research is part of the Meiothermus ruber genome annotation project which aims to predict gene function with various bioinformatics tools. We investigated the function of Mrub_3019, which encodes the CasA protein involved in the multi-subunit effector complex for the CRISPR-Cas immunity system and predicted it to be an ortholog of E. coli K12 MG1655 b2760 (casA). We predicted that Mrub_3019 encodes the protein CasA, which is involved in PAM recognition of CRISPR interference pathway. Foreign DNA will bind to CasA, which signals Cas3 for helicase-mediated DNA degradation. Our hypothesis is supported by low E-values for pairwise alignment in NCBI …


Mrub_3015 Is Orthologous To The B2757 Gene Found In Escherichia Coli Coding For Casd, Ramona Collins, Dr. Lori Scott Feb 2019

Mrub_3015 Is Orthologous To The B2757 Gene Found In Escherichia Coli Coding For Casd, Ramona Collins, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We investigated the biological function of the gene Mrub_3015, which we hypothesize is a component of the CRISPR-Cas prokaryotic defense system. We predict that Mrub_3015 (DNA coordinates 3055550...3056245) encodes the the CRISPR-associated protein cas5, which is integral in maintaining the crRNA-DNA structure, keeping the complex from base pairing with the target phage DNA. Our hypothesis is supported by identical hits for Mrub_3015 and b2527 to the KEGG, Pfam, TIGRfam, CDD and PDB databases as well as a …


Mrub_3018 Is Orthologous To E. Coli B2759 (Casb), Kyle Parker, Dr. Lori Scott Feb 2019

Mrub_3018 Is Orthologous To E. Coli B2759 (Casb), Kyle Parker, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We studied the biological activity of the Mrub_3018 gene, which we hypothesize is orthologous to E. coli gene B2759. We predicted that Mrub_3018(DNA coordinates 3057916… 3058524) encodes the protein CasB. CasB is a protein in the CRISPR CASCADE that will function as a structural protein. When the rest of the proteins form an “S” formation CasB will connect the front and back of the “S” creating a back bone for the structure. It will help bind DNA …


Mrub_3014 Is Orthologous To B2756, Samir Abdelkarim, Dr. Lori Scott Jan 2019

Mrub_3014 Is Orthologous To B2756, Samir Abdelkarim, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We investigated the biological function of the gene Mrub_3014, which we hypothesize is a component of the CRISPR-Cas prokaryotic defense system. We predict that Mrub_3014 (DNA coordinates 3054943..3055575) encodes CRISPR-associated protein Cse3/case which function as an endonuclease. Our hypothesis is supported by identical hits for Mrub_3014 and b2756 to the KEGG, Pfam, TIGRfam, CDD and PDB databases, as well as a low E-value for a pairwise NCBI BLAST comparison. Both protein products are predicted to be localized …


M. Ruber Mrub_3013 Is Orthologous To E. Coli B2755, Laura Butcher, Dr. Lori Scott Jan 2019

M. Ruber Mrub_3013 Is Orthologous To E. Coli B2755, Laura Butcher, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We investigated the biological function of gene Mrub_3013, which we hypothesize is orthologous to b2755 in E. coli K12 MG1655 (a.k.a. Cas1). We investigated the biological function of a gene with the M. ruber locus tag of Mrub_3013, which we hypothesize is a component of the CRISPR-Cas prokaryotic defense system in M. ruber. We predict that Mrub_3013 (DNA coordinates 3,053,978-3,054,940) encodes the protein Cas1 which as part of the CRISPR-Cas system, selects and cuts the foreign …


Mrub_3020, A Paralog Of Mrub_1489, Is Orthologous To E. Coli Casc (Locus Tag B2761), Alfred Dei-Ampeh, Dr. Lori Scott Jan 2019

Mrub_3020, A Paralog Of Mrub_1489, Is Orthologous To E. Coli Casc (Locus Tag B2761), Alfred Dei-Ampeh, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses a collection of online bioinformatics tools to predict gene function. We investigated the biological functions of two genes: mrub_3020 and mrub_1489. We make two hypotheses in this investigation: a) mrub_3020 is orthologous to the gene b2761 in E. coli K12 MG1655 (a.k.a. casC); b) mrub_1489 is a paralog of mrub_3020. We also predict that the two genes encode unique proteins: mrub_3020 with DNA coordinates 3060491…3063190 encodes a CRISPR – associated helicase (Cas3) that supports the Cascade complex of the CRISPR – Cas adaptive immune system …


Copy Number Variation In The Porcine Genome Detected From Whole-Genome Sequence, Rebecca Anderson Mar 2018

Copy Number Variation In The Porcine Genome Detected From Whole-Genome Sequence, Rebecca Anderson

Honors Theses

Copy number variations (CNVs) are large insertions, deletions, and duplications in the genome that vary between individuals in a species. These variations are known to impact a broad range of phenotypes from molecular-level traits to higher-order clinical phenotypes. CNVs have been linked to complex traits in humans such as autism, attention deficit hyperactivity disorder, nervous system disorders, and early-onset extreme obesity. In this study, whole-genome sequence was obtained from 72 founders of an intensely phenotyped experimental swine herd at the U.S. Meat Animal Research Center (USMARC) in Clay Center, Nebraska. This included 24 boars (12 Duroc and 12 Landrace) and …


Efficient Alignment Algorithms For Dna Sequencing Data, Nilesh Vinod Khiste Jan 2018

Efficient Alignment Algorithms For Dna Sequencing Data, Nilesh Vinod Khiste

Electronic Thesis and Dissertation Repository

The DNA Next Generation Sequencing (NGS) technologies produce data at a low cost, enabling their application to many ambitious fields such as cancer research, disease control, personalized medicine etc. However, even after a decade of research, the modern aligners and assemblers are far from providing efficient and error free genome alignments and assemblies respectively. This is due to the inherent nature of the genome alignment and assembly problem, which involves many complexities. Many algorithms to address this problem have been proposed over the years, but there still is a huge scope for improvement in this research space.

Many new genome …


Mrub_1325, Mrub_1326, Mrub_1327, And Mrub_1328 Are Orthologs Of B_3454, B_3455, B_3457, B_3458, Respectively Found In Escherichia Coli Coding For A Branched Chain Amino Acid Atp Binding Cassette (Abc) Transporter System, Bennett Tomlin, Adam Buric, Dr. Lori Scott Jan 2018

Mrub_1325, Mrub_1326, Mrub_1327, And Mrub_1328 Are Orthologs Of B_3454, B_3455, B_3457, B_3458, Respectively Found In Escherichia Coli Coding For A Branched Chain Amino Acid Atp Binding Cassette (Abc) Transporter System, Bennett Tomlin, Adam Buric, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

In this project we investigated the biological function of the genes Mrub_1325, Mrub_1326, Mrub_1327, and Mrub_1328 (KEGG map number 02010). We predict these genes encode components of a Branched Chain Amino Acid ATP Binding Cassette (ABC) transporter: 1) Mrub_1325 (DNA coordinates 1357399-1358130 on the reverse strand) encodes the ATP binding domain; 2) Mrub_1326 (DNA coordinates 1358127-1359899 on the reverse strand) encodes the ATP-binding domain and permease domain; 3) Mrub_1327 (DNA coordinates 1359899-1360930 on the reverse strand) encodes a permease domain; and 4)Mrub_1328 (DNA coordinates 1711022-1712185 on the reverse strand) encodes the substrate binding domain. This system is not predicted to …


A Polyglot Approach To Bioinformatics Data Integration: Phylogenetic Analysis Of Hiv-1, Steven Reisman, Catherine Putonti, George K. Thiruvathukal, Konstantin Läufer Oct 2017

A Polyglot Approach To Bioinformatics Data Integration: Phylogenetic Analysis Of Hiv-1, Steven Reisman, Catherine Putonti, George K. Thiruvathukal, Konstantin Läufer

Konstantin Läufer

RNA-interference has potential therapeutic use against HIV-1 by targeting highly-functional mRNA sequences that contribute to the virulence of the virus. Empirical work has shown that within cell lines, all of the HIV-1 genes are affected by RNAi-induced gene silencing. While promising, inherent in this treatment is the fact that RNAi sequences must be highly specific. HIV, however, mutates rapidly, leading to the evolution of viral escape mutants. In fact, such strains are under strong selection to include mutations within the targeted region, evading the RNAi therapy and thus increasing the virus’ fitness in the host. Taking a phylogenetic approach, we …


A Polyglot Approach To Bioinformatics Data Integration: Phylogenetic Analysis Of Hiv-1, Steven Reisman, Catherine Putonti, George K. Thiruvathukal, Konstantin Läufer Sep 2017

A Polyglot Approach To Bioinformatics Data Integration: Phylogenetic Analysis Of Hiv-1, Steven Reisman, Catherine Putonti, George K. Thiruvathukal, Konstantin Läufer

Catherine Putonti

RNA-interference has potential therapeutic use against HIV-1 by targeting highly-functional mRNA sequences that contribute to the virulence of the virus. Empirical work has shown that within cell lines, all of the HIV-1 genes are affected by RNAi-induced gene silencing. While promising, inherent in this treatment is the fact that RNAi sequences must be highly specific. HIV, however, mutates rapidly, leading to the evolution of viral escape mutants. In fact, such strains are under strong selection to include mutations within the targeted region, evading the RNAi therapy and thus increasing the virus’ fitness in the host. Taking a phylogenetic approach, we …


Genomics Rna-Seq Analysis Part 2_ Kallisto Indexing And Quantification (Updated 11/17), Ray A. Enke, Melika Rahmani-Mofrad Dec 2016

Genomics Rna-Seq Analysis Part 2_ Kallisto Indexing And Quantification (Updated 11/17), Ray A. Enke, Melika Rahmani-Mofrad

Ray Enke Ph.D.

This in class exercise is a hands on activity designed to teach students about how to run Kallisto indexing quantification using CyVerse DE apps as part of a eukaryotic RNA-seq analysis pipeline.


Genomics Rna-Seq Analysis Part 3-Sleuth Data Visualization (Updated 11/17), Ray A. Enke, Scott Schumacker Dec 2016

Genomics Rna-Seq Analysis Part 3-Sleuth Data Visualization (Updated 11/17), Ray A. Enke, Scott Schumacker

Ray Enke Ph.D.

This in class exercise is a hands on activity designed to teach students about how to run Sleuth statistical modeling and RStudio data visualization package using Kallisto pseudoalignment output files as part of a eukaryotic RNA-seq analysis pipeline.


Development Of An In Silico Kir Genotyping Algorithm And Its Application To Population And Cancer Immunogenetic Analyses, Howard Rosoff Aug 2016

Development Of An In Silico Kir Genotyping Algorithm And Its Application To Population And Cancer Immunogenetic Analyses, Howard Rosoff

Dissertations & Theses (Open Access)

Gene content determination and variant calling in the complex KIR genomic region are useful for immune system function analysis, pathogenesis and disease risk factor elucidation, immunotherapy development, evolutionary investigations, and human migration modeling. Sequence-specific oligonucleotide and sequence-specific primer PCR methods are the de facto standards for KIR presence/absence identification, but the current platforms are unsuitable for SNP calling, impractical for KIR typing large cohorts of DNA samples, and inapplicable for typing repositories in which sequence data, but not cells or cell analytes, are available. Alternative typing methods, such as in silico sequence-based typing, can address the problems associated with amplicon-based …


Computational Identification Of Terpene Synthase Genes And Their Evolutionary Analysis, Qidong Jia May 2016

Computational Identification Of Terpene Synthase Genes And Their Evolutionary Analysis, Qidong Jia

Doctoral Dissertations

Terpenoids, the largest and most structurally and functionally diverse class of natural compounds on earth, are mostly synthesized by plants to be involved in various plant environment interactions. Some terpenoids are classified as primary metabolites essential for plant growth and development. Terpene synthases (TPSs), the key enzymes for terpenoid biosynthesis, are the major determinant of the tremendous diversity of terpenoid carbon skeletons. The TPS genes represent a mid-size family of about 30-100 functional genes in almost all major sequenced plant genomes. TPSs are also found in fungi and bacteria, but microbial TPS genes share low levels of sequence similarity and …


An Exploration Of The Phylogenetic Placement Of Recently Discovered Ultrasmall Archaeal Lineages, Jeffrey M. O'Brien Aug 2015

An Exploration Of The Phylogenetic Placement Of Recently Discovered Ultrasmall Archaeal Lineages, Jeffrey M. O'Brien

Honors Scholar Theses

In recent years, several new clades within the domain Achaea have been discovered. This is due in part to microbiological sampling of novel environments, and the increasing ability to detect and sequence uncultivable organisms through metagenomic analysis. These organisms share certain features, such as small cell size and streamlined genomes. Reduction in genome size can present difficulties to phylogenetic reconstruction programs. Since there is less genetic data to work with, these organisms often have missing genes in concatenated multiple sequence alignments. Evolutionary Biologists have not reached a consensus on the placement of these lineages in the archaeal evolutionary tree. There …