Open Access. Powered by Scholars. Published by Universities.®

Genetics and Genomics Commons

Open Access. Powered by Scholars. Published by Universities.®

Bioinformatics

Institution
Keyword
Publication Year
Publication
Publication Type
File Type

Articles 301 - 330 of 1014

Full-Text Articles in Genetics and Genomics

Transposable Element Expression In Human Embryo Single-Cell Rna-Seq Data, Corinne Sexton Aug 2020

Transposable Element Expression In Human Embryo Single-Cell Rna-Seq Data, Corinne Sexton

UNLV Theses, Dissertations, Professional Papers, and Capstones

Transposable elements (TEs) are genetic sequences which are mobile within the genome, including DNA transposons and retrotransposons. Though the vast majority are no longer able to move or duplicate in humans, they still are actively transcribed in both germline and somatic cells, particularly in early human development. TEs are expressed in an extremely cell-type and stage specific pattern during embryogenesis, suggesting that they may either have a regulatory role in the cell or be transcribed along with cell-specific genes. However, earlier studies have focused on hESC models or early embryos up to day 6, with differing patterns of TE expression. …


Novel Long Non-Coding Rna Cdlinc Promotes Nsclc Progression, Christina J. Moss Jul 2020

Novel Long Non-Coding Rna Cdlinc Promotes Nsclc Progression, Christina J. Moss

USF Tampa Graduate Theses and Dissertations

Non-small cell lung cancer (NSCLC) is the leading cause of cancer death worldwide with a low 5-year survival rate of only around 21%. This low 5-year survival rate is due to two main reasons. First, NSCLC is often diagnosed at the later stages when it has already metastasized. Second, NSCLC is an incredibly diverse, heterogenous disease making it very hard to target the true molecular oncogenic drivers. New targets for personalized therapeutics are needed based on the expression status of each individual lung cancer tumor.

One way of looking for these new therapeutics is to begin by identifying the oncogenotype …


Regulators Of Ectopic Calcification In A Mouse Model Of Dish: A Multi-Omics Perspective, Matthew A. Veras Jun 2020

Regulators Of Ectopic Calcification In A Mouse Model Of Dish: A Multi-Omics Perspective, Matthew A. Veras

Electronic Thesis and Dissertation Repository

Diffuse idiopathic skeletal hyperostosis (DISH) is a non-inflammatory spondyloarthropathy and the second most common form of arthritis characterized by formation of ectopic mineral along the spine. Pathological findings in DISH include regional calcification of the anterior longitudinal ligament, paraspinal connective tissues, and annulus fibrosus (AF) of the intervertebral disc (IVD). Clinical symptoms of DISH include increased spine stiffness, decreased spinal range of motion, and in severe cases dysphagia and spinal cord/nerve root compression. The molecular pathways responsible for DISH have not been delineated and as such, there are no disease-modifying treatments. Clinical treatment for DISH is limited to surgical resection …


B Cell Acute Lymphoblastic Leukemia Is Driven By Activating Janus Kinase Mutations Cooperating With Spi1 And Spib Deletions In A Murine Model, Michelle Lim Jun 2020

B Cell Acute Lymphoblastic Leukemia Is Driven By Activating Janus Kinase Mutations Cooperating With Spi1 And Spib Deletions In A Murine Model, Michelle Lim

Electronic Thesis and Dissertation Repository

B cell acute lymphoblastic leukemia (B-ALL) is caused by genetic lesions in developing B cells that function as drivers for accumulation of additional mutations in an evolutionary selection process. We investigated secondary drivers of leukemogenesis and their mechanism(s) of arising in a mouse model of B-ALL driven by PU.1/Spi-B deletion (Mb1-CreDPB). Whole exome sequencing revealed recurrent mutations in Jak3 (encoding Janus Kinase 3) and Jak1. Mutations with high variant allele frequency (VAF) were dominated by C->T transition mutations that were compatible with AID, whereas the majority of mutations, with low VAF, were dominated by C->A transversions associated with …


Metabolic Network Analysis Of Filamentous Cyanobacteria, Daniel Alexis Norena-Caro Jun 2020

Metabolic Network Analysis Of Filamentous Cyanobacteria, Daniel Alexis Norena-Caro

LSU Doctoral Dissertations

Cyanobacteria were the first organisms to use oxygenic photosynthesis, converting CO2 into useful organic chemicals. However, the chemical industry has historically relied on fossil raw materials to produce organic precursors, which has contributed to global warming. Thus, cyanobacteria have emerged as sustainable stakeholders for biotechnological production. The filamentous cyanobacterium Anabaena sp. UTEX 2576 can metabolize multiple sources of Nitrogen and was studied as a platform for biotechnological production of high-value chemicals (i.e., pigments, antioxidants, vitamins and secondary metabolites). From a Chemical engineering perspective, the biomass generation in this organism was thoroughly studied by interpreting the cell as a microbial …


Machine Learning With Digital Signal Processing For Rapid And Accurate Alignment-Free Genome Analysis: From Methodological Design To A Covid-19 Case Study, Gurjit Singh Randhawa Jun 2020

Machine Learning With Digital Signal Processing For Rapid And Accurate Alignment-Free Genome Analysis: From Methodological Design To A Covid-19 Case Study, Gurjit Singh Randhawa

Electronic Thesis and Dissertation Repository

In the field of bioinformatics, taxonomic classification is the scientific practice of identifying, naming, and grouping of organisms based on their similarities and differences. The problem of taxonomic classification is of immense importance considering that nearly 86% of existing species on Earth and 91% of marine species remain unclassified. Due to the magnitude of the datasets, the need exists for an approach and software tool that is scalable enough to handle large datasets and can be used for rapid sequence comparison and analysis. We propose ML-DSP, a stand-alone alignment-free software tool that uses Machine Learning and Digital Signal Processing to …


Multi-Omics Integration For Gene Fusion Discovery And Somatic Mutation Haplotyping In Cancer, Steven Mason Foltz May 2020

Multi-Omics Integration For Gene Fusion Discovery And Somatic Mutation Haplotyping In Cancer, Steven Mason Foltz

Arts & Sciences Electronic Theses and Dissertations

Cancer is a disease caused by changes to the genome and dysregulation of gene expression. Among many types of mutations, including point mutations, small insertions and deletions, large scale structural variants, and copy number changes, gene fusions are another category of genomic and transcriptomic alteration that can lead to cancer and which can serve as therapeutic targets. We studied gene fusion events using data from The Cancer Genome Atlas, including over 9,000 patients from 33 cancer types, finding patterns of gene fusion events and dysregulation of gene expression within and across cancer types. With data from the CoMMpass study (Multiple …


Performance Analysis Of Three Bioinformatic Variant Callers Using A Somatic Reference Standard, Binu Porath, Binu Porath May 2020

Performance Analysis Of Three Bioinformatic Variant Callers Using A Somatic Reference Standard, Binu Porath, Binu Porath

Research Days

No abstract provided.


A Genomic Analysis Of Bobcat Populations In North America With A Comparison To The Canada Lynx: An Assessment Of Local Adaptation To Unique Ecoregions And Phylogeography, Jennifer C. Broderick May 2020

A Genomic Analysis Of Bobcat Populations In North America With A Comparison To The Canada Lynx: An Assessment Of Local Adaptation To Unique Ecoregions And Phylogeography, Jennifer C. Broderick

Electronic Theses and Dissertations

Bobcats (Lynx rufus) are an ecologically and genetically diverse species with a large contiguous range throughout North America. The species not only has a wide array of phenotypic variation compared to other mammals, but shows marked adaptability across ecozones with differing ecological influences. It is these various selective pressures in distinctive parts of the continent that have likely led to localized adaptations within the bobcat metapopulations. The species is also marked by its ability to maintain connectivity and populations in anthropogenically developed areas, an advantage it has over other felids, including its close relative the Canada lynx ( …


Comparative Analysis Of Metabolic Pathways Of Bacteria Used In Fermented Food, Keanu Hoang, Kiran Bastola May 2020

Comparative Analysis Of Metabolic Pathways Of Bacteria Used In Fermented Food, Keanu Hoang, Kiran Bastola

Theses/Capstones/Creative Projects

This study presents a novel methodology for analyzing metabolic pathways. Utilizing KEGG REST API through a Biopython package and file parser, data about whether or not a bacteria has an enzyme or not was extracted. The results found that differences in metabolic pathway enrichment values follow along the lines of genera and pathway type. In particular, bacteria found in food spoilage and commercial nitrogen fixing products had high values of enrichment.


Modeling Hybrid Novel Traits: A Case Study In Complex Petal Pigment Patterning In Hybrid Mimulus, Xingyu Zheng May 2020

Modeling Hybrid Novel Traits: A Case Study In Complex Petal Pigment Patterning In Hybrid Mimulus, Xingyu Zheng

Undergraduate Honors Theses

Hybridization between species, by introducing dramatic trait variation into the population and creating viable, transgressive offsprings with novel phenotypes, can have huge evolutionary implications. Some hybrid traits have been studied in the classical genetics or population genetics context, but most complex traits are determined by multiple causes, e.g. the number of loci involved, the rewiring of the genetic circuitries, and the changes in gene expression pattern. Using the hybrid monkeyflower petal pigment patterning as an example, we present a case study to investigate complex hybrid traits in a systematic manner that includes empirical data analysis and quantitative mathematical modeling of …


Evolutionary Inference From Admixed Genomes: Implications Of Hybridization For Biodiversity Dynamics And Conservation, Tyler Chafin May 2020

Evolutionary Inference From Admixed Genomes: Implications Of Hybridization For Biodiversity Dynamics And Conservation, Tyler Chafin

Graduate Theses and Dissertations

Hybridization as a macroevolutionary mechanism has been historically underappreciated among vertebrate biologists. Yet, the advent and subsequent proliferation of next-generation sequencing methods has increasingly shown hybridization to be a pervasive agent influencing evolution in many branches of the Tree of Life (to include ancestral hominids). Despite this, the dynamics of hybridization with regards to speciation and extinction remain poorly understood. To this end, I here examine the role of hybridization in the context of historical divergence and contemporary decline of several threatened and endangered North American taxa, with the goal to illuminate implications of hybridization for promoting—or impeding—population persistence in …


Environmental Dna Metabarcoding Detects Mammal Use Of Stock Tanks And Natural Springs On The Prescott National Forest, K. A. Smith, Berenice Carreras Mendiolea, C. E. Benson, F. B. Anaya Apr 2020

Environmental Dna Metabarcoding Detects Mammal Use Of Stock Tanks And Natural Springs On The Prescott National Forest, K. A. Smith, Berenice Carreras Mendiolea, C. E. Benson, F. B. Anaya

Discovery Day - Prescott

Recent research has shown the viability of environmental DNA (eDNA) analysis as a tool for identifying species presence in aquatic ecosystems. However, there is limited research indicating the feasibility of eDNA as a survey tool in water sources such as springs and stock tanks. Because of the scarcity of water in the high desert landscape of Arizona, natural springs and stock tanks are an important water resource. In this study, we examined whether 16S rRNA metabarcoding could detect mammal eDNA in springs and stock tanks in the Prescott National Forest. During summer 2019, we collected and analyzed water samples at …


Using Active Learning To Build A Foundation For Bioinformatics Training., Stacey E. Wahl Ph.D., Amy L. Olex Ms Mar 2020

Using Active Learning To Build A Foundation For Bioinformatics Training., Stacey E. Wahl Ph.D., Amy L. Olex Ms

Transforming Libraries for Graduate Students

As Health Sciences Libraries evolve, the support they offer graduate students has evolved to incorporate many aspects of the research life cycle. At Tompkins-McCaw Library for the Health Sciences, we have partnered with the Wright Center for Clinical and Translational Research to offer training workshops for graduate students who are interested in using bioinformatics to plan, analyze, or execute scientific experiments. We offer two series: 1) an 8-week, 1-hour per week seminar series providing a general overview of available techniques and 2) a week-long intensive, two hours per session, series on utilizing free databases from the National Center for Biotechnology …


Molecular Differentiation Of Astragalus Species And Varieties From The Western United States: The Chloroplast Dna Bridge Between Evolution And Molecular Systematics, Marwa Neyaz, Daniel Cook, Rebecca Creamer Mar 2020

Molecular Differentiation Of Astragalus Species And Varieties From The Western United States: The Chloroplast Dna Bridge Between Evolution And Molecular Systematics, Marwa Neyaz, Daniel Cook, Rebecca Creamer

Poisonous Plant Research (PPR)

Locoweeds are the most widespread poisonous plant problem in the world and have been reported in the Western United States since the 1800s, causing tremendous losses in livestock. Consumption of locoweeds by grazing animals stimulates the neurological disease, locoism, characterized by weight loss, ataxia, and lack of muscular coordination. The name locoweed is used for Astragalus and Oxytropis species known to contain swainsonine, the toxic principle produced by the plant endophytic fungus Undifilum. Astragalus includes 2,500-3,000 species and many varieties that have almost identical morphological characteristics that overlap among species, leading to improper identification. Therefore, the aim of this study …


Mushroom Body-Specific Gene Regulation By The Swi/Snf Chromatin Remodeling Complex, Kevin Cj Nixon Feb 2020

Mushroom Body-Specific Gene Regulation By The Swi/Snf Chromatin Remodeling Complex, Kevin Cj Nixon

Electronic Thesis and Dissertation Repository

Over the lifetime of an organism, neurons must establish, remodel, and maintain precise connections in order to form neural circuits that are required for proper nervous system functioning. Disruptions in these processes can lead to neurodevelopmental disorders such as intellectual disability (ID) and autism spectrum disorder. Mutations in genes encoding subunits of the SWI/SNF chromatin remodeling complex have been implicated in ID, yet the role of this complex in neurons is poorly understood. In this project, I established cell-type specific methods to examine the effect of SWI/SNF subunit knockdowns on gene transcription and chromatin structure in the memory-forming neurons of …


On The Distribution Of Genetic Variation In Ecological Communities, Isaac Overcast Feb 2020

On The Distribution Of Genetic Variation In Ecological Communities, Isaac Overcast

Dissertations, Theses, and Capstone Projects

Biodiversity in ecological communities is structured hierarchically across spatial and temporal scales. Many open questions remain as to how this structure accumulates. For example, what are the relative contributions of dispersal versus in situ speciation? Or, how important are stochastic drift versus deterministic processes? Up to this point, these questions have been investigated by isolated disciplines (e.g. macroecology, comparative phylogeography, macroevolution) using tools and data that tend to focus on only one axis of community scale data (e.g. phylogenies, relative abundances, and/or trait information). Yet we know that there are feedbacks among processes that respond on short, medium, and long …


Pseudomonas And Bacillus Soil Isolates Produce Antibiotics, Chelsea Brandt, Dr. Lori Scott Jan 2020

Pseudomonas And Bacillus Soil Isolates Produce Antibiotics, Chelsea Brandt, Dr. Lori Scott

Identifying and Characterizing Novel Antibiotic Producing Microbes From the Soil

The recent emergence of antibiotic resistance bacterial strains presents a significant challenge and threat to human healthcare. While new methods of treatment such as bacteriophage therapy and combinations of existing antibiotics are being researched, the human population is in dire need of new antibiotics to replace those that are ineffective. This research addresses this need by identifying antibiotic producing bacteria in a soil sample from Davenport, IA. This project is a collaboration with the Tiny Earth Project Initiative (TEPI), which is a global network of educators and students focused on studentsourcing antibiotic discovery from soil. Microbiology lab techniques and 16S …


Using Cuda To Enhance Data Processing Of Variant Call Format Files For Statistical Genetic Analysis, Heather Mckinnon Jan 2020

Using Cuda To Enhance Data Processing Of Variant Call Format Files For Statistical Genetic Analysis, Heather Mckinnon

All Graduate Projects

Utilizing the power of GPU parallel processing with CUDA can speed up the processing of Variant Call Format (VCF) files and statistical analysis of genomic data. A software package designed toward this purpose would be beneficial to genetic researchers by saving them time which they could spend on other aspects of their research. A data set containing genetics from a study of trichome production in Mimulus guttatus, or yellow monkey flower, was used to develop a package to test the effectiveness of GPU parallel processing versus serial executions. After a serial version of the code was generated and benchmarked, OpenACC …


Interspecific Gene Flow Potentiates Adaptive Evolution In A Hybrid Zone Formed Between Pinus Strobiformis And Pinus Flexilis, Mitra Menon Jan 2020

Interspecific Gene Flow Potentiates Adaptive Evolution In A Hybrid Zone Formed Between Pinus Strobiformis And Pinus Flexilis, Mitra Menon

Theses and Dissertations

Species range margins are often characterised by high degrees of habitat fragmentation resulting in low genetic diversity and higher gene flow from populations at the core of the species range. Interspecific gene flow from a closely related species with abutting range margins can increase standing genetic diversity and generate novel allelic combinations thereby alleviating limits to adaptive evolution in range margin populations. Hybridization driven interspecific gene flow has played a key role in the demographic history of several conifer due to their life history characteristics such as weak crossability barriers and long generation times. Nevertheless, demonstrating whether introgression is adaptive …


Ndrg1 And Myelin-Related Disease: Alcoholism And Chemotherapy-Induced Neuropathy, Guy Harris Jan 2020

Ndrg1 And Myelin-Related Disease: Alcoholism And Chemotherapy-Induced Neuropathy, Guy Harris

Theses and Dissertations

Alcohol use disorder (AUD) is a prevalent neuropsychiatric disease with profound health, social, and economic consequences. With an estimated 50% heritability, identifying genes that engender risk and contribute to the underlying neurobiological mechanisms represents an important first step in developing effective treatments. Gene expression studies are an important source of candidate genes for studying AUD, providing windows into the molecular machinery engaged by the brain in response to ethanol. Our laboratory has implicated N-myc down-regulated gene 1 (Ndrg1) as a potential candidate gene that modulates ethanol-induced changes in myelin-related gene expression and acute sensitivity to ethanol. Analysis of …


Biological Pathways Associated With Wild And Domestic Animals, Cayla Steemer, Laura Carson, Jim Hu, Gloria Regisford Jan 2020

Biological Pathways Associated With Wild And Domestic Animals, Cayla Steemer, Laura Carson, Jim Hu, Gloria Regisford

Pursue: Undergraduate Research Journal

Background: Zoonotic diseases are problematic, in that, they impact both wild and domestic animals alike. Thus, there is a need to investigate the genomes of wild and domestic. Gene ontology (GO) is a major bioinformatics initiative, whereby descriptions of gene products across the database are developed and unified to describe all species. This process is performed by biocurators, who gather, annotate, and validate information on the databases, consequently affording easy access to accurate and updated data. In this study, we investigated the biocuration of two biological processes, DNA integration, which is used for DNA coding, and the tricarboxylic acid (TCA) …


Plasticity And The Impact Of Increasing Temperature On A Tropical Ectotherm, Adam A. Rosso Jan 2020

Plasticity And The Impact Of Increasing Temperature On A Tropical Ectotherm, Adam A. Rosso

Electronic Theses and Dissertations

Organisms may respond to climate change through behavior, genetic adaptation, and/or phenotypic plasticity. Tropical ectotherms are thought to be especially vulnerable to climate change because most have a narrow range of thermal tolerance while living close to their upper thermal tolerance limits. Additionally, many tropical species live in closed-canopy forests, which provide homogenous thermal landscapes that prevent behavioral compensation for stressfully warm temperatures. Finally, tropical ectotherms are thought to have decreased capacity for phenotypic plasticity because they have evolved in thermally stable environments. We tested gene expression patterns and phenotypic plasticity in the Panamanian slender anole by a) measuring changes …


Spatial And Temporal Genetic Structure Of Winter-Run Steelhead (Oncorhynchus Mykiss) Returning To The Mad River, California, Steven R. Fong Jan 2020

Spatial And Temporal Genetic Structure Of Winter-Run Steelhead (Oncorhynchus Mykiss) Returning To The Mad River, California, Steven R. Fong

Cal Poly Humboldt theses and projects

Distinct populations of steelhead in the wild are in decline. The propagation of steelhead in hatcheries has been used to boost population numbers for recreational fisheries and for use in conservation. However, hatchery breeding practices of steelhead can result in changes in genetic structure. I investigated the genetic structure of winter-run steelhead (Oncorhynchus mykiss) returning to the Mad River, California, where a hatchery has been used enhance production for recreational fisheries since 1971. Genetic variability in Mad River steelhead was evaluated using 96 single nucleotide polymorphisms (SNPs) among 4203 individuals, including the Mad River and nearby locations, and …


The Transcriptome Response Of The White-Rot Fungus Trametes Versicolor To Wild-Type And Lignin-Modified Hybrid Poplar, Anbarah Alzabaidi Jan 2020

The Transcriptome Response Of The White-Rot Fungus Trametes Versicolor To Wild-Type And Lignin-Modified Hybrid Poplar, Anbarah Alzabaidi

Masters Theses

Plant biomass is a renewable and sustainable feedstock for biofuel production that can reduce societal dependence on fossil fuels. However, the production of liquid biofuels from the non-starch (i.e. lignocellulosic) material through fermentation technology is limited due to the complexity of the cell wall structure. This necessitates the use of chemical, thermal, and/or mechanical pretreatment technologies, which adds significant capital, operational, and environmental costs. Biological pretreatment strategies have the potential to mitigate these expenses by harnessing the innate ability of specialized bacteria and fungi to deconstruct lignocellulose. White-rot fungi (e.g. Trametes versicolor) have been shown to be …


Identification And Characterization Of Epigenetic Regulators In The Yellow Fever Mosquito, Aedes Aegypti, Shankar C. R. R. Chereddy Jan 2020

Identification And Characterization Of Epigenetic Regulators In The Yellow Fever Mosquito, Aedes Aegypti, Shankar C. R. R. Chereddy

Theses and Dissertations--Entomology

Due to the increased interactions between the sylvatic and urban cycles of Aedes aegypti life cycle, there has been an increase in the incidence of vector-borne diseases. The risk involved through the transmission of viruses like Dengue, Zika, Yellow-fever and Chikungunya calls for an effective control method for the deadly vector. The possibility of pesticide resistance and the costs involved in producing new pesticides have pushed Sterile Insect Technique (SIT) into prominence. However, current methods used to sterize insects for SIT are not highly efficient. This study tackles current issues in mosquito control by identifying and characterizing epigenetic regulators that …


The Host Gatekeeper: Using The Flagellar Pathway To Understand Symbiont Host Adaptation, Adam R. Pollio Jan 2020

The Host Gatekeeper: Using The Flagellar Pathway To Understand Symbiont Host Adaptation, Adam R. Pollio

Graduate Theses, Dissertations, and Problem Reports

The acquisition of microbial partners is a strategy used by a diverse group of arthropods to overcome ecological barriers that might normally make certain niches uninhabitable. The unique phylogenetic opportunities attainable from the natural experiment of the Sodalis-allied clade allow for better understanding of how molecular structures evolve through time. Here, we focus on the evolution of the flagellar synthesis pathway, due to its complexity and ability to diverge in response to ecological pressures. We used this molecular pathway and natural experiment to show that normal evolutionary outcomes associated with symbiosis (i.e., genome reduction) do not explain the predicted conservation …


Wading In A Glycan Sea: Investigations Of, And Developing Bioinformatic Tools For, Carbohydrate Active Enzymes And Gene Clusters In Prokaryotes, Catherine Ann Ausland Jan 2020

Wading In A Glycan Sea: Investigations Of, And Developing Bioinformatic Tools For, Carbohydrate Active Enzymes And Gene Clusters In Prokaryotes, Catherine Ann Ausland

Graduate Research Theses & Dissertations

Carbohydrates are immensely important biomolecule found in all organisms on earth, serving roles in structure, energy storage, protection and/or cell signaling. Carbohydrates are composed of an immense diversity of linkages, monosaccharide moieties and substitutions making them likely the most diverse biomolecule on earth, as well as the most abundant, being synthesized by plants and algae. Carbohydrates are synthesized and degraded by all organisms on earth through the expression of carbohydrate active enzymes, or ‘CAZymes’. In bacteria, CAZymes have been characterized to cluster in with transporters, regulators and/or other genes in so-called ‘polysaccharide utilization loci’, or PULs, to degrade carbohydrate substrates …


Investigating The Role Of Centromeric Repeats In Female Meiotic Drive, Jocelyn Crawford Jan 2020

Investigating The Role Of Centromeric Repeats In Female Meiotic Drive, Jocelyn Crawford

CMC Senior Theses

Female meiotic drive is an unequal transmission of alleles that arises through the competition of paired chromosomes for inclusion in the egg, resulting in an increase in frequency of the driven alleles regardless of their effect on fitness of the individual. In Mimulus guttatus (monkeyflower) second filial generations, driven alleles display transmission advantages resulting in the virtual elimination of recessive homozygotes, while the equivalent lines lacking drive elements conform to traditional Mendelian segregation population ratios. Centromeres have been identified as mechanistic drive elements due to their role in chromosomal segregation during female meiosis, with Mimulus providing the best documented case …


Machine Learning Methods For The Analysis Of Metagenomes, Vito Adrian Cantu Alessio Robles Jan 2020

Machine Learning Methods For The Analysis Of Metagenomes, Vito Adrian Cantu Alessio Robles

CGU Theses & Dissertations

As of October 2020, there are 18.6 × 1015 DNA base pairs publicly available in the Sequence Read Archive and this number is growing at an exponential rate. As DNA sequencing prices continue to drop, many research groups around the world have incorporated high throughput sequencing in their research, giving us access to sequences from many distinct ecosystems. This has revolutionized the field of metagenomics, which aims to fully characterize all organisms and their interactions in a particular system. Nevertheless, the plethora of available data has made its analysis difficult as traditional techniques such as genome assembly or sequence alignment …