Open Access. Powered by Scholars. Published by Universities.®

Life Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Genome assembly

Discipline
Institution
Publication Year
Publication
Publication Type

Articles 1 - 27 of 27

Full-Text Articles in Life Sciences

Forage Genomics Accelerate The Germplasm Resource Innovation, Fan Wu, Qi Yan, Pan Xu, Zhen Duan, Jiyu Zhang Feb 2024

Forage Genomics Accelerate The Germplasm Resource Innovation, Fan Wu, Qi Yan, Pan Xu, Zhen Duan, Jiyu Zhang

IGC Proceedings (1997-2023)

To achieve sustainability and food security we need expand the germplasm base and access novel genetic diversity to accelerate breeding. For developing new forage cultivars, the availability of a high-quality genome facilitates accurate characterization of new germplasm, and an understanding of the genetics underlying important traits. Here, we sequenced and assembled three high-quality chromosome-level forage genomes. The contig-level assembly of Cleistogenes songorica (2n = 4x = 40) comprised 540.12 Mb of the genome, with a contig N50 of 21.28 Mb. Complete assemblies of all telomeres, and of ten chromosomes were derived. The chromosome-scale genome size of elephant grass (2n = …


Molecular Approaches In The Analysis Of Red Clover Rhizobium Symbiosis, Randy D. Dinkins, L. Koch, M. L. Sullivan, H. Zhu Jan 2024

Molecular Approaches In The Analysis Of Red Clover Rhizobium Symbiosis, Randy D. Dinkins, L. Koch, M. L. Sullivan, H. Zhu

IGC Proceedings (1997-2023)

Red clover (Trifolium pratense L.) is an important forage and pasture legume grown throughout temperate regions. Because it can form a symbiotic relationship with rhizobium, it is able to fix atmospheric nitrogen. Due to the high cost of nitrogen fertilizers, pasture legumes have been increasingly important in forage production settings. Red clover has not been a model legume primarily due to selfincompatibility and the associated high level of genomic heterozygosity, therefore it has not been a significant contributor in molecular or genetic studies and basic information on red clover legume/rhizobium symbiosis is lacking. Using recently annotated genomic resources, RNA-seq …


A Draft Diabrotica Virgifera Virgifera Genome: Insights Into Control And Host Plant Adaption By A Major Maize Pest Insect, Brad S. Coates, Kimberly K. O. Walden, Dimpal Lata, Neetha Nanoth Vellichirammal, Robert F. Mitchell, Martin N. Andersson, Rachel Mckay, Marcé D. Lorenzen, Nathaniel Grubbs, Yu‑Hui Wang, Jinlong Han, Jing Li Xuan, Peter Willadsen, Huichun Wang, B. Wade French, Raman Bansal, Sammy Sedky, Dariane Souza, Dakota Bunn, Lance J. Meinke, Nicholas J. Miller, Blair D. Siegfried11,, Thomas W. Sappington, Hugh M. Robertson Jan 2023

A Draft Diabrotica Virgifera Virgifera Genome: Insights Into Control And Host Plant Adaption By A Major Maize Pest Insect, Brad S. Coates, Kimberly K. O. Walden, Dimpal Lata, Neetha Nanoth Vellichirammal, Robert F. Mitchell, Martin N. Andersson, Rachel Mckay, Marcé D. Lorenzen, Nathaniel Grubbs, Yu‑Hui Wang, Jinlong Han, Jing Li Xuan, Peter Willadsen, Huichun Wang, B. Wade French, Raman Bansal, Sammy Sedky, Dariane Souza, Dakota Bunn, Lance J. Meinke, Nicholas J. Miller, Blair D. Siegfried11,, Thomas W. Sappington, Hugh M. Robertson

Department of Entomology: Faculty Publications

Background Adaptations by arthropod pests to host plant defenses of crops determine their impacts on agricultural production. The larval host range of western corn rootworm, Diabrotica virgifera virgifera (Coleoptera: Chrysomelidae), is restricted to maize and a few grasses. Resistance of D. v. virgifera to crop rotation practices and multiple insecticides contributes to its status as the most damaging pest of cultivated maize in North America and Europe. The extent to which adaptations by this pest contributes to host plant specialization remains unknown.

Results A 2.42 Gb draft D. v. virgifera genome, Dvir_v2.0, was assembled from short shotgun reads and scaffolded …


A Standardized Pipeline For Isolation And Assembly Of Genomes From Symbiotic Bacteria In Whole Louse Genomic Sequence Data., Mohammad Mikail I. Bala Jan 2023

A Standardized Pipeline For Isolation And Assembly Of Genomes From Symbiotic Bacteria In Whole Louse Genomic Sequence Data., Mohammad Mikail I. Bala

Theses and Dissertations

Many insects are known to harbour intracellular and heritable bacteria (endosymbionts), which provide their hosts with adaptive traits. Whole insect gDNA shotgun sequencing projects often sequence the genome of endosymbiont, in addition to the insect’s genome. There are approximately 600 whole genome shotgun libraries from insects available on the public repository (NCBI), which can be mined to obtain endosymbiont genomes. The assembly and annotation of endosymbiont genomes can contribute towards the exploration of their role as obligate symbiotic partners. However, de novo assembly of an endosymbiont genome, continues to be challenging, when the host and/or enteric bacterial gDNA is present …


Large Genomes Assembly Using Mapreduce Framework, Yuehua Zhang Dec 2022

Large Genomes Assembly Using Mapreduce Framework, Yuehua Zhang

All Dissertations

Knowing the genome sequence of an organism is the essential step toward understanding its genomic and genetic characteristics. Currently, whole genome shotgun (WGS) sequencing is the most widely used genome sequencing technique to determine the entire DNA sequence of an organism. Recent advances in next-generation sequencing (NGS) techniques have enabled biologists to generate large DNA sequences in a high-throughput and low-cost way. However, the assembly of NGS reads faces significant challenges due to short reads and an enormously high volume of data. Despite recent progress in genome assembly, current NGS assemblers cannot generate high-quality results or efficiently handle large genomes …


The Genome Of Orychophragmus Violaceus Provides Genomic Insights Into The Evolution Of Brassicaceaepolyploidizationandits Distinct Traits, Kang Zhang, Yinqing Yang, Xin Zhang, Lingkui Zhang, Yu Fu, Zhongwei Guo, Shumin Chen, Jian Wu, James C. Schnable, Keke Yi, Xiaowu Wang, Feng Cheng Nov 2022

The Genome Of Orychophragmus Violaceus Provides Genomic Insights Into The Evolution Of Brassicaceaepolyploidizationandits Distinct Traits, Kang Zhang, Yinqing Yang, Xin Zhang, Lingkui Zhang, Yu Fu, Zhongwei Guo, Shumin Chen, Jian Wu, James C. Schnable, Keke Yi, Xiaowu Wang, Feng Cheng

Department of Agronomy and Horticulture: Faculty Publications

Orychophragmus violaceus, referred to as ‘‘eryuelan’’ (February orchid) in China, is an early-flowering ornamental plant. The high oil content and abundance of unsaturated fatty acids in O. violaceus seeds make it a potential high-quality oilseed crop. Here, we generated a whole-genome assembly for O. violaceus using Nanopore and Hi-C sequencing technologies. The assembled genome of O. violaceus was ~1.3 Gb in size, with 12 pairs of chromosomes. Through investigation of ancestral genome evolution, we determined that the genome of O. violaceus experienced a tetraploidization event from a diploid progenitor with the translocated proto-Calepineae karyotype. Comparisons between the reconstructed subgenomes …


Current Status And Trends In Forest Genomics, Dulal Borthakur, Victor Busov, Xuan Hieu Cao, Qingzhang Du, Oliver Gailing, Fikret Isik, Hairong Wei, Et. Al. Aug 2022

Current Status And Trends In Forest Genomics, Dulal Borthakur, Victor Busov, Xuan Hieu Cao, Qingzhang Du, Oliver Gailing, Fikret Isik, Hairong Wei, Et. Al.

Michigan Tech Publications

Forests are not only the most predominant of the Earth's terrestrial ecosystems, but are also the core supply for essential products for human use. However, global climate change and ongoing population explosion severely threatens the health of the forest ecosystem and aggravtes the deforestation and forest degradation. Forest genomics has great potential of increasing forest productivity and adaptation to the changing climate. In the last two decades, the field of forest genomics has advanced quickly owing to the advent of multiple high-throughput sequencing technologies, single cell RNA-seq, clustered regularly interspaced short palindromic repeats (CRISPR)-mediated genome editing, and spatial transcriptomes, as …


High-Quality Ice Plant Reference Genome Analysis Provides Insights Into Genome Evolution And Allows Exploration Of Genes Involved In The Transition From C3 To Cam Pathways, Shaoqin Shen, Nan Li, Yujie Wang, Rong Zhou, Pengchuan Sun, Hao Lin, Wei Chen, Tong Yu, Zhuo Liu, Zhiyuan Wang, Xiao Tan, Changping Zhu, Shuyan Feng, Yu Zhang, Xiaoming Song Jul 2022

High-Quality Ice Plant Reference Genome Analysis Provides Insights Into Genome Evolution And Allows Exploration Of Genes Involved In The Transition From C3 To Cam Pathways, Shaoqin Shen, Nan Li, Yujie Wang, Rong Zhou, Pengchuan Sun, Hao Lin, Wei Chen, Tong Yu, Zhuo Liu, Zhiyuan Wang, Xiao Tan, Changping Zhu, Shuyan Feng, Yu Zhang, Xiaoming Song

Department of Food Science and Technology: Faculty Publications

Ice plant (Mesembryanthemum crystallinum), a member of the Aizoaceae family, is a typical halophyte crop and a model plant for studying the mechanism of transition from C3 photosynthesis to crassulacean acid metabolism (CAM). Here, we report a high-quality chromosome-level ice plant genome sequence. This 98.05% genome sequence is anchored to nine chromosomes, with a total length of 377.97 Mb and an N50 scaffold of 40.45 Mb. Almost half of the genome (48.04%) is composed of repetitive sequences, and 24 234 genes have been annotated. Subsequent to the ancient whole-genome triplication (WGT) that occurred in eudicots, there has been …


Chromosome-Level Thlaspi Arvense Genome Provides New Tools For Translational Research And For A Newly Domesticated Cash Cover Crop Of The Cooler Climates, Adam Nunn, Isaac Rodríguez-Arévalo, Zenith Tandukar, Katherine Anna Frels, Adrián Contreras-Garrido, Pablo Carbonell-Bejerano, Panpan Zhang, Daniela Ramos Cruz, Katharina Jandrasits, Christa Lanz, Anthony Brusa, Marie Mirouze, Kevin Dorn, David W. Galbraith, Brice A. Jarvis, John C. Sedbrook, Donald L. Wyse, Christian Otto, David Langenberger, Peter F. Stadler, Detlef Weigel, M. David Marks, James A. Anderson, Claude Becker, Ratan Chopra May 2022

Chromosome-Level Thlaspi Arvense Genome Provides New Tools For Translational Research And For A Newly Domesticated Cash Cover Crop Of The Cooler Climates, Adam Nunn, Isaac Rodríguez-Arévalo, Zenith Tandukar, Katherine Anna Frels, Adrián Contreras-Garrido, Pablo Carbonell-Bejerano, Panpan Zhang, Daniela Ramos Cruz, Katharina Jandrasits, Christa Lanz, Anthony Brusa, Marie Mirouze, Kevin Dorn, David W. Galbraith, Brice A. Jarvis, John C. Sedbrook, Donald L. Wyse, Christian Otto, David Langenberger, Peter F. Stadler, Detlef Weigel, M. David Marks, James A. Anderson, Claude Becker, Ratan Chopra

Department of Agronomy and Horticulture: Faculty Publications

Thlaspi arvense (field pennycress) is being domesticated as a winter annual oilseed crop capable of improving ecosystems and intensifying agricultural productivity without increasing land use. It is a selfing diploid with a short life cycle and is amenable to genetic manipulations, making it an accessible field-based model species for genetics and epigenetics. The availability of a high-quality reference genome is vital for understanding pennycress physiology and for clarifying its evolutionary history within the Brassicaceae. Here, we present a chromosome-level genome assembly of var. MN106-Ref with improved gene annotation and use it to investigate gene structure differences between two accessions (MN108 …


Applications Of Nanopore Dna Sequencing For Improved Genome Assembly, Daniel Giguere Mar 2022

Applications Of Nanopore Dna Sequencing For Improved Genome Assembly, Daniel Giguere

Electronic Thesis and Dissertation Repository

An organism's genome is the ultimate determinant of its functional potential. Understanding genomes is therefore essential to understand function, and a foundational knowledge of a genome is required transfer functions to and from microorganisms of interest. Sequencing DNA using nanopores is a recent advance that resolves limitations of previous technologies, enabling an improved understanding of genomes. For this thesis, I improved our understanding of microbial genomes by developing novel approaches to analyze long read sequencing data, setting the foundation for future synthetic biology work.

Long sequencing reads have enabled routine assembly of complete bacterial genomes by directly sequencing DNA extracted …


Highly Contiguous Assemblies Of 101 Drosophilid Genomes, Bernard Y. Kim, Jeremy R. Wang, Danny E. Miller, Olga Barmina, Emily Delaney, Ammon Thompson, Aaron A. Comeault, David Peede, Emmanuel R. R. D'Agostino, Julianne Pelaez, Jessica M. Aguilar, Diler Haji, Teruyuki Matsunaga, Ellie E. Armstrong, Molly Zych, Yoshitaka Ogawa, Marina Stamenković-Radak, Mihailo Jelić, Marija Savić Veselinović, Marija Tanasković, Jeremy S. Davis Jul 2021

Highly Contiguous Assemblies Of 101 Drosophilid Genomes, Bernard Y. Kim, Jeremy R. Wang, Danny E. Miller, Olga Barmina, Emily Delaney, Ammon Thompson, Aaron A. Comeault, David Peede, Emmanuel R. R. D'Agostino, Julianne Pelaez, Jessica M. Aguilar, Diler Haji, Teruyuki Matsunaga, Ellie E. Armstrong, Molly Zych, Yoshitaka Ogawa, Marina Stamenković-Radak, Mihailo Jelić, Marija Savić Veselinović, Marija Tanasković, Jeremy S. Davis

Biology Faculty Publications

Over 100 years of studies in Drosophila melanogaster and related species in the genus Drosophila have facilitated key discoveries in genetics, genomics, and evolution. While high-quality genome assemblies exist for several species in this group, they only encompass a small fraction of the genus. Recent advances in long-read sequencing allow high-quality genome assemblies for tens or even hundreds of species to be efficiently generated. Here, we utilize Oxford Nanopore sequencing to build an open community resource of genome assemblies for 101 lines of 93 drosophilid species encompassing 14 species groups and 35 sub-groups. The genomes are highly contiguous and complete, …


Highly Contiguous Assemblies Of 101 Drosophilid Genomes, Bernard Y. Kim, Jeremy R. Wang, Danny E. Miller, Olga Barmina, Emily Delaney, Ammon Thompson, Aaron A. Comeault, David Peede, Emmanuel R.R. D’Agostino, Julianne Pelaez, Jessica M. Aguilar, Diler Haji, Teruyuki Matsunaga, Ellie E. Armstrong, Molly Zych, Yoshitaka Ogawa, Marina Stamenković-Radak, Mihailo Jelić, Marija Savić Veselinović, Marija Tanasković, Pavle Erić, Jian Jun Gao, Takehiro K. Katoh, Masanori J. Toda, Hideaki Watabe, Masayoshi Watada, Jeremy S. Davis, Leonie C. Moyle, Giulia Manoli, Enrico Bertolini, Vladimír Košťál, R. Scott Hawley, Aya Takahashi, Corbin D. Jones, Donald K. Price, Noah Whiteman, Artyom Kopp, Daniel R. Matute, Dmitri A. Petrov Jul 2021

Highly Contiguous Assemblies Of 101 Drosophilid Genomes, Bernard Y. Kim, Jeremy R. Wang, Danny E. Miller, Olga Barmina, Emily Delaney, Ammon Thompson, Aaron A. Comeault, David Peede, Emmanuel R.R. D’Agostino, Julianne Pelaez, Jessica M. Aguilar, Diler Haji, Teruyuki Matsunaga, Ellie E. Armstrong, Molly Zych, Yoshitaka Ogawa, Marina Stamenković-Radak, Mihailo Jelić, Marija Savić Veselinović, Marija Tanasković, Pavle Erić, Jian Jun Gao, Takehiro K. Katoh, Masanori J. Toda, Hideaki Watabe, Masayoshi Watada, Jeremy S. Davis, Leonie C. Moyle, Giulia Manoli, Enrico Bertolini, Vladimír Košťál, R. Scott Hawley, Aya Takahashi, Corbin D. Jones, Donald K. Price, Noah Whiteman, Artyom Kopp, Daniel R. Matute, Dmitri A. Petrov

Life Sciences Faculty Research

Over 100 years of studies in Drosophila melanogaster and related species in the genus Drosophila have facilitated key discoveries in genetics, genomics, and evolution. While high-quality genome assemblies exist for several species in this group, they only encompass a small fraction of the genus. Recent advances in long-read sequencing allow high-quality genome assemblies for tens or even hundreds of species to be efficiently generated. Here, we utilize Oxford Nanopore sequencing to build an open community resource of genome assemblies for 101 lines of 93 drosophilid species encompassing 14 species groups and 35 sub-groups. The genomes are highly contiguous and complete, …


The Usda-Ars Ag100pest Initiative: High-Quality Genome Assemblies For Agricultural Pest Arthropod Research, Anna K. Childers, Scott M. Geib, Sheina B. Sim, Monica F. Poelchau, Brad S. Coates, Tyler J. Simmonds, Erin D. Scully, Timothy P.L. Smith, Christopher P. Childers, Renee L. Corpuz, Kevin Hackett, Brian Scheffler Jul 2021

The Usda-Ars Ag100pest Initiative: High-Quality Genome Assemblies For Agricultural Pest Arthropod Research, Anna K. Childers, Scott M. Geib, Sheina B. Sim, Monica F. Poelchau, Brad S. Coates, Tyler J. Simmonds, Erin D. Scully, Timothy P.L. Smith, Christopher P. Childers, Renee L. Corpuz, Kevin Hackett, Brian Scheffler

Roman L. Hruska U.S. Meat Animal Research Center: Reports

The phylum Arthropoda includes species crucial for ecosystem stability, soil health, crop production, and others that present obstacles to crop and animal agriculture. The United States Department of Agriculture’s Agricultural Research Service initiated the Ag100Pest Initiative to generate reference genome assemblies of arthropods that are (or may become) pests to agricultural production and global food security. We describe the project goals, process, status, and future. The first three years of the project were focused on species selection, specimen collection, and the construction of lab and bioinformatics pipelines for the efficient production of assemblies at scale. Contig-level assemblies of 47 species …


Chromosome-Level Genome Assembly Of A Regenerable Maize Inbred Line A188, Guifang Lin, Cheng He, Jun Zheng, Dal Hoe Koo, Ha Le, Huakun Zheng, Hairong Wei, Et. Al. Jun 2021

Chromosome-Level Genome Assembly Of A Regenerable Maize Inbred Line A188, Guifang Lin, Cheng He, Jun Zheng, Dal Hoe Koo, Ha Le, Huakun Zheng, Hairong Wei, Et. Al.

Michigan Tech Publications

Background: The maize inbred line A188 is an attractive model for elucidation of gene function and improvement due to its high embryogenic capacity and many contrasting traits to the first maize reference genome, B73, and other elite lines. The lack of a genome assembly of A188 limits its use as a model for functional studies. Results: Here, we present a chromosome-level genome assembly of A188 using long reads and optical maps. Comparison of A188 with B73 using both whole-genome alignments and read depths from sequencing reads identify approximately 1.1 Gb of syntenic sequences as well as extensive structural variation, including …


Applications Of And Algorithms For Genome Assembly And Genomic Analyses With An Emphasis On Marine Teleosts, Brandon D. Pickett May 2021

Applications Of And Algorithms For Genome Assembly And Genomic Analyses With An Emphasis On Marine Teleosts, Brandon D. Pickett

Theses and Dissertations

The burgeoning frequency of genome sequencing in recent years is a testament to both the improvements in sequencing technologies and the utility of genomic analyses for biological discovery. The rapid proliferation in technological advancements and availability of complementary data types and techniques has obfuscated the optimal process of genome assembly and raised the barrier to entry to unprecedented levels. In this dissertation, we describe the genome assemblies performed for several marine teleosts and discuss the algorithms and applications required for genome assembly, including some of our specific contributions to the genome assembly and annotation space. In Chapter 1 and Chapter …


The Giant Axolotl Genome Uncovers The Evolution, Scaling, And Transcriptional Control Of Complex Gene Loci, Siegfried Schloissnig, Akane Kawaguchi, Sergej Nowoshilow, Francisco Falcon, Leo Otsuki, Pietro Tardivo, Nataliya Timoshevskaya, Melissa C. Keinath, Jeramiah J. Smith, S. Randal Voss, Elly M. Tanaka Apr 2021

The Giant Axolotl Genome Uncovers The Evolution, Scaling, And Transcriptional Control Of Complex Gene Loci, Siegfried Schloissnig, Akane Kawaguchi, Sergej Nowoshilow, Francisco Falcon, Leo Otsuki, Pietro Tardivo, Nataliya Timoshevskaya, Melissa C. Keinath, Jeramiah J. Smith, S. Randal Voss, Elly M. Tanaka

Biology Faculty Publications

Vertebrates harbor recognizably orthologous gene complements but vary 100-fold in genome size. How chromosomal organization scales with genome expansion is unclear, and how acute changes in gene regulation, as during axolotl limb regeneration, occur in the context of a vast genome has remained a riddle. Here, we describe the chromosome-scale assembly of the giant, 32 Gb axolotl genome. Hi-C contact data revealed the scaling properties of interphase and mitotic chromosome organization. Analysis of the assembly yielded understanding of the evolution of large, syntenic multigene clusters, including the Major Histocompatibility Complex (MHC) and the functional regulatory landscape of the Fibroblast Growth …


Nanopore Guided Regional Assembly, Eleni Adam, Desh Ranjan, Harold Riethman Apr 2021

Nanopore Guided Regional Assembly, Eleni Adam, Desh Ranjan, Harold Riethman

College of Sciences Posters

The telomeres are the “caps” of the chromosomes and their vital role is to protect them. Possible telomere dysfunction caused by telomere rearrangements can be fatal for the cell and result in age-related diseases, including cancer. The telomeres and subtelomeres are regions that are hard to investigate. The current technology cannot provide their complete sequence, instead the DNA is given in multiple pieces. Current methods of assembling the pieces of these regions are not accurate enough due to the region’s high variability and complex repeated patterns. We propose a hybrid assembly method, the NPGREAT, which utilizes two of the latest …


De Novo Whole Genome Assembly Of The Swede Midge (Contarinia Nasturtii), A Specialist Of Brassicaceae, Using Linked-Read Sequencing, Boyd A. Mori, Cathy Coutu, Yolanda H. Chen, Erin O. Campbell, Julian R. Dupuis, Martin A. Erlandson, Dwayne D. Hegedus Feb 2021

De Novo Whole Genome Assembly Of The Swede Midge (Contarinia Nasturtii), A Specialist Of Brassicaceae, Using Linked-Read Sequencing, Boyd A. Mori, Cathy Coutu, Yolanda H. Chen, Erin O. Campbell, Julian R. Dupuis, Martin A. Erlandson, Dwayne D. Hegedus

Entomology Faculty Publications

The swede midge, Contarinia nasturtii, is a cecidomyiid fly that feeds specifically on plants within the Brassicaceae. Plants in this family employ a glucosinolate-myrosinase defense system, which can be highly toxic to non-specialist feeders. Feeding by C. nasturtii larvae induces gall formation, which can cause substantial yield losses thus making it a significant agricultural pest. A lack of genomic resources, in particular a reference genome, has limited deciphering the mechanisms underlying glucosinolate tolerance in C. nasturtii, which is of particular importance for managing this species. Here, we present an annotated, scaffolded reference genome of C. nasturtii using linked-read …


Whole-Genome Assembly Of Atriplex Hortensis L. Using Oxfordnanopore Technology With Chromatin-Contact Mapping, Spencer Philip Hunt Jul 2019

Whole-Genome Assembly Of Atriplex Hortensis L. Using Oxfordnanopore Technology With Chromatin-Contact Mapping, Spencer Philip Hunt

Theses and Dissertations

Atriplex hortensis (2n = 2x = 18, 1C genome size ~1.1 gigabases), also known as garden orach, is a highly nutritious, broadleaf annual of the Amaranthaceae-Chenopodiaceae family that has spread from its native Eurasia to other temperate and subtropical environments worldwide. Atriplex is a highly complex and polyphyletic genus of generally halophytic and/or xerophytic plants, some of which have been used as food sources for humans and animals alike. Although there is some literature describing the taxonomy and ecology of orach, there is a lack of genetic and genomic data that would otherwise help elucidate the genetic variation, phylogenetic position, …


The Genome Of Cañahua: An Emerging Andean Super Grain, Hayley Jennifer Mangelson May 2019

The Genome Of Cañahua: An Emerging Andean Super Grain, Hayley Jennifer Mangelson

Theses and Dissertations

Chenopodium pallidicaule, known commonly as cañahua, is a semi-domesticated crop grown in high-altitude regions of the Andes. It is an A-genome diploid (2n = 2x = 18) relative of the allotetraploid (AABB) Chenopodium quinoa and shares many of its nutritional benefits. Both species contain a complete protein, a low glycemic index, and offer a wide variety of nutritionally important vitamins and minerals. Due to its minor crop status, few genomic resources for its improvement have been developed. Here we present a fully annotated, reference-quality assembly of cañahua. The reference assembly was developed using a combination of established techniques, including multiple …


Linked Read Technology For Assembling Large Complex And Polyploid Genomes, Alina Ott, James C. Schnable, Cheng-Ting Yeh, Linjiang Wu, Chao Liu, Heng-Cheng Hu, Clifton L. Dalgard, Soumik Sarkar, Patrick Schnable Jan 2018

Linked Read Technology For Assembling Large Complex And Polyploid Genomes, Alina Ott, James C. Schnable, Cheng-Ting Yeh, Linjiang Wu, Chao Liu, Heng-Cheng Hu, Clifton L. Dalgard, Soumik Sarkar, Patrick Schnable

Department of Agronomy and Horticulture: Faculty Publications

Background: Short read DNA sequencing technologies have revolutionized genome assembly by providing high accuracy and throughput data at low cost. But it remains challenging to assemble short read data, particularly for large, complex and polyploid genomes. The linked read strategy has the potential to enhance the value of short reads for genome assembly because all reads originating from a single long molecule of DNA share a common barcode. However, the majority of studies to date that have employed linked reads were focused on human haplotype phasing and genome assembly.

Results: Here we describe a de novo maize B73 genome assembly …


Exploring The Diversity Of Bacillus Whole Genome Sequencing Projects Using Peasant, The Prokaryotic Assembly And Annotation Tool, Jonathon Brenner, Laurynas Kalesinskas, Catherine Putonti Apr 2017

Exploring The Diversity Of Bacillus Whole Genome Sequencing Projects Using Peasant, The Prokaryotic Assembly And Annotation Tool, Jonathon Brenner, Laurynas Kalesinskas, Catherine Putonti

Biology: Faculty Publications and Other Works

The persistent decrease in cost and difficulty of whole genome sequencing of microbial organisms has led to a dramatic increase in the number of species and strains characterized from a wide variety of environments. Microbial genome sequencing can now be conducted by small laboratories and as part of undergraduate curriculum. While sequencing is routine in microbiology, assembly, annotation and downstream analyses still require computational resources and expertise, often necessitating familiarity with programming languages. To address this problem, we have created a light-weight, user-friendly tool for the assembly and annotation of microbial sequencing projects. The Prokaryotic Assembly and Annotation Tool, Peasant, …


Sequencing And Comparative Analysis Of De Novo Genome Assemblies Of Streptomyces Aureofaciens Atcc 10762, Julien S. Gradnigo Jul 2016

Sequencing And Comparative Analysis Of De Novo Genome Assemblies Of Streptomyces Aureofaciens Atcc 10762, Julien S. Gradnigo

School of Biological Sciences: Dissertations, Theses, and Student Research

Streptomyces aureofaciens is a Gram-positive Actinomycete used for commercial antibiotic production. Although it has been the subject of many biochemical studies, no public genome resource was available prior to this project. To address this need, the genome of S. aureofaciens ATCC 10762 was sequenced using a combination of sequencing platforms (Illumina and 454-shotgun). Multiple de novo assembly methods (SGA, IDBA, Trinity, SOAPdenovo2, MIRA, Velvet and SPAdes) as well as combinations of these methods were assessed to determine which provided the most robust assembly. Combination strategies led to a consistent overestimation of the total genome size. Empirical data from targeted PCR …


Simba: A Web Tool For Managing Bacterial Genome Assembly Generated By Ion Pgm Sequencing Technology, Diego C. B. Mariano, Felipe L. Pereira, Edgar L. Aguiar, Letícia C. Oliveira, Leandro Benevides, Luís C. Guimarães, Edson L. Folador, Thiago J. Sousa, Preetam Ghosh, Debmalya Barh, Henrique C. P. Figueiredo, Artur Silva, Rommel T. J. Ramos, Vasco A. C. Azevedo Jan 2016

Simba: A Web Tool For Managing Bacterial Genome Assembly Generated By Ion Pgm Sequencing Technology, Diego C. B. Mariano, Felipe L. Pereira, Edgar L. Aguiar, Letícia C. Oliveira, Leandro Benevides, Luís C. Guimarães, Edson L. Folador, Thiago J. Sousa, Preetam Ghosh, Debmalya Barh, Henrique C. P. Figueiredo, Artur Silva, Rommel T. J. Ramos, Vasco A. C. Azevedo

Biology Publications

Background

The evolution of Next-Generation Sequencing (NGS) has considerably reduced the cost per sequenced-base, allowing a significant rise of sequencing projects, mainly in prokaryotes. However, the range of available NGS platforms requires different strategies and software to correctly assemble genomes. Different strategies are necessary to properly complete an assembly project, in addition to the installation or modification of various software. This requires users to have significant expertise in these software and command line scripting experience on Unix platforms, besides possessing the basic expertise on methodologies and techniques for genome assembly. These difficulties often delay the complete genome assembly projects.

Results …


Efficient Algorithms For Prokaryotic Whole Genome Assembly And Finishing, Abhishek Biswas Oct 2015

Efficient Algorithms For Prokaryotic Whole Genome Assembly And Finishing, Abhishek Biswas

Computer Science Theses & Dissertations

De-novo genome assembly from DNA fragments is primarily based on sequence overlap information. In addition, mate-pair reads or paired-end reads provide linking information for joining gaps and bridging repeat regions. Genome assemblers in general assemble long contiguous sequences (contigs) using both overlapping reads and linked reads until the assembly runs into an ambiguous repeat region. These contigs are further bridged into scaffolds using linked read information. However, errors can be made in both phases of assembly due to high error threshold of overlap acceptance and linking based on too few mate reads. Identical as well as similar repeat regions can …


Error Correction In Next Generation Dna Sequencing Data, Michael Z. Molnar Dec 2012

Error Correction In Next Generation Dna Sequencing Data, Michael Z. Molnar

Electronic Thesis and Dissertation Repository

Motivation: High throughput Next Generation Sequencing (NGS) technologies can sequence the genome of a species quickly and cheaply. Errors that are introduced by NGS technologies limit the full potential of the applications that rely on their data. Current techniques used to correct these errors are not sufficient, and a more efficient and accurate program is needed to correct errors.

Results: We have designed and implemented RACER (Rapid Accurate Correction of Errors in Reads), an error correction program that targets the Illumina genome sequencer, which is currently the dominant NGS technology. RACER combines advanced data structures with an intricate analysis of …


De Novo Genome Assembly And Snp Marker Development Of Pyrenophora Semeniperda, Marcus Makina Soliai Mar 2011

De Novo Genome Assembly And Snp Marker Development Of Pyrenophora Semeniperda, Marcus Makina Soliai

Theses and Dissertations

Pyrenophora semeniperda (anamorph Drechslera campulata) is a necrotrophic fungal seed pathogen of a variety of grass genra and species, including Bromus tectorum, an exotic grass that has invaded many natural ecosystems of the U.S. Intermountain West. As a natural seed pathogen of B. tectorum, P. semeniperda has potential as a biocontrol agent due to its effectiveness at killing dormant B. tectorum seeds; however, few genetic resources exist for this fungus. Here, the genome assembly of a P. semeniperda isolate using 454 GS-FLX genomic and paired-end pyrosequencing techniques is presented. The total assembly is 32.5 Mb and contains …