Efficient Alignment Algorithms For Dna Sequencing Data, 2018 The University of Western Ontario
Efficient Alignment Algorithms For Dna Sequencing Data, Nilesh Vinod Khiste
Electronic Thesis and Dissertation Repository
The DNA Next Generation Sequencing (NGS) technologies produce data at a low cost, enabling their application to many ambitious fields such as cancer research, disease control, personalized medicine etc. However, even after a decade of research, the modern aligners and assemblers are far from providing efficient and error free genome alignments and assemblies respectively. This is due to the inherent nature of the genome alignment and assembly problem, which involves many complexities. Many algorithms to address this problem have been proposed over the years, but there still is a huge scope for improvement in this research space.
Many new genome ...
A Model Species For Agricultural Pest Genomics: The Genome Of The Colorado Potato Beetle, Leptinotarsa Decemlineata (Coleoptera: Chrysomelidae), 2018 University of Wisconsin-Madison
A Model Species For Agricultural Pest Genomics: The Genome Of The Colorado Potato Beetle, Leptinotarsa Decemlineata (Coleoptera: Chrysomelidae), Sean D. Schoville, Yolanda H. Chen, Martin N. Andersson, Joshua B. Benoit, Anita Bhandari, Julia H. Bowsher, Kristian Brevik, Kaat Cappelle, Mei-Ju M. Chen, Anna K. Childers, Christopher Childers, Olivier Christiaens, Justin Clements, Elise M. Didion, Elena N. Elpidina, Patamarerk Engsontia, Markus Friedrich, Inmaculada García-Robles, Richard A. Gibbs, Chandan Goswami, Alessandro Grapputo, Kristina Gruden, Marcin Grynberg, Bernard Henrissat, Emily C. Jennings, Jeffery W. Jones, Megha Kalsi, Sher A. Khan, Abhishek Kumar, Fei Li, Vincent Lombard, Subba Reddy Palli, June-Sun Yoon
Entomology Faculty Publications
The Colorado potato beetle is one of the most challenging agricultural pests to manage. It has shown a spectacular ability to adapt to a variety of solanaceaeous plants and variable climates during its global invasion, and, notably, to rapidly evolve insecticide resistance. To examine evidence of rapid evolutionary change, and to understand the genetic basis of herbivory and insecticide resistance, we tested for structural and functional genomic changes relative to other arthropod species using genome sequencing, transcriptomics, and community annotation. Two factors that might facilitate rapid evolutionary change include transposable elements, which comprise at least 17% of the genome and ...
Gene-Based Association Study For Lipid Traits In Diverse Cohorts Implicates Bace1 And Sidt2 Regulation In Triglyceride Levels, 2018 Loyola University Chicago
Gene-Based Association Study For Lipid Traits In Diverse Cohorts Implicates Bace1 And Sidt2 Regulation In Triglyceride Levels, Angela Andaleon, Lauren S. Mogil, Heather Wheeler
Bioinformatics Faculty Publications
Plasma lipid levels are risk factors for cardiovascular disease, a leading cause of death worldwide. While many studies have been conducted on lipid genetics, they mainly focus on Europeans and thus their transferability to diverse populations is unclear. We performed SNP- and gene-level genome-wide association studies (GWAS) of four lipid traits in cohorts from Nigeria and the Philippines and compared them to the results of larger, predominantly European meta-analyses. Two previously implicated loci met genome-wide significance in our SNP-level GWAS in the Nigerian cohort, rs34065661 in CETP associated with HDL cholesterol (P = 9.0 × 10−10) and rs1065853 upstream of ...
The Genome Of The Hi5 Germ Cell Line From Trichoplusia Ni, An Agricultural Pest And Novel Model For Small Rna Biology, Yu Fu, Yujing Yang, Han Zhang, Gwen Farley, Junling Wang, Kaycee A. Quarles, Zhiping Weng, Phillip D. Zamore
Open Access Articles
We report a draft assembly of the genome of Hi5 cells from the lepidopteran insect pest, Trichoplusia ni, assigning 90.6% of bases to one of 28 chromosomes and predicting 14,037 protein-coding genes. Chemoreception and detoxification gene families reveal T. ni-specific gene expansions that may explain its widespread distribution and rapid adaptation to insecticides. Transcriptome and small RNA data from thorax, ovary, testis, and the germline-derived Hi5 cell line show distinct expression profiles for 295 microRNA- and > 393 piRNA-producing loci, as well as 39 genes encoding small RNA pathway proteins. Nearly all of the W chromosome is devoted to ...
Impact Of Concurrent Genomic Alterations Detected By Comprehensive Genomic Sequencing On Clinical Outcomes In East-Asian Patients With Egfr-Mutated Lung Adenocarcinoma, Seijiro Sato, Stephen Lyle, Toshifumi Wakai, Masanori Tsuchida
Open Access Articles
Next-generation sequencing (NGS) has enabled comprehensive detection of genomic alterations in lung cancer. Ethnic differences may play a critical role in the efficacy of targeted therapies. The aim of this study was to identify and compare genomic alterations of lung adenocarcinoma between Japanese patients and the Cancer Genome Atlas (TCGA), which majority of patients are from the US. We also aimed to examine prognostic impact of additional genomic alterations in patients harboring EGFR mutations. Genomic alterations were determined in Japanese patients with lung adenocarcinoma (N = 100) using NGS-based sequencing of 415 known cancer genes, and correlated with clinical outcome. EGFR ...
Orbit: A New Paradigm For Genetic Engineering Of Mycobacterial Chromosomes, 2018 University of Massachusetts Medical School
Orbit: A New Paradigm For Genetic Engineering Of Mycobacterial Chromosomes, Kenan C. Murphy, Samantha J. Nelson, Subhalaxmi Nambi, Kadamba Papavinasasundaram, Christina E. Baer, Christopher M. Sassetti
University of Massachusetts Medical School Faculty Publications
Current methods for genome engineering in mycobacteria rely on relatively inefficient recombination systems that require the laborious construction of a long double-stranded DNA substrate for each desired modification. We combined two efficient recombination systems to produce a versatile method for high-throughput chromosomal engineering that obviates the need for the preparation of double-stranded DNA recombination substrates. A synthetic targeting oligonucleotide is incorporated into the chromosome via homologous recombination mediated by the phage Che9c RecT annelase. This oligo contains a site-specific recombination site for the directional Bxb1 integrase (Int), which allows the simultaneous integration of a payload plasmid that contains a cognate ...
Determining Bioindicators For Coastal Tidal Marsh Health Using The Food Web Of Larvae Of The Greenhead Horse Fly (Tabanus Nigrovittatus), 2018 Louisiana State University and Agricultural and Mechanical College
Determining Bioindicators For Coastal Tidal Marsh Health Using The Food Web Of Larvae Of The Greenhead Horse Fly (Tabanus Nigrovittatus), Devika Rajeev Bhalerao
LSU Master's Theses
The greenhead horse fly Tabanus nigrovittatus Macquart is native to coastal marshlands from Texas to Nova Scotia. The larvae are apex invertebrate predators and their development is dependent on the food web in the soil. Surveillance of T. nigrovittatus after the 2010 Deepwater Horizon oil spill in the Gulf of Mexico showed population crashes of adults in the coastal marshes of East Louisiana near places where oil made landfall, but not in West Louisiana where the oil did not reach. Sediment collection in 2011 from West and East Louisiana revealed larval population crashes in the Eastern coastal region. We hypothesized ...
Insights Into Teleost Sex Determination From The Seriola Dorsalis Genome Assembly, 2018 National Marine Fisheries Service
Insights Into Teleost Sex Determination From The Seriola Dorsalis Genome Assembly, Catherine M. Purcell, Arun S. Seetharam, Owyn Snodgrass, Sofia Ortega-García, John R. Hyde, Andrew J. Severin
Office of Biotechnology Publications
Background: The assembly and annotation of a genome is a valuable resource for a species, with applications ranging from conservation genomics to gene discovery. Genomic resource development is especially important for species in culture, such as the California Yellowtail (Seriola dorsalis), the likely candidate for the establishment of commercial offshore aquaculture production in southern California. Genomic resource development for this species will improve the understanding of sex and other phenotypic traits, and allow for rapid increases in genetic improvement for and economic gain in culture production.
Results: We describe the assembly and annotation of the S. dorsalis genome, and present ...
Early Epstein-Barr Virus Genomic Diversity And Convergence Toward The B95.8 Genome In Primary Infection, 2018 University of Massachusetts Medical School
Early Epstein-Barr Virus Genomic Diversity And Convergence Toward The B95.8 Genome In Primary Infection, Eric R. Weiss, Susanna L. Lamers, Jennifer L. Henderson, Alexandre Melnikov, Mohan Somasundaran, Manuel Garber, Liisa K. Selin, Chad Nusbaum, Katherine Luzuriaga
Garber Lab Publications
Over 90% of the world's population is persistently infected with Epstein-Barr virus. While EBV does not cause disease in most individuals, it is the common cause of acute infectious mononucleosis (AIM) and has been associated with several cancers and autoimmune diseases, highlighting a need for a preventive vaccine. At present, very few primary, circulating EBV genomes have been sequenced directly from infected individuals. While low levels of diversity and low viral evolution rates have been predicted for double-stranded DNA (dsDNA) viruses, recent studies have demonstrated appreciable diversity in common dsDNA pathogens (e.g., cytomegalovirus). Here, we report 40 full-length ...
Novel Computational Methods For Sequencing Data Analysis: Mapping, Query, And Classification, 2018 University of Kentucky
Novel Computational Methods For Sequencing Data Analysis: Mapping, Query, And Classification, Xinan Liu
Theses and Dissertations--Computer Science
Over the past decade, the evolution of next-generation sequencing technology has considerably advanced the genomics research. As a consequence, fast and accurate computational methods are needed for analyzing the large data in different applications. The research presented in this dissertation focuses on three areas: RNA-seq read mapping, large-scale data query, and metagenomics sequence classification.
A critical step of RNA-seq data analysis is to map the RNA-seq reads onto a reference genome. This dissertation presents a novel splice alignment tool, MapSplice3. It achieves high read alignment and base mapping yields and is able to detect splice junctions, gene fusions, and circular ...
Draft Genome Sequence Of Streptomyces Sp. Strain Jv178, A Producer Of Clifednamide-Type Polycyclic Tetramate Macrolactams, 2018 Washington University in St. Louis
Draft Genome Sequence Of Streptomyces Sp. Strain Jv178, A Producer Of Clifednamide-Type Polycyclic Tetramate Macrolactams, Yunci Qi, John M. D’Alessandro, Joshua A.V Blodgett
Biology Faculty Publications & Presentations
Here, we report the draft genome sequence of Streptomyces sp. JV178, a strain originating from Connecticut (USA) garden soil. This strain produces the polycyclic tetramate macrolactam compounds clifednamides A and B. The draft genome contains 10.65 Mb, 9,045 predicted protein coding sequences, and several natural product biosynthetic loci.
Shared Data Science Infrastructure For Genomics Data, 2018 Iowa State University
Shared Data Science Infrastructure For Genomics Data, Hamid Bagheri, Usha Muppirala, Andrew J. Severin, Hridesh Rajan
Office of Biotechnology Publications
Creating a computational infrastructure to analyze the wealth of information contained in data repositories that scales well is difficult due to significant barriers in organizing, extracting and analyzing relevant data. Shared Data Science Infrastructures like Boa can be used to more efficiently process and parse data contained in large data repositories. The main features of Boa are inspired from existing languages for data intensive computing and can easily integrate data from biological data repositories. Here, we present an implementation of Boa for Genomic research (BoaG) on a relatively small data repository: RefSeq's 97,716 annotation (GFF) and assembly (FASTA ...
The Genome Of The Soybean Cyst Nematode (Heterodera Glycines) Reveals Complex Patterns Of Duplications Involved In The Evolution Of Parasitism Genes, Rick E. Masonbrink, Thomas R. Maier, Usha Muppirala, Arun S. Seetharam, Etienne Lord, Parijat S. Juvale, Jeremy Schmutz, Nathan T. Johnson, Dmitry Korkin, Melissa G. Mitchum, Benjamin Mimee, Sebastian Eves-Van Den Akker, Matthew Hudson, Andrew J. Severin, Thomas J. Baum
Office of Biotechnology Publications
Heterodera glycines, commonly referred to as the soybean cyst nematode (SCN), is an obligatory and sedentary plant parasite that causes over a billion-dollar yield loss to soybean production annually. Although there are genetic determinants that render soybean plants resistant to certain nematode genotypes, resistant soybean cultivars are increasingly ineffective because their multi-year usage has selected for virulent H. glycines populations. The parasitic success of H. glycines relies on the comprehensive re-engineering of an infection site into a syncytium, as well as the long-term suppression of host defense to ensure syncytial viability. At the forefront of these complex molecular interactions are ...
Impacts Of Genome And Nuclear Architecture On Molecular Evolution In Eukaryotes, 2018 University of Massachusetts Amherst
Impacts Of Genome And Nuclear Architecture On Molecular Evolution In Eukaryotes, Xyrus Maurer-Alcalá
The traditional view of genomes suggests that they are static entities changing slowly in sequence and structure through time (e.g. evolving over geological time-scales). This outdated view has been challenged as our understanding of the dynamic nature of genomes has increased. Changes in DNA content (i.e. polyploidy) are common to specific life-cycle stages in a variety of eukaryotes, as are changes in genome content itself. These dramatic genomic changes include chromosomal deletions (i.e. paternal chromosome deletion in insects; Goday and Esteban 2001; Ross, et al. 2010), developmentally regulated genome rearrangements (e.g. the V(D)J system ...
Tissue Expression Patterns Identify Mouse Cilia Genes, 2018 University of Kentucky
Tissue Expression Patterns Identify Mouse Cilia Genes, Timothy S. Mcclintock
Physiology Research Data
A summary of 2,127 mouse genes linked to cilia by previous high-throughput approaches (reviewed in Inglis et al., 2006), published studies of individual gene products, or by McClintock et al. 2008. Each gene was viewed at Entrez Gene to verify and update the Gene ID and Gene Symbol. Each was also searched against PubMed (Gene Symbol AND (cili* OR flagel*)) for published evidence linking the mouse gene or putative orthologs to cilia or flagella. This searchable file lists identifying information about each gene, functional information about the encoded protein, the number of high-throughput studies linking the gene to cilia ...
Rates And Patterns Of Evolution In A Duplicated Genome In The Family Catostomidae, 2018 University of Northern Iowa
Rates And Patterns Of Evolution In A Duplicated Genome In The Family Catostomidae, Megann Michelle Schmidt
Honors Program Theses
First and foremost, I would like to thank Dr. Gary and Myrna Floyd for financial support through their Undergraduate Research Assistantship, and I would also like to thank the University of Northern Iowa Biology Department for their support of undergraduate research. I would like to express my utmost gratitude towards Dr. Peter Berendzen and Zachary Sperstad for their support and guidance throughout this entire project. Additionally, thanks to the Iowa EPSCoR for funding collection of the data set, and thanks to Andrew Simons at the University of Minnesota and Jonathan Armbruster at Auburn University for help in data collection and ...
Evaluating Nubian Population Structure From Cranial Nonmetric Traits: Gene Flow, Genetic Drift, And Population History Of The Nubian Nile Valle, 2018 Sociology/Anthropology, University of La Verne, La Verne, CA
Evaluating Nubian Population Structure From Cranial Nonmetric Traits: Gene Flow, Genetic Drift, And Population History Of The Nubian Nile Valle, Kanya Godde, Richard L. Jantz
Human Biology Open Access Pre-Prints
Paleolithic archaeological and skeletal remains from the Nile Valley have yielded a complex picture of life along the river. Sociocultural and sociopolitical events during this timeframe shaped population structure, while gene flow and genetic drift further developed it. In this paper, we take a population genetics approach to modeling Nubian biological relationships in an effort to describe how an accumulation of events formed Nubian population structure. A variety of Nubian samples were utilized, spanning the Mesolithic-Christian time periods, and geographically, from just above the first through the third cataracts. Population genetics statistics were employed to estimate and depict biological affinities ...
Meiothermus Ruber Mrub_0320 Gene Is An Ortholog Of The B3452 Gene, Mrub_0321 Gene Is An Ortholog Of The B3451 Gene, Mrub_0322 Gene Is An Ortholog Of The B3453 Gene, Mrub_2366 Gene Is An Ortholog Of The B3450 Gene Found In Escherichia Coli, Which Encode For Components Of An Abc Transporter Involved In Sn-Glycerol - 3-Phosphate, 2018 Augustana College, Rock Island Illinois
Meiothermus Ruber Mrub_0320 Gene Is An Ortholog Of The B3452 Gene, Mrub_0321 Gene Is An Ortholog Of The B3451 Gene, Mrub_0322 Gene Is An Ortholog Of The B3453 Gene, Mrub_2366 Gene Is An Ortholog Of The B3450 Gene Found In Escherichia Coli, Which Encode For Components Of An Abc Transporter Involved In Sn-Glycerol - 3-Phosphate, Jenna Hall, Dr. Lori Scott
Meiothermus ruber Genome Analysis Project
In this project we investigated the biological function of the genes mrub_0320, mrub_0321, mrub_0322, and mrub_2366 (KEGG map number 02010). We predict these genes encode components of a sn-glycerol-3-phosphate (ABC) transporter: 1) mrub_0320 (DNA coordinates 288469..289401) encodes the permease component (aka transmembrane domain), predicted to be an ortholog; 2) mrub_0321 (DNA coordinates 289394..290218) encodes another permease domain, and also contains a transcriptional regular; ATP-binding domain (aka nucleotide binding domain); 3) mrub_0322 (DNA coordinates 290234..291541) encodes the solute binding protein; and 4) mrub_2366 (DNA coordinates 2418207..2419352 on the reverse strand) encodes for an ATP-binding domain for multiple ...
Mrub_2120, Mrub_2121, Mrub_2122, Mrub_2123 And Mrub_2124 Are Orthologs Of E. Coli Genes B3458, B3457, B3456, B3455 And B3454, Respectively, And Make Up An Operon That Codes For The Branched-Chain Amino Acid Abc Transporter In Meiothermus Ruber Dsm 1279, 2018 Augustana College, Rock Island Illinois
Mrub_2120, Mrub_2121, Mrub_2122, Mrub_2123 And Mrub_2124 Are Orthologs Of E. Coli Genes B3458, B3457, B3456, B3455 And B3454, Respectively, And Make Up An Operon That Codes For The Branched-Chain Amino Acid Abc Transporter In Meiothermus Ruber Dsm 1279, Aaron Jones, Madelyn Huber, Dr. Lori Scott
Meiothermus ruber Genome Analysis Project
In this project we investigated the biological function of the genes Mrub_2120, Mrub_2121, Mrub_2122, Mrub_2123 and Mrub_2124 (KEGG map number 02010). We predict these genes encode components of a branched-chain amino acid ATP Binding Cassette (ABC) transporter: 1) Mrub_2120 (DNA coordinates 2169247-2170416 on the reverse strand) encodes the branched-chain amino acid binding protein that is localized to the periplasm; 2) Mrub_2121 (DNA coordinates 2170433..2171353 on the reverse strand) encodes the first TMD; 3) Mrub_2122 (DNA coordinates 2171365..2172279 on the reverse strand) encodes the second TMD; 4) Mrub_2123 (DNA coordinates 2172276..2173028 on the reverse strand) encodes the first ...
Mrub_1675, Mrub_1676, Mrub_1677, And Mrub_1679 Genes Are Orthologs Of B_3458, B_3457, B_3456, And B_3454 Genes In E. Coli, Respectively, Coding For Abc Transporters. Mrub_1678 And B_3455, Though Perform Similar Tasks, Are Not Orthologous, 2018 Augustana College, Rock Island Illinois
Mrub_1675, Mrub_1676, Mrub_1677, And Mrub_1679 Genes Are Orthologs Of B_3458, B_3457, B_3456, And B_3454 Genes In E. Coli, Respectively, Coding For Abc Transporters. Mrub_1678 And B_3455, Though Perform Similar Tasks, Are Not Orthologous, Ravi Patel, Alaina Hofmann, Dr. Lori Scott
Meiothermus ruber Genome Analysis Project
In this project we investigated the biological function of the genes Mrub_1675, Mrub_1676, Mrub_1677, and Mrub_1679 (KEGG map number 02010). We predict these genes encode components of a Branched chain amino acid (ABC) transporter: Mrub_1675 (DNA coordinates 1711022..1712185 on the reverse strand) encodes the permease component, Mrub_1676 (DNA coordinates 1712313..1713170) encodes for the NBD (aka nucleotide binding domain), Mrub_1677 (DNA coordinates 1713167..1714075 on the reverse strand) encodes the NBD (aka nucleotide binding domain), Mrub_1678 (DNA coordinates 1713167..1714075 on the reverse strand) encodes the TMD (aka transmembrane domain) and Mrub_1679 (DNA coordinates 1714781..1715485 on the reverse ...