Open Access. Powered by Scholars. Published by Universities.®

Genetics and Genomics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 16 of 16

Full-Text Articles in Genetics and Genomics

Development, Evaluation, And Application Of A Novel Error Correction Method For Next Generation Sequencing Data, Isaac Akogwu Dec 2017

Development, Evaluation, And Application Of A Novel Error Correction Method For Next Generation Sequencing Data, Isaac Akogwu

Dissertations

Tremendous evolvement in sequencing technologies and the vast availability of data due to decreasing cost of Next-Generation-Sequencing (NGS) has availed scientists the opportunity to address a wide variety of evolutionary and biological issues. NGS uses massively parallel technology to accelerate the process at the expense of accuracy and read length in comparison to earlier Sanger methods. Therefore, computational limitations exist in how much analysis and information can be gleaned from the data without performing some form of error correction.

Error correction process is laborious and consumes a lot of computational resources. Despite the existence of many NGS data error correction …


All Roads Lead To Weediness: Stories About Weedy Rice Origins, Weedy Genes And Weed Competitiveness, Zhongyun Huang Nov 2017

All Roads Lead To Weediness: Stories About Weedy Rice Origins, Weedy Genes And Weed Competitiveness, Zhongyun Huang

Doctoral Dissertations

Weedy rice (Oryza spp.), a weedy relative of cultivated rice (O.sativa), infests and persists in cultivated rice fields worldwide. Many weedy rice populations have evolved similar adaptive traits, considered part of the ‘agricultural weed syndrome’, making this an ideal model to study the genetic basis of parallel evolution. Using population genetics analyses of South Asian and US weedy rice, my research reveals multiple independent evolution events giving rise to weed groups in the two geographic areas. Weeds in South Asia have highly heterogenous genetic backgrounds, with contributions from both cultivated varieties (aus and indica) …


Discovery Of Sex-Specific Regions In A Salamander Genome, Nataliya Y. Timoshevskaya, Melissa C. Keinath, Jeramiah J. Smith Oct 2017

Discovery Of Sex-Specific Regions In A Salamander Genome, Nataliya Y. Timoshevskaya, Melissa C. Keinath, Jeramiah J. Smith

Commonwealth Computational Summit

Biological Aspects:

Salamander (Ambystoma mexicanum) has a gigantic genome: ~32,000,000,000 bases (10X of size of human genome)

Sex is determined by a pair of morphologically identical chromosomes:

  • ZZ in male
  • ZW in female

Object:

  • Find (if there are any) genomic differences between chromosomes W and Z

Workflow:

  1. Sequencing and de novo assembly of the reference salamander genome
  2. Alignment of short sequences from male and female genomes to the reference
  3. Coverage analysis


The Lamprey Genome: Illuminating Genomic Change Across Eons And Embryogenesis, Jeramiah J. Smith, Courtney K. M. Waterbury, Melissa C. Keinath, Cody B. Saraceno, Vladimir A. Timoshevskiy, Nataliya Y. Timoshevskaya Oct 2017

The Lamprey Genome: Illuminating Genomic Change Across Eons And Embryogenesis, Jeramiah J. Smith, Courtney K. M. Waterbury, Melissa C. Keinath, Cody B. Saraceno, Vladimir A. Timoshevskiy, Nataliya Y. Timoshevskaya

Commonwealth Computational Summit

The lamprey genome provides unique insights into both the deep evolutionary history of vertebrate genomes and the maintenance of genome structure/integrity over development. The lamprey lineage diverged from all other vertebrates approximately 500 million years ago. As such, comparisons between lamprey and other vertebrates permit reconstruction of ancient duplication and rearrangement events that defined the fundamental architecture and gene content of all extant vertebrate genomes. Lamprey also undergoes programmatic changes genome structure that result in the physical elimination of ~20% of its genomic DNA (~0.5Gb from a ~2 Gb genome) from all somatic cell lineages during early embryonic development. Here, …


Deep Ancestry Of Programmed Genome Rearrangement In Lampreys, Vladimir A. Timoshevskiy, Ralph T. Lampman, Jon E. Hess, Laurie L. Porter, Jeramiah J. Smith Sep 2017

Deep Ancestry Of Programmed Genome Rearrangement In Lampreys, Vladimir A. Timoshevskiy, Ralph T. Lampman, Jon E. Hess, Laurie L. Porter, Jeramiah J. Smith

Biology Faculty Publications

In most multicellular organisms, the structure and content of the genome is rigorously maintained over the course of development. However some species have evolved genome biologies that permit, or require, developmentally regulated changes in the physical structure and content of the genome (programmed genome rearrangement: PGR). Relatively few vertebrates are known to undergo PGR, although all agnathans surveyed to date (several hagfish and one lamprey: Petromyzon marinus) show evidence of large scale PGR. To further resolve the ancestry of PGR within vertebrates, we developed probes that allow simultaneous tracking of nearly all sequences eliminated by PGR in P. marinus and …


A Linkage Map For The Newt Notophthalmus Viridescens: Insights In Vertebrate Genome And Chromosome Evolution, Melissa C. Keinath, S. Randal Voss, Panagiotis A. Tsonis, Jeramiah J. Smith Jun 2017

A Linkage Map For The Newt Notophthalmus Viridescens: Insights In Vertebrate Genome And Chromosome Evolution, Melissa C. Keinath, S. Randal Voss, Panagiotis A. Tsonis, Jeramiah J. Smith

Biology Faculty Publications

Genetic linkage maps are fundamental resources that enable diverse genetic and genomic approaches, including quantitative trait locus (QTL) analyses and comparative studies of genome evolution. It is straightforward to build linkage maps for species that are amenable to laboratory culture and genetic crossing designs, and that have relatively small genomes and few chromosomes. It is more difficult to generate linkage maps for species that do not meet these criteria. Here, we introduce a method to rapidly build linkage maps for salamanders, which are known for their enormous genome sizes. As proof of principle, we developed a linkage map with thousands …


Genome-Wide Analysis Of Atp-Binding Cassette (Abc) Transporters In The Sweetpotato Whitefly, Bemisia Tabaci, Lixia Tian, Tianxue Song, Rongjun He, Yang Zeng, Wen Xie, Qingjun Wu, Shaoli Wang, Xuguo Zhou, Youjun Zhang Apr 2017

Genome-Wide Analysis Of Atp-Binding Cassette (Abc) Transporters In The Sweetpotato Whitefly, Bemisia Tabaci, Lixia Tian, Tianxue Song, Rongjun He, Yang Zeng, Wen Xie, Qingjun Wu, Shaoli Wang, Xuguo Zhou, Youjun Zhang

Entomology Faculty Publications

Background: ABC transporter superfamily is one of the largest and ubiquitous groups of proteins. Because of their role in detoxification, insect ABC transporters have gained more attention in recent years. In this study, we annotated ABC transporters from a newly sequenced sweetpotato whitefly genome. Bemisia tabaci Q biotype is an emerging global invasive species that has caused extensive damages to field crops as well as ornamental plants.

Results: A total of 55 ABC transporters containing all eight described subfamilies (A to H) were identified in the B. tabaci Q genome, including 8 ABCAs, 3 ABCBs, 6 ABCCs, 2 ABCDs, 1 …


Discovery And Validation Of Information Theory-Based Transcription Factor And Cofactor Binding Site Motifs., Ruipeng Lu, Eliseos J Mucaki, Peter K Rogan Mar 2017

Discovery And Validation Of Information Theory-Based Transcription Factor And Cofactor Binding Site Motifs., Ruipeng Lu, Eliseos J Mucaki, Peter K Rogan

Biochemistry Publications

Data from ChIP-seq experiments can derive the genome-wide binding specificities of transcription factors (TFs) and other regulatory proteins. We analyzed 765 ENCODE ChIP-seq peak datasets of 207 human TFs with a novel motif discovery pipeline based on recursive, thresholded entropy minimization. This approach, while obviating the need to compensate for skewed nucleotide composition, distinguishes true binding motifs from noise, quantifies the strengths of individual binding sites based on computed affinity and detects adjacent cofactor binding sites that coordinate with the targets of primary, immunoprecipitated TFs. We obtained contiguous and bipartite information theory-based position weight matrices (iPWMs) for 93 sequence-specific TFs, …


Mrub_1873, Mrub_1872, Mrub_1871 Genes Are Predicted Orthologs Of The B2285, B2284, And B2283 Genes Respectively, Found In Escherichia Coli Coding For Nadh Ubiquinone Oxidoreductase Complex Subunits E, F, And G., Hannah Lohmeier, Dr. Lori R. Scott Jan 2017

Mrub_1873, Mrub_1872, Mrub_1871 Genes Are Predicted Orthologs Of The B2285, B2284, And B2283 Genes Respectively, Found In Escherichia Coli Coding For Nadh Ubiquinone Oxidoreductase Complex Subunits E, F, And G., Hannah Lohmeier, Dr. Lori R. Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses the bioinformatics tools associated with the Guiding Education through Novel Investigation –Annotation Collaboration Toolkit (GENI-ACT) to predict gene function. We investigated the biological function of the genes Mrub_1873, Mrub_1872, and Mrub_1871.We predict that Mrub_1873 (DNA coordinates 1933743..1934309 on the reverse strand), Mrub_1872 (DNA coordinates 1932430..1933746 on the reverse strand), and Mrub_1871 (DNA coordinates 1930055..1932421 on the reverse strand) are subunits of the NADH ubiquinone oxidoreductase complex (00190). The complex catalyzes both the transfer of protons across the cytoplasmic membrane and the transfer of electrons to ubiquinone during …


Annotation And Identification Of Several Glycerolipid Metabolic Related Ortholog Genes; Mrub_0437, Mrub_1813 And Mrub_2759 In The Organism Meithermus Ruber And Their Predicted Respective Orthologs B3926, B4042 And Bo514 Found In E.Coli., Abdul Rahman Abdul Kader, Dr. Lori R. Scott Jan 2017

Annotation And Identification Of Several Glycerolipid Metabolic Related Ortholog Genes; Mrub_0437, Mrub_1813 And Mrub_2759 In The Organism Meithermus Ruber And Their Predicted Respective Orthologs B3926, B4042 And Bo514 Found In E.Coli., Abdul Rahman Abdul Kader, Dr. Lori R. Scott

Meiothermus ruber Genome Analysis Project

We predict Mrub_0437 encodes the enzyme glycerol kinase (DNA coordinates [417621..419183), which is an intermediary step of the glycerolipid metabolic pathway (KEGG map00561), It catalyzes the conversion of glycerol to sn-Glycerol-3-phosphate. The E. coli K12 MG1655 ortholog is predicted to be b3926.

We predict Mrub_1813 encodes the enzyme diacylglycerol kinase (DNA coordinates [1864659..1865063), which is an intermediary step of the glycerolipid metabolic pathway (KEGG map00561), It catalyzes the conversion of 1,2-diacyl-sn-glycerol to 1,2-diacyl-sn-glycerol 3-phosphate. The E. coli K12 MG1655 ortholog is predicted to be b4042.

We predict Mrub_2759 encodes the enzyme glycerol kinase (DNA coordinates [2799712..2800665), which is an intermediary …


Serine Biosynthesis And Glycine Biosynthesis/Degradation: Mrub_0173 Is Orthologous To E. Coli B2913 (Sera); Mrub_0125 Is Orthologous To E. Coli B4388 (Serb); Mrub_2910 Is Orthologous To E. Coli B2551 (Glya)., Megan M. Janssen, Dr. Lori R. Scott Jan 2017

Serine Biosynthesis And Glycine Biosynthesis/Degradation: Mrub_0173 Is Orthologous To E. Coli B2913 (Sera); Mrub_0125 Is Orthologous To E. Coli B4388 (Serb); Mrub_2910 Is Orthologous To E. Coli B2551 (Glya)., Megan M. Janssen, Dr. Lori R. Scott

Meiothermus ruber Genome Analysis Project

ABSTRACT. This project is part of the Meiothermus ruber genome analysis project, which uses the bioinformatics tools associated with the Guiding Education through Novel Investigation –Annotation Collaboration Toolkit (GENI-ACT) to predict gene function. We investigated the biological function of the genes Mrub_0173, Mrub_0125, and Mrub_ 2910. We predict that Mrub_0173 encodes the enzyme phosphoglycerate dehydrogenase (DNA coordinates 152982 ... 154347), which is the 1st step of the serine biosynthesis pathway (KEGG map number 00680). It catalyzes the conversion of NAD+ + 3-phospho-D-glycerate → NADH H+ + 3-phospho-hydroxypyruvate. The E. coli K12 MG1655 ortholog is predicted to be b2913, which has …


Mrub_3029, Mrub_2052, Are Predicted Orthologs Of B_0688, B_0394, While Mrub_0759 And Mrub_2365 Are Not Predicted Orthologs Of B_1309, In Escherichia Coli, Which Code For Enzymes Involved In Starch And Sucrose Metabolism, Max A. Benstine, Dr. Lori R. Scott Jan 2017

Mrub_3029, Mrub_2052, Are Predicted Orthologs Of B_0688, B_0394, While Mrub_0759 And Mrub_2365 Are Not Predicted Orthologs Of B_1309, In Escherichia Coli, Which Code For Enzymes Involved In Starch And Sucrose Metabolism, Max A. Benstine, Dr. Lori R. Scott

Meiothermus ruber Genome Analysis Project

We predict that Mrub__[0759] encodes the enzyme [Meiothermus ruber Fruktokinase] (DNA coordinates [741282..742202 on the forward strand] which is the 00500 step of the Starch and Sucrose Metabolism pathway (KEGG map number [2.7.1.4]). It catalyzes the conversion of [ATP + D-fructoseADP + D-fructose 6-phosphate]. The E. coli K12 MG1655 ortholog is predicted to be b1309, which has the gene identifier [ycjM] We predict that Mrub__[ 2365] encodes the enzyme [Meiothermus ruber Fruktokinase] (DNA coordinates [2417118..2418059 on the forward strand], which is the [00500] step of the [Starch and Sucrose Metabolism] pathway (KEGG map number [2.7.1.4]). It catalyzes the …


Mrub_2642, Mrub_1054, And Mrub_1059 Genes Are Orthologs Of The Escherichia Coli Genes B2942, B0159, And B2687 Genes, Respectively, Which Code For Methionine Adenosyltransferase, Adenosylhomocysteine Nucleosidase, And S-Ribosylhomocysteine Lyase, Nicholas M. Orslini, Dr. Lori R. Scott Jan 2017

Mrub_2642, Mrub_1054, And Mrub_1059 Genes Are Orthologs Of The Escherichia Coli Genes B2942, B0159, And B2687 Genes, Respectively, Which Code For Methionine Adenosyltransferase, Adenosylhomocysteine Nucleosidase, And S-Ribosylhomocysteine Lyase, Nicholas M. Orslini, Dr. Lori R. Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses the bioinformatics tools associated with the Guiding Education through Novel Investigation –Annotation Collaboration Toolkit (GENI-ACT) to predict gene function. We investigated the biological function of the genes Mrub_2642, Mrub_1054, and Mrub_1059.

We predict that Mrub_2642 encodes the enzyme methionine adenosyltransferase (DNA coordinates [2677251…2678426] on the reverse strand), the first step of the methionine degradation pathway (KEGG map number 00270). Methionine adenosyltransferase catalyzes the conversion of the substrates, ATP, L-methionine, and water, to yield the products S-adenosyl-L-methionine (SAM), inorganic phosphate, and diphosphate. Mrub_1054 encodes adenosylhomocysteine nucleosidase (DNA …


Mrub_0860, Mrub_0701 And Mrub_2285 Are Orthologous To E. Coli B2892, B2562 And B3863 Within The Recfor Pathway For Homologous Recombination, Bailey Englund, Dr. Lori Scott Jan 2017

Mrub_0860, Mrub_0701 And Mrub_2285 Are Orthologous To E. Coli B2892, B2562 And B3863 Within The Recfor Pathway For Homologous Recombination, Bailey Englund, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses the bioinformatics tool associated with the Guiding Education through Novel Investigation – Annotation Collaboration Toolkit (GENI-ACT) to predict the gene function. We investigated the biological function of the genes Mrub_0860, Mrub_0701,and Mrub_2285. We predicted that Mrub_0860 (DNA coordinates 842934..844868 on the forward strand) encodes for the enzyme single-stranded DNA-specific exonuclease, which is in the first step of homologous recombination via the RecFOR pathway (KEGG map number 03440). The E. coli K12 MG1655 ortholog is predicted to be b2892, which has the gene identifier …


Annotation Of Genes Involved With Biosynthetic Production Of Peptidoglycan Within Meiothermus Ruber Involving Supposed Orthologous Genes: Mrub_0981 And B1069, Mrub_1162 And B063, Mrub_1999 And B0084., Marckus Simmons, Dr. Lori Scott Jan 2017

Annotation Of Genes Involved With Biosynthetic Production Of Peptidoglycan Within Meiothermus Ruber Involving Supposed Orthologous Genes: Mrub_0981 And B1069, Mrub_1162 And B063, Mrub_1999 And B0084., Marckus Simmons, Dr. Lori Scott

Meiothermus ruber Genome Analysis Project

Using bioinformatics tools, the genes within Meiothermus ruber that are involved with peptidoglycan production were annotated. We predict that Mrub_0981 encodes the enzyme Lipid II Flippase (DNA coordinates970078…971580 on the reverse strand), which is the 9th step of the Peptidoglycan biosynthesis pathway (KEGG map number 00550) It catalyzes the conversion of Meso-2,6-diaminopimelate to Peptidoglycan. The E. coli K12 MG1655 ortholog is predicted to be b1069, which has the gene identifier mviN. We also predict that Mrub_1162 encodes the enzyme Penicillin binding protein II (DNA coordinates 1176079…1177836 on the reverse strand), which is the 12th step of the Peptidoglycan biosynthesis …


Mrub_1867, Mrub_1868, And Mrub_1869 Genes Are Predicted Orthologs Of The B2279, B2280, And B2281 Genes Found In Escherichia Coli Coding For The Nadh Dehydrogenase Subunits K, J, And I Respectively, Wade Smith, Dr. Lori R. Scott Jan 2017

Mrub_1867, Mrub_1868, And Mrub_1869 Genes Are Predicted Orthologs Of The B2279, B2280, And B2281 Genes Found In Escherichia Coli Coding For The Nadh Dehydrogenase Subunits K, J, And I Respectively, Wade Smith, Dr. Lori R. Scott

Meiothermus ruber Genome Analysis Project

This project is part of the Meiothermus ruber genome analysis project, which uses the bioinformatics tools associated with the Guiding Education through Novel Investigation –Annotation Collaboration Toolkit (GENI-ACT) to predict gene function. We investigated the biological function of the genes Mrub_1867, Mrub_1868, and Mrub_1869. We predict that Mrub_1867 (DNA coordinates 1927237..1927527 on the reverse strand), Mrub_1868 (DNA coordinates 1927524..1928123 on the reverse strand), and Mrub_1869 (DNA coordinates 1928248..1928781 on the reverse strand) are subunits of the NADH: ubiquinone oxidoreductase complex (KEGG map number 00190). This complex catalyzes the translocation of H+ across the cytoplasmic …