Open Access. Powered by Scholars. Published by Universities.®

Genomics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 3 of 3

Full-Text Articles in Genomics

These Are Not The K-Mers You Are Looking For: Efficient Online K-Mer Counting Using A Probabilistic Data Structure, Qingpeng Zhang, Jason Pell, Rosangela Canino-Koning, Adina Chuang Howe, C. Titus Brown Jul 2014

These Are Not The K-Mers You Are Looking For: Efficient Online K-Mer Counting Using A Probabilistic Data Structure, Qingpeng Zhang, Jason Pell, Rosangela Canino-Koning, Adina Chuang Howe, C. Titus Brown

Adina Howe

K-mer abundance analysis is widely used for many purposes in nucleotide sequence analysis, including data preprocessing for de novo assembly, repeat detection, and sequencing coverage estimation. We present the khmer software package for fast and memory efficient online counting of k-mers in sequencing data sets. Unlike previous methods based on data structures such as hash tables, suffix arrays, and trie structures, khmer relies entirely on a simple probabilistic data structure, a Count-Min Sketch. The Count-Min Sketch permits online updating and retrieval of k-mer counts in memory which is necessary to support online k-mer analysis algorithms. On sparse data sets this …


Combined Metagenomic And Phenomic Approaches Identify A Novel Salt Tolerance Gene From The Human Gut Microbiome, Eamon Culligan, Julian R. Marchesi, Colin Hill, Roy D. Sleator Apr 2014

Combined Metagenomic And Phenomic Approaches Identify A Novel Salt Tolerance Gene From The Human Gut Microbiome, Eamon Culligan, Julian R. Marchesi, Colin Hill, Roy D. Sleator

Department of Biological Sciences Publications

In the current study, a number of salt-tolerant clones previously isolated from a human gut metagenomic library were screened using Phenotype MicroArray (PM) technology to assess their functional capacity. PM's can be used to study gene function, pathogenicity, metabolic capacity and identify drug targets using a series of specialized microtitre plate assays, where each well of the microtitre plate contains a different set of conditions and tests a different phenotype. Cellular respiration is monitored colorimetrically by the reduction of a tetrazolium dye. One clone, SMG 9, was found to be positive for utilization/transport of L-carnitine (a well-characterized osmoprotectant) in the …


The Boiling Springs Lake Metavirome: Charting The Viral Sequence-Space Of An Extreme Environment Microbial Ecosystem, Geoffrey Scott Diemer Mar 2014

The Boiling Springs Lake Metavirome: Charting The Viral Sequence-Space Of An Extreme Environment Microbial Ecosystem, Geoffrey Scott Diemer

Dissertations and Theses

Viruses are the most abundant organisms on Earth, yet their collective evolutionary history, biodiversity and functional capacity is not well understood. Viral metagenomics offers a potential means of establishing a more comprehensive view of virus diversity and evolution, as vast amounts of new sequence data becomes available for comparative analysis.
Metagenomic DNA from virus-sized particles (smaller than 0.2 microns in diameter) was isolated from approximately 20 liters of sediment obtained from Boiling Springs Lake (BSL) and sequenced. BSL is a large, acidic hot-spring (with a pH of 2.2, and temperatures ranging from 50°C to 96°C) located in Lassen Volcanic National …