Open Access. Powered by Scholars. Published by Universities.®

Genetics and Genomics Commons

Open Access. Powered by Scholars. Published by Universities.®

Genomics

Selected Works

Metagenomics

Articles 1 - 4 of 4

Full-Text Articles in Genetics and Genomics

Undergraduates Contain Multitudes: Course-Based Metagenomics Analysis Using Dna Subway Purple Line, Ray A. Enke Dec 2018

Undergraduates Contain Multitudes: Course-Based Metagenomics Analysis Using Dna Subway Purple Line, Ray A. Enke

Ray Enke Ph.D.

No abstract provided.


Fastqc Analysis & Hisat Alignments Using Cyverse (Part 2), Ray A. Enke Oct 2018

Fastqc Analysis & Hisat Alignments Using Cyverse (Part 2), Ray A. Enke

Ray Enke Ph.D.

Part 2 of this in class exercise uses CyVerse Discovery Environment (DE) for the following:
  • view the output files of FastQC analysis
  • create custom data tracks from HISAT alignment files for visualization in the UCSC Genome Browser


Fastqc Analysis & Hisat Alignments Using Cyverse (Part 1), Ray A. Enke Oct 2018

Fastqc Analysis & Hisat Alignments Using Cyverse (Part 1), Ray A. Enke

Ray Enke Ph.D.

This in class exercise demonstrates the basic features of the CyVerse Discovery Environment (DE) cyberinfrastructure and also provides a tutorial for setting up FastQC analysis of next generation sequencing reads as well as HISAT alignment of eukaryotic RNA-seq FASTQ files.


These Are Not The K-Mers You Are Looking For: Efficient Online K-Mer Counting Using A Probabilistic Data Structure, Qingpeng Zhang, Jason Pell, Rosangela Canino-Koning, Adina Chuang Howe, C. Titus Brown Jul 2014

These Are Not The K-Mers You Are Looking For: Efficient Online K-Mer Counting Using A Probabilistic Data Structure, Qingpeng Zhang, Jason Pell, Rosangela Canino-Koning, Adina Chuang Howe, C. Titus Brown

Adina Howe

K-mer abundance analysis is widely used for many purposes in nucleotide sequence analysis, including data preprocessing for de novo assembly, repeat detection, and sequencing coverage estimation. We present the khmer software package for fast and memory efficient online counting of k-mers in sequencing data sets. Unlike previous methods based on data structures such as hash tables, suffix arrays, and trie structures, khmer relies entirely on a simple probabilistic data structure, a Count-Min Sketch. The Count-Min Sketch permits online updating and retrieval of k-mer counts in memory which is necessary to support online k-mer analysis algorithms. On sparse data sets this …