Open Access. Powered by Scholars. Published by Universities.®

Genetics and Genomics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 3 of 3

Full-Text Articles in Genetics and Genomics

Sccad: Cluster Decomposition-Based Anomaly Detection For Rare Cell Identification In Single-Cell Expression Data, Yunpei Xu, Shaokai Wang, Qilong Feng, Jiazhi Xia, Yaohang Li, Hong-Dong Li, Jianxin Wang Jan 2024

Sccad: Cluster Decomposition-Based Anomaly Detection For Rare Cell Identification In Single-Cell Expression Data, Yunpei Xu, Shaokai Wang, Qilong Feng, Jiazhi Xia, Yaohang Li, Hong-Dong Li, Jianxin Wang

Computer Science Faculty Publications

Single-cell RNA sequencing (scRNA-seq) technologies have become essential tools for characterizing cellular landscapes within complex tissues. Large-scale single-cell transcriptomics holds great potential for identifying rare cell types critical to the pathogenesis of diseases and biological processes. Existing methods for identifying rare cell types often rely on one-time clustering using partial or global gene expression. However, these rare cell types may be overlooked during the clustering phase, posing challenges for their accurate identification. In this paper, we propose a Cluster decomposition-based Anomaly Detection method (scCAD), which iteratively decomposes clusters based on the most differential signals in each cluster to effectively separate …


Organelle_Pba, A Pipeline For Assembling Chloroplast And Mitochondrial Genomes From Pacbio Dna Sequencing Data, Aboozar Soorni, David Haak, David Zaitlin, Aureliano Bombarely Jan 2017

Organelle_Pba, A Pipeline For Assembling Chloroplast And Mitochondrial Genomes From Pacbio Dna Sequencing Data, Aboozar Soorni, David Haak, David Zaitlin, Aureliano Bombarely

Kentucky Tobacco Research and Development Center Faculty Publications

Background: The development of long-read sequencing technologies, such as single-molecule real-time (SMRT) sequencing by PacBio, has produced a revolution in the sequencing of small genomes. Sequencing organelle genomes using PacBio long-read data is a cost effective, straightforward approach. Nevertheless, the availability of simple-to-use software to perform the assembly from raw reads is limited at present.

Results: We present Organelle-PBA, a Perl program designed specifically for the assembly of chloroplast and mitochondrial genomes. For chloroplast genomes, the program selects the chloroplast reads from a whole genome sequencing pool, maps the reads to a reference sequence from a closely related species, and …


The Homeo Domain Of A Murine Protein Binds 5' To Its Own Homeo Box., Abraham Fainsod, Leonard D. Bogarad, Tarmo Ruusala, Martin Lubin Dec 1986

The Homeo Domain Of A Murine Protein Binds 5' To Its Own Homeo Box., Abraham Fainsod, Leonard D. Bogarad, Tarmo Ruusala, Martin Lubin

Dartmouth Scholarship

Nuclear protein extracts from day 12.5 mouse embryos were used to study protein binding to DNA sequences 5' of the Hox 1.5 homeo box. Embryos of this developmental stage are known to express this gene. DNA binding protein blotting and retardation gel techniques show that murine embryonic nuclear proteins specifically bind a 753-base pair (bp) DNA fragment from the region upstream of the Hox 1.5 homeo box. A fusion protein containing the Hox 1.5 homeo domain constructed in lambda gt11 also binds the same 753-bp DNA fragment. Specific binding of the fusion protein to the upstream DNA fragment shows that …