Open Access. Powered by Scholars. Published by Universities.®

Medical Genetics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 2 of 2

Full-Text Articles in Medical Genetics

Subject Level Clustering Using A Negative Binomial Model For Small Transcriptomic Studies., Qian Li, Janelle R. Noel-Macdonnell, Devin C. Koestler, Ellen L. Goode, Brooke L. Fridley Dec 2018

Subject Level Clustering Using A Negative Binomial Model For Small Transcriptomic Studies., Qian Li, Janelle R. Noel-Macdonnell, Devin C. Koestler, Ellen L. Goode, Brooke L. Fridley

Manuscripts, Articles, Book Chapters and Other Papers

BACKGROUND: Unsupervised clustering represents one of the most widely applied methods in analysis of high-throughput 'omics data. A variety of unsupervised model-based or parametric clustering methods and non-parametric clustering methods have been proposed for RNA-seq count data, most of which perform well for large samples, e.g. N ≥ 500. A common issue when analyzing limited samples of RNA-seq count data is that the data follows an over-dispersed distribution, and thus a Negative Binomial likelihood model is often used. Thus, we have developed a Negative Binomial model-based (NBMB) clustering approach for application to RNA-seq studies.

RESULTS: We have developed a Negative …


Penalized Mixed-Effects Ordinal Response Models For High-Dimensional Genomic Data In Twins And Families, Amanda E. Gentry Jan 2018

Penalized Mixed-Effects Ordinal Response Models For High-Dimensional Genomic Data In Twins And Families, Amanda E. Gentry

Theses and Dissertations

The Brisbane Longitudinal Twin Study (BLTS) was being conducted in Australia and was funded by the US National Institute on Drug Abuse (NIDA). Adolescent twins were sampled as a part of this study and surveyed about their substance use as part of the Pathways to Cannabis Use, Abuse and Dependence project. The methods developed in this dissertation were designed for the purpose of analyzing a subset of the Pathways data that includes demographics, cannabis use metrics, personality measures, and imputed genotypes (SNPs) for 493 complete twin pairs (986 subjects.) The primary goal was to determine what combination of SNPs and …