Open Access. Powered by Scholars. Published by Universities.®
- Institution
- Publication
-
- Biology Faculty Publications (3)
- U.C. Berkeley Division of Biostatistics Working Paper Series (2)
- Biochemistry Publications (1)
- Bioconductor Project Working Papers (1)
- Bioinformatics Faculty Publications (1)
-
- Harvard University Biostatistics Working Paper Series (1)
- Interdisciplinary Informatics Faculty Proceedings & Presentations (1)
- Interdisciplinary Informatics Faculty Publications (1)
- Mathematics & Statistics Faculty Publications (1)
- Student and Faculty Publications (1)
- The University of Michigan Department of Biostatistics Working Paper Series (1)
Articles 1 - 14 of 14
Full-Text Articles in Genetics and Genomics
Structural Diversity And Stress Regulation Of The Plant Immunity-Associated Calmodulin-Binding Protein 60 (Cbp60) Family Of Transcription Factors In Solanum Lycopersicum (Tomato), Vanessa Shivnauth, Sonya Pretheepkumar, Eric J. R. Marchetta, Christina A. M. Rossi, Keaun Amani, Christian Castroverde
Structural Diversity And Stress Regulation Of The Plant Immunity-Associated Calmodulin-Binding Protein 60 (Cbp60) Family Of Transcription Factors In Solanum Lycopersicum (Tomato), Vanessa Shivnauth, Sonya Pretheepkumar, Eric J. R. Marchetta, Christina A. M. Rossi, Keaun Amani, Christian Castroverde
Biology Faculty Publications
Cellular signaling generates calcium (Ca2+) ions, which are ubiquitous secondary messengers decoded by calcium-dependent protein kinases, calcineurins, calreticulin, calmodulins (CAMs), and CAM-binding proteins. Previous studies in the model plant Arabidopsis thaliana have shown the critical roles of the CAM-BINDING PROTEIN 60 (CBP60) protein family in plant growth, stress responses, and immunity. Certain CBP60 factors can regulate plant immune responses, like pattern-triggered immunity, effector-triggered immunity, and synthesis of major plant immune-activating metabolites salicylic acid (SA) and N-hydroxypipecolic acid (NHP). Although homologous CBP60 sequences have been identified in the plant kingdom, their function and regulation in most species remain unclear. In …
Adjusting For Gene-Specific Covariates To Improve Rna-Seq Analysis, Hyeongseon Jeon, Kyu-Sang Lim, Yet Nguyen, Dan Nettleton
Adjusting For Gene-Specific Covariates To Improve Rna-Seq Analysis, Hyeongseon Jeon, Kyu-Sang Lim, Yet Nguyen, Dan Nettleton
Mathematics & Statistics Faculty Publications
Summary
This paper suggests a novel positive false discovery rate (pFDR) controlling method for testing gene-specific hypotheses using a gene-specific covariate variable, such as gene length. We suppose the null probability depends on the covariate variable. In this context, we propose a rejection rule that accounts for heterogeneity among tests by employing two distinct types of null probabilities. We establish a pFDR estimator for a given rejection rule by following Storey's q-value framework. A condition on a type 1 error posterior probability is provided that equivalently characterizes our rejection rule. We also present a suitable procedure for selecting a tuning …
Artificial Intelligence-Driven Meta-Analysis Of Brain Gene Expression Identifies Novel Gene Candidates And A Role For Mitochondria In Alzheimer’S Disease, Caitlin A Finney, Fabien Delerue, Wendy A Gold, David A Brown, Artur Shvetcov
Artificial Intelligence-Driven Meta-Analysis Of Brain Gene Expression Identifies Novel Gene Candidates And A Role For Mitochondria In Alzheimer’S Disease, Caitlin A Finney, Fabien Delerue, Wendy A Gold, David A Brown, Artur Shvetcov
Student and Faculty Publications
Alzheimer's disease (AD) is the most common form of dementia. There is no treatment and AD models have focused on a small subset of genes identified in familial AD. Microarray studies have identified thousands of dysregulated genes in the brains of patients with AD yet identifying the best gene candidates to both model and treat AD remains a challenge. We performed a meta-analysis of microarray data from the frontal cortex (n = 697) and cerebellum (n = 230) of AD patients and healthy controls. A two-stage artificial intelligence approach, with both unsupervised and supervised machine learning, combined with a functional …
Improved Radiation Expression Profiling In Blood By Sequential Application Of Sensitive And Specific Gene Signatures, Eliseos J. Mucaki, Ben C. Shirley, Peter K. Rogan
Improved Radiation Expression Profiling In Blood By Sequential Application Of Sensitive And Specific Gene Signatures, Eliseos J. Mucaki, Ben C. Shirley, Peter K. Rogan
Biochemistry Publications
Purpose. Combinations of expressed genes can discriminate radiation-exposed from normal control blood samples by machine learning based signatures (with 8 to 20% misclassification rates). These signatures can quantify therapeutically-relevant as well as accidental radiation exposures. The prodromal symptoms of Acute Radiation Syndrome (ARS) overlap those present in Influenza and Dengue Fever infections. Surprisingly, these human radiation signatures misclassified gene expression profiles of virally infected samples as false positive exposures. The present study investigates these and other confounders, and then mitigates their impact on signature accuracy.
Methods. This study investigated recall by previous and novel radiation signatures independently derived …
Highly Conserved Molecular Pathways, Including Wnt Signaling, Promote Functional Recovery From Spinal Cord Injury In Lampreys, Paige E. Herman, Angelos Papatheodorou, Stephanie A. Bryant, Courtney K. M. Waterbury, Joseph R. Herdy, Anthony A. Arcese, Joseph D. Buxbaum, Jeramiah J. Smith, Jennifer R. Morgan, Ona Bloom
Highly Conserved Molecular Pathways, Including Wnt Signaling, Promote Functional Recovery From Spinal Cord Injury In Lampreys, Paige E. Herman, Angelos Papatheodorou, Stephanie A. Bryant, Courtney K. M. Waterbury, Joseph R. Herdy, Anthony A. Arcese, Joseph D. Buxbaum, Jeramiah J. Smith, Jennifer R. Morgan, Ona Bloom
Biology Faculty Publications
In mammals, spinal cord injury (SCI) leads to dramatic losses in neurons and synaptic connections, and consequently function. Unlike mammals, lampreys are vertebrates that undergo spontaneous regeneration and achieve functional recovery after SCI. Therefore our goal was to determine the complete transcriptional responses that occur after SCI in lampreys and to identify deeply conserved pathways that promote regeneration. We performed RNA-Seq on lamprey spinal cord and brain throughout the course of functional recovery. We describe complex transcriptional responses in the injured spinal cord, and somewhat surprisingly, also in the brain. Transcriptional responses to SCI in lampreys included transcription factor networks …
Genome-Wide Detection And Analysis Of Multifunctional Genes, Yuri Pritykin, Dario Ghersi, Mona Singh
Genome-Wide Detection And Analysis Of Multifunctional Genes, Yuri Pritykin, Dario Ghersi, Mona Singh
Interdisciplinary Informatics Faculty Publications
Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, …
A Gene-Based Association Method For Mapping Traits Using Reference Transcriptome Data, Eric R. Gamazon, Heather Wheeler, Kaanan P. Shah, Sahar V. Mozaffari, Keston Aquino-Michaels, Robert J. Carroll, Anne E. Eyler, Joshua C. Denny, Gtex Consortium, Dan L. Nicolae, Nancy J. Cox, Hae Kyung Im
A Gene-Based Association Method For Mapping Traits Using Reference Transcriptome Data, Eric R. Gamazon, Heather Wheeler, Kaanan P. Shah, Sahar V. Mozaffari, Keston Aquino-Michaels, Robert J. Carroll, Anne E. Eyler, Joshua C. Denny, Gtex Consortium, Dan L. Nicolae, Nancy J. Cox, Hae Kyung Im
Bioinformatics Faculty Publications
Genome-wide association studies (GWAS) have identified thousands of variants robustly associated with complex traits. However, the biological mechanisms underlying these associations are, in general, not well understood. We propose a gene-based association method called PrediXcan that directly tests the molecular mechanisms through which genetic variation affects phenotype. The approach estimates the component of gene expression determined by an individual’s genetic profile and correlates ‘imputed’ gene expression with the phenotype under investigation to identify genes involved in the etiology of the phenotype. Genetically regulated gene expression is estimated using whole-genome tissue-dependent prediction models trained with reference transcriptome data sets. PrediXcan enjoys …
On The Comparison Of State- And Transition-Based Analysis Of Biological Relevance In Gene Co-Expression Networks, Kathryn Dempsey Cooper, Prasuna Vemuri, Hesham Ali
On The Comparison Of State- And Transition-Based Analysis Of Biological Relevance In Gene Co-Expression Networks, Kathryn Dempsey Cooper, Prasuna Vemuri, Hesham Ali
Interdisciplinary Informatics Faculty Proceedings & Presentations
Traditional correlation network analysis typically involves creating a network using gene expression data and then identifying biologically relevant clusters from that network by enrichment with Gene Ontology or pathway information. When one wants to examine these networks in a dynamic way - such as between controls versus treatment or over time - a "snapshot" approach is taken by comparing network structures at each time point. The biological relevance of these structures are then reported and compared. In this research, we examine the same "snapshot" networks but focus on the enrichment of changes in structure to determine if these results give …
Linear Methods For Analysis And Quality Control Of Relative Expression Ratios From Quantitative Real-Time Polymerase Chain Reaction Experiments, Robert B. Page, Arnold J. Stromberg
Linear Methods For Analysis And Quality Control Of Relative Expression Ratios From Quantitative Real-Time Polymerase Chain Reaction Experiments, Robert B. Page, Arnold J. Stromberg
Biology Faculty Publications
Relative expression quantitative real-time polymerase chain reaction (RT-qPCR) experiments are a common means of estimating transcript abundances across biological groups and experimental treatments. One of the most frequently used expression measures that results from such experiments is the relative expression ratio (RE), which describes expression in experimental samples (i.e., RNA isolated from organisms, tissues, and/or cells that were exposed to one or more experimental or nonbaseline condition) in terms of fold change relative to calibrator samples (i.e., RNA isolated from organisms, tissues, and/or cells that were exposed to a control or baseline condition). Over the past decade, several …
Selecting 'Significant' Differentially Expressed Genes From The Combined Perspective Of The Null And The Alternative, Beatrijs Moerkerke, Els Goetghebeur
Selecting 'Significant' Differentially Expressed Genes From The Combined Perspective Of The Null And The Alternative, Beatrijs Moerkerke, Els Goetghebeur
Harvard University Biostatistics Working Paper Series
No abstract provided.
Cluster Analysis Of Genomic Data With Applications In R, Katherine S. Pollard, Mark J. Van Der Laan
Cluster Analysis Of Genomic Data With Applications In R, Katherine S. Pollard, Mark J. Van Der Laan
U.C. Berkeley Division of Biostatistics Working Paper Series
In this paper, we provide an overview of existing partitioning and hierarchical clustering algorithms in R. We discuss statistical issues and methods in choosing the number of clusters, the choice of clustering algorithm, and the choice of dissimilarity matrix. In particular, we illustrate how the bootstrap can be employed as a statistical method in cluster analysis to establish the reproducibility of the clusters and the overall variability of the followed procedure. We also show how to visualize a clustering result by plotting ordered dissimilarity matrices in R. We present a new R package, hopach, which implements the hybrid clustering method, …
Classification Using Generalized Partial Least Squares, Beiying Ding, Robert Gentleman
Classification Using Generalized Partial Least Squares, Beiying Ding, Robert Gentleman
Bioconductor Project Working Papers
The advances in computational biology have made simultaneous monitoring of thousands of features possible. The high throughput technologies not only bring about a much richer information context in which to study various aspects of gene functions but they also present challenge of analyzing data with large number of covariates and few samples. As an integral part of machine learning, classification of samples into two or more categories is almost always of interest to scientists. In this paper, we address the question of classification in this setting by extending partial least squares (PLS), a popular dimension reduction tool in chemometrics, in …
Mixture Models For Assessing Differential Expression In Complex Tissues Using Microarray Data, Debashis Ghosh
Mixture Models For Assessing Differential Expression In Complex Tissues Using Microarray Data, Debashis Ghosh
The University of Michigan Department of Biostatistics Working Paper Series
The use of DNA microarrays has become quite popular in many scientific and medical disciplines, such as in cancer research. One common goal of these studies is to determine which genes are differentially expressed between cancer and healthy tissue, or more generally, between two experimental conditions. A major complication in the molecular profiling of tumors using gene expression data is that the data represent a combination of tumor and normal cells. Much of the methodology developed for assessing differential expression with microarray data has assumed that tissue samples are homogeneous. In this article, we outline a general framework for determining …
Statistical Inference For Simultaneous Clustering Of Gene Expression Data, Katherine S. Pollard, Mark J. Van Der Laan
Statistical Inference For Simultaneous Clustering Of Gene Expression Data, Katherine S. Pollard, Mark J. Van Der Laan
U.C. Berkeley Division of Biostatistics Working Paper Series
Current methods for analysis of gene expression data are mostly based on clustering and classification of either genes or samples. We offer support for the idea that more complex patterns can be identified in the data if genes and samples are considered simultaneously. We formalize the approach and propose a statistical framework for two-way clustering. A simultaneous clustering parameter is defined as a function of the true data generating distribution, and an estimate is obtained by applying this function to the empirical distribution. We illustrate that a wide range of clustering procedures, including generalized hierarchical methods, can be defined as …