Open Access. Powered by Scholars. Published by Universities.®

Life Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 12 of 12

Full-Text Articles in Life Sciences

Integrated Transcriptome Analysis Reveals Mirna-Mrna Crosstalk In Laryngeal Squamous Cell Carcinoma., Yang Zhang, Yong Chen, Jinhai Yu, Guiming Liu, Zhigang Huang Sep 2019

Integrated Transcriptome Analysis Reveals Mirna-Mrna Crosstalk In Laryngeal Squamous Cell Carcinoma., Yang Zhang, Yong Chen, Jinhai Yu, Guiming Liu, Zhigang Huang

Yong Chen

Next generation sequencing (NGS) has proven to be a powerful tool in delineating myriads of molecular subtypes of cancer, as well as in revealing accumulation of genomic mutations throughout cancer progression. Whole genome microRNA (miRNA) and mRNA expression profiles were obtained from patients with laryngeal squamous cell carcinoma (LSCC) using deep sequencing technology, and were analyzed by utilizing integrative computational approaches. A large number of protein-coding and non-coding genes were detected to be differentially expressed, indicating a functional switch in LSCC cells. A total of 127 mutated genes were detected to be significantly associated with ectoderm and epidermis development. Eleven …


Uncover Disease Genes By Maximizing Information Flow In The Phenome-Interactome Network., Yong Chen, Tao Jiang, Rui Jiang Sep 2019

Uncover Disease Genes By Maximizing Information Flow In The Phenome-Interactome Network., Yong Chen, Tao Jiang, Rui Jiang

Yong Chen

MOTIVATION: Pinpointing genes that underlie human inherited diseases among candidate genes in susceptibility genetic regions is the primary step towards the understanding of pathogenesis of diseases. Although several probabilistic models have been proposed to prioritize candidate genes using phenotype similarities and protein-protein interactions, no combinatorial approaches have been proposed in the literature.

RESULTS: We propose the first combinatorial approach for prioritizing candidate genes. We first construct a phenome-interactome network by integrating the given phenotype similarity profile, protein-protein interaction network and associations between diseases and genes. Then, we introduce a computational method called MAXIF to maximize the information flow in this …


Integrating Human Omics Data To Prioritize Candidate Genes., Yong Chen, Xuebing Wu, Rui Jiang Sep 2019

Integrating Human Omics Data To Prioritize Candidate Genes., Yong Chen, Xuebing Wu, Rui Jiang

Yong Chen

BACKGROUND: The identification of genes involved in human complex diseases remains a great challenge in computational systems biology. Although methods have been developed to use disease phenotypic similarities with a protein-protein interaction network for the prioritization of candidate genes, other valuable omics data sources have been largely overlooked in these methods.

METHODS: With this understanding, we proposed a method called BRIDGE to prioritize candidate genes by integrating disease phenotypic similarities with such omics data as protein-protein interactions, gene sequence similarities, gene expression patterns, gene ontology annotations, and gene pathway memberships. BRIDGE utilizes a multiple regression model with lasso penalty to …


Tracing Evolutionary Footprints To Identify Novel Gene Functional Linkages., Yong Chen, Li Yang, Yunfeng Ding, Shuyan Zhang, Tong He, Fenglou Mao, Congyan Zhang, Huina Zhang, Chaoxing Huo, Pingsheng Liu Sep 2019

Tracing Evolutionary Footprints To Identify Novel Gene Functional Linkages., Yong Chen, Li Yang, Yunfeng Ding, Shuyan Zhang, Tong He, Fenglou Mao, Congyan Zhang, Huina Zhang, Chaoxing Huo, Pingsheng Liu

Yong Chen

Systematic determination of gene function is an essential step in fully understanding the precise contribution of each gene for the proper execution of molecular functions in the cell. Gene functional linkage is defined as to describe the relationship of a group of genes with similar functions. With thousands of genomes sequenced, there arises a great opportunity to utilize gene evolutionary information to identify gene functional linkages. To this end, we established a computational method (called TRACE) to trace gene footprints through a gene functional network constructed from 341 prokaryotic genomes. TRACE performance was validated and successfully tested to predict enzyme …


Investigating Evolutionary Dynamics Of Rha1 Operons, Yong Chen, Dandan Geng, Kristina Ehrhardt, Shaoqiang Zhang Sep 2019

Investigating Evolutionary Dynamics Of Rha1 Operons, Yong Chen, Dandan Geng, Kristina Ehrhardt, Shaoqiang Zhang

Yong Chen

Grouping genes as operons is an important genomic feature of prokaryotic organisms. The comprehensive understanding of the operon organizations would be helpful to decipher transcriptional mechanisms, cellular pathways, and the evolutionary landscape of prokaryotic genomes. Although thousands of prokaryotes have been sequenced, genome-wide investigation of the evolutionary dynamics (division and recombination) of operons among these genomes remains unexplored. Here, we systematically analyzed the operon dynamics of Rhodococcus jostii RHA1 (RHA1), an oleaginous bacterium with high potential applications in biofuel, by comparing 340 prokaryotic genomes that were carefully selected from different genera. Interestingly, 99% of RHA1 operons were observed to exhibit …


Prioritizing Protein Complexes Implicated In Human Diseases By Network Optimization., Yong Chen, Thibault Jacquemin, Shuyan Zhang, Rui Jiang Sep 2019

Prioritizing Protein Complexes Implicated In Human Diseases By Network Optimization., Yong Chen, Thibault Jacquemin, Shuyan Zhang, Rui Jiang

Yong Chen

BACKGROUND: The detection of associations between protein complexes and human inherited diseases is of great importance in understanding mechanisms of diseases. Dysfunctions of a protein complex are usually defined by its member disturbance and consequently result in certain diseases. Although individual disease proteins have been widely predicted, computational methods are still absent for systematically investigating disease-related protein complexes.

RESULTS: We propose a method, MAXCOM, for the prioritization of candidate protein complexes. MAXCOM performs a maximum information flow algorithm to optimize relationships between a query disease and candidate protein complexes through a heterogeneous network that is constructed by combining protein-protein interactions …


Integrated Transcriptomic Analysis Of Trichosporon Asahii Uncovers The Core Genes And Pathways Of Fluconazole Resistance., Haitao Li, Congmin Wang, Yong Chen, Shaoqiang Zhang, Rongya Yang Sep 2019

Integrated Transcriptomic Analysis Of Trichosporon Asahii Uncovers The Core Genes And Pathways Of Fluconazole Resistance., Haitao Li, Congmin Wang, Yong Chen, Shaoqiang Zhang, Rongya Yang

Yong Chen

Trichosporon asahii (T. asahii) has emerged as a dangerous pathogen that causes rare but life-threatening infections. Its resistance to certain antifungal agents makes it difficult to treat, especially for patients undergoing long-term antibiotic therapy. In this study, we performed a series of fluconazole (FLC) perturbation experiments for two T. asahii strains, a clinical isolate stain CBS 2479 (T2) and an environmental isolate strain CBS 8904 (T8), to uncover potential genes and pathways involved in FLC resistance. We achieved 10 transcriptomes of T2 and T8 that were based on dose and time series of FLC perturbations. Systematic comparisons of the transcriptomes …


Climp: Clustering Motifs Via Maximal Cliques With Parallel Computing Design., Shaoqiang Zhang, Yong Chen Sep 2019

Climp: Clustering Motifs Via Maximal Cliques With Parallel Computing Design., Shaoqiang Zhang, Yong Chen

Yong Chen

A set of conserved binding sites recognized by a transcription factor is called a motif, which can be found by many applications of comparative genomics for identifying over-represented segments. Moreover, when numerous putative motifs are predicted from a collection of genome-wide data, their similarity data can be represented as a large graph, where these motifs are connected to one another. However, an efficient clustering algorithm is desired for clustering the motifs that belong to the same groups and separating the motifs that belong to different groups, or even deleting an amount of spurious ones. In this work, a new motif …


Identifying Potential Cancer Driver Genes By Genomic Data Integration., Yong Chen, Jingjing Hao, Wei Jiang, Tong He, Xuegong Zhang, Tao Jiang, Rui Jiang Sep 2019

Identifying Potential Cancer Driver Genes By Genomic Data Integration., Yong Chen, Jingjing Hao, Wei Jiang, Tong He, Xuegong Zhang, Tao Jiang, Rui Jiang

Yong Chen

Cancer is a genomic disease associated with a plethora of gene mutations resulting in a loss of control over vital cellular functions. Among these mutated genes, driver genes are defined as being causally linked to oncogenesis, while passenger genes are thought to be irrelevant for cancer development. With increasing numbers of large-scale genomic datasets available, integrating these genomic data to identify driver genes from aberration regions of cancer genomes becomes an important goal of cancer genome analysis and investigations into mechanisms responsible for cancer development. A computational method, MAXDRIVER, is proposed here to identify potential driver genes on the basis …


Fishermp: Fully Parallel Algorithm For Detecting Combinatorial Motifs From Large Chip-Seq Datasets., Shaoqiang Zhang, Ying Liang, Xiangyun Wang, Zhengchang Su, Yong Chen Sep 2019

Fishermp: Fully Parallel Algorithm For Detecting Combinatorial Motifs From Large Chip-Seq Datasets., Shaoqiang Zhang, Ying Liang, Xiangyun Wang, Zhengchang Su, Yong Chen

Yong Chen

Detecting binding motifs of combinatorial transcription factors (TFs) from chromatin immunoprecipitation sequencing (ChIP-seq) experiments is an important and challenging computational problem for understanding gene regulations. Although a number of motif-finding algorithms have been presented, most are either time consuming or have sub-optimal accuracy for processing large-scale datasets. In this article, we present a fully parallelized algorithm for detecting combinatorial motifs from ChIP-seq datasets by using Fisher combined method and OpenMP parallel design. Large scale validations on both synthetic data and 350 ChIP-seq datasets from the ENCODE database showed that FisherMP has not only super speeds on large datasets, but also …


Integrated Omics Study Delineates The Dynamics Of Lipid Droplets In Rhodococcus Opacus Pd630., Yong Chen, Yunfeng Ding, Li Yang, Jinhai Yu, Guiming Liu, Xumin Wang, Shuyan Zhang, Dan Yu, Lai Song, Hangxiao Zhang, Congyan Zhang, Linhe Huo, Chaoxing Huo, Yang Wang, Yalan Du, Huina Zhang, Peng Zhang, Huimin Na, Shimeng Xu, Yaxin Zhu, Zhensheng Xie, Tong He, Yue Zhang, Guoliang Wang, Zhonghua Fan, Fuquan Yang, Honglei Liu, Xiaowo Wang, Xuegong Zhang, Michael Q Zhang, Yanda Li, Alexander Steinbüchel, Toyoshi Fujimoto, Simon Cichello, Jun Yu, Pingsheng Liu Sep 2019

Integrated Omics Study Delineates The Dynamics Of Lipid Droplets In Rhodococcus Opacus Pd630., Yong Chen, Yunfeng Ding, Li Yang, Jinhai Yu, Guiming Liu, Xumin Wang, Shuyan Zhang, Dan Yu, Lai Song, Hangxiao Zhang, Congyan Zhang, Linhe Huo, Chaoxing Huo, Yang Wang, Yalan Du, Huina Zhang, Peng Zhang, Huimin Na, Shimeng Xu, Yaxin Zhu, Zhensheng Xie, Tong He, Yue Zhang, Guoliang Wang, Zhonghua Fan, Fuquan Yang, Honglei Liu, Xiaowo Wang, Xuegong Zhang, Michael Q Zhang, Yanda Li, Alexander Steinbüchel, Toyoshi Fujimoto, Simon Cichello, Jun Yu, Pingsheng Liu

Yong Chen

Rhodococcus opacus strain PD630 (R. opacus PD630), is an oleaginous bacterium, and also is one of few prokaryotic organisms that contain lipid droplets (LDs). LD is an important organelle for lipid storage but also intercellular communication regarding energy metabolism, and yet is a poorly understood cellular organelle. To understand the dynamics of LD using a simple model organism, we conducted a series of comprehensive omics studies of R. opacus PD630 including complete genome, transcriptome and proteome analysis. The genome of R. opacus PD630 encodes 8947 genes that are significantly enriched in the lipid transport, synthesis and metabolic, indicating a super …


In Situ Capture Of Chromatin Interactions By Biotinylated Dcas9., Xin Liu, Yuannyu Zhang, Yong Chen, Mushan Li, Feng Zhou, Kailong Li, Hui Cao, Min Ni, Yuxuan Liu, Zhimin Gu, Kathryn E Dickerson, Shiqi Xie, Gary C Hon, Zhenyu Xuan, Michael Q Zhang, Zhen Shao, Jian Xu Sep 2019

In Situ Capture Of Chromatin Interactions By Biotinylated Dcas9., Xin Liu, Yuannyu Zhang, Yong Chen, Mushan Li, Feng Zhou, Kailong Li, Hui Cao, Min Ni, Yuxuan Liu, Zhimin Gu, Kathryn E Dickerson, Shiqi Xie, Gary C Hon, Zhenyu Xuan, Michael Q Zhang, Zhen Shao, Jian Xu

Yong Chen

Cis-regulatory elements (CREs) are commonly recognized by correlative chromatin features, yet the molecular composition of the vast majority of CREs in chromatin remains unknown. Here, we describe a CRISPR affinity purification in situ of regulatory elements (CAPTURE) approach to unbiasedly identify locus-specific chromatin-regulating protein complexes and long-range DNA interactions. Using an in vivo biotinylated nuclease-deficient Cas9 protein and sequence-specific guide RNAs, we show high-resolution and selective isolation of chromatin interactions at a single-copy genomic locus. Purification of human telomeres using CAPTURE identifies known and new telomeric factors. In situ capture of individual constituents of the enhancer cluster controlling human β-globin …