Open Access. Powered by Scholars. Published by Universities.®

Medicine and Health Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Faculty Research 2021

Computational Biology

Articles 1 - 2 of 2

Full-Text Articles in Medicine and Health Sciences

Discovery Of Widespread Transcription Initiation At Microsatellites Predictable By Sequence-Based Deep Neural Network., Mathys Grapotte, Manu Saraswat, Chloé Bessière, Christophe Menichelli, Jordan A Ramilowski, Jessica Severin, Yoshihide Hayashizaki, Masayoshi Itoh, Michihira Tagami, Mitsuyoshi Murata, Miki Kojima-Ishiyama, Shohei Noma, Shuhei Noguchi, Takeya Kasukawa, Akira Hasegawa, Harukazu Suzuki, Hiromi Nishiyori-Sueki, Martin C Frith, Fantom Consortium, Clément Chatelain, Piero Carninci, Michiel J L De Hoon, Wyeth W Wasserman, Laurent Bréhélin, Charles-Henri Lecellier, Judith A. Blake, Carol J Bult Jun 2021

Discovery Of Widespread Transcription Initiation At Microsatellites Predictable By Sequence-Based Deep Neural Network., Mathys Grapotte, Manu Saraswat, Chloé Bessière, Christophe Menichelli, Jordan A Ramilowski, Jessica Severin, Yoshihide Hayashizaki, Masayoshi Itoh, Michihira Tagami, Mitsuyoshi Murata, Miki Kojima-Ishiyama, Shohei Noma, Shuhei Noguchi, Takeya Kasukawa, Akira Hasegawa, Harukazu Suzuki, Hiromi Nishiyori-Sueki, Martin C Frith, Fantom Consortium, Clément Chatelain, Piero Carninci, Michiel J L De Hoon, Wyeth W Wasserman, Laurent Bréhélin, Charles-Henri Lecellier, Judith A. Blake, Carol J Bult

Faculty Research 2021

Using the Cap Analysis of Gene Expression (CAGE) technology, the FANTOM5 consortium provided one of the most comprehensive maps of transcription start sites (TSSs) in several species. Strikingly, ~72% of them could not be assigned to a specific gene and initiate at unconventional regions, outside promoters or enhancers. Here, we probe these unassigned TSSs and show that, in all species studied, a significant fraction of CAGE peaks initiate at microsatellites, also called short tandem repeats (STRs). To confirm this transcription, we develop Cap Trap RNA-seq, a technology which combines cap trapping and long read MinION sequencing. We train sequence-based deep …


Integrating Genomics And Metabolomics For Scalable Non-Ribosomal Peptide Discovery., Bahar Behsaz, Edna Bode, Alexey Gurevich, Yan-Ni Shi, Florian Grundmann, Deepa Acharya, Andrés Mauricio Caraballo-Rodríguez, Amina Bouslimani, Morgan Panitchpakdi, Annabell Linck, Changhui Guan, Julia Oh, Pieter C Dorrestein, Helge B Bode, Pavel A Pevzner, Hosein Mohimani May 2021

Integrating Genomics And Metabolomics For Scalable Non-Ribosomal Peptide Discovery., Bahar Behsaz, Edna Bode, Alexey Gurevich, Yan-Ni Shi, Florian Grundmann, Deepa Acharya, Andrés Mauricio Caraballo-Rodríguez, Amina Bouslimani, Morgan Panitchpakdi, Annabell Linck, Changhui Guan, Julia Oh, Pieter C Dorrestein, Helge B Bode, Pavel A Pevzner, Hosein Mohimani

Faculty Research 2021

Non-Ribosomal Peptides (NRPs) represent a biomedically important class of natural products that include a multitude of antibiotics and other clinically used drugs. NRPs are not directly encoded in the genome but are instead produced by metabolic pathways encoded by biosynthetic gene clusters (BGCs). Since the existing genome mining tools predict many putative NRPs synthesized by a given BGC, it remains unclear which of these putative NRPs are correct and how to identify post-assembly modifications of amino acids in these NRPs in a blind mode, without knowing which modifications exist in the sample. To address this challenge, here we report NRPminer, …