Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Michigan Tech Publications

2022

Computational biology and bioinformatics

Articles 1 - 2 of 2

Full-Text Articles in Physical Sciences and Mathematics

Improving Protein Succinylation Sites Prediction Using Embeddings From Protein Language Model, Suresh Pokharel, Pawel Pratyush, Michael Heinzinger, Robert H. Newman, Dukka Kc Oct 2022

Improving Protein Succinylation Sites Prediction Using Embeddings From Protein Language Model, Suresh Pokharel, Pawel Pratyush, Michael Heinzinger, Robert H. Newman, Dukka Kc

Michigan Tech Publications

Protein succinylation is an important post-translational modification (PTM) responsible for many vital metabolic activities in cells, including cellular respiration, regulation, and repair. Here, we present a novel approach that combines features from supervised word embedding with embedding from a protein language model called ProtT5-XL-UniRef50 (hereafter termed, ProtT5) in a deep learning framework to predict protein succinylation sites. To our knowledge, this is one of the first attempts to employ embedding from a pre-trained protein language model to predict protein succinylation sites. The proposed model, dubbed LMSuccSite, achieves state-of-the-art results compared to existing methods, with performance scores of 0.36, 0.79, 0.79 …


Gene-Based Association Tests Using Gwas Summary Statistics And Incorporating Eqtl, Xuewei Cao, Xuexia Wang, Shuanglin Zhang, Qiuying Sha Mar 2022

Gene-Based Association Tests Using Gwas Summary Statistics And Incorporating Eqtl, Xuewei Cao, Xuexia Wang, Shuanglin Zhang, Qiuying Sha

Michigan Tech Publications

Although genome-wide association studies (GWAS) have been successfully applied to a variety of complex diseases and identified many genetic variants underlying complex diseases via single marker tests, there is still a considerable heritability of complex diseases that could not be explained by GWAS. One alternative approach to overcome the missing heritability caused by genetic heterogeneity is gene-based analysis, which considers the aggregate effects of multiple genetic variants in a single test. Another alternative approach is transcriptome-wide association study (TWAS). TWAS aggregates genomic information into functionally relevant units that map to genes and their expression. TWAS is not only powerful, but …