Open Access. Powered by Scholars. Published by Universities.®

Life Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Bioinformatics

Electronic Thesis and Dissertation Repository

Theses/Dissertations

Machine learning

Publication Year

Articles 1 - 2 of 2

Full-Text Articles in Life Sciences

Mhcherrypan, A Novel Model To Predict The Binding Affinity Of Pan-Specific Class I Hla-Peptide, Xuezhi Xie Apr 2020

Mhcherrypan, A Novel Model To Predict The Binding Affinity Of Pan-Specific Class I Hla-Peptide, Xuezhi Xie

Electronic Thesis and Dissertation Repository

The human leukocyte antigen (HLA) system or complex plays an essential role in regulating the immune system in humans. Accurate prediction of peptide binding with HLA can efficiently help to identify those neoantigens, which potentially make a big difference in immune drug development. HLA is one of the most polymorphic genetic systems in humans, and thousands of HLA allelic versions exist. Due to the high polymorphism of HLA complex, it is still pretty difficult to accurately predict the binding affinity. In this thesis, we presented a new algorithm to combine convolutional neural network and long short-term memory to solve this …


Dna Sequence Classification: It’S Easier Than You Think: An Open-Source K-Mer Based Machine Learning Tool For Fast And Accurate Classification Of A Variety Of Genomic Datasets, Stephen Solis-Reyes Oct 2018

Dna Sequence Classification: It’S Easier Than You Think: An Open-Source K-Mer Based Machine Learning Tool For Fast And Accurate Classification Of A Variety Of Genomic Datasets, Stephen Solis-Reyes

Electronic Thesis and Dissertation Repository

Supervised classification of genomic sequences is a challenging, well-studied problem with a variety of important applications. We propose an open-source, supervised, alignment-free, highly general method for sequence classification that operates on k-mer proportions of DNA sequences. This method was implemented in a fully standalone general-purpose software package called Kameris, publicly available under a permissive open-source license. Compared to competing software, ours provides key advantages in terms of data security and privacy, transparency, and reproducibility. We perform a detailed study of its accuracy and performance on a wide variety of classification tasks, including virus subtyping, taxonomic classification, and human haplogroup assignment. …