Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability

Michigan Technological University

Theses/Dissertations

2023

Boruta

Articles 1 - 1 of 1

Full-Text Articles in Physical Sciences and Mathematics

Machine Learning Methods For Prediction Of Human Infectious Virus And Imputation Of Hla Alleles, Xiaoqing Gao Jan 2023

Machine Learning Methods For Prediction Of Human Infectious Virus And Imputation Of Hla Alleles, Xiaoqing Gao

Dissertations, Master's Theses and Master's Reports

This dissertation contains three Chapters. The following is a concise description of each Chapters.

In Chapter 1, we introduced the Random Forest, a machine learning method, to foresee whether a virus is capable of infecting humans. The Covid pandemic informs us the importance of predicting the ability of a zoonotic virus that can infect humans from its genomic sequence. We used the -mer with and as features of a virus to predict if it can affect humans. We further employed the Boruta algorithm to select the important features, then fed those important features into the Random Forest method to train …