Open Access. Powered by Scholars. Published by Universities.®

Biomedical Informatics Commons

McKelvey School of Engineering Theses & Dissertations

Theses/Dissertations

2023

Full-Text Articles in Biomedical Informatics

Feature Selection From Clinical Surveys Using Semantic Textual Similarity, Benjamin Warner, May 2023

Survey data collected from human subjects can contain many features but comparatively few examples. Machine learning models that attempt to predict outcomes from such data can overfit and generalize poorly. One remedy is feature selection, which attempts to select an optimal subset of features to learn from. A relatively unexplored source of information in the feature selection process is the textual names of the features, which may indicate semantically which features are relevant to a target outcome. The relationships between feature names …