Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Electrical and Computer Engineering Faculty Research & Creative Works

Series

2019

Feature selection

Articles 1 - 2 of 2

Full-Text Articles in Engineering

Comparative Analysis Of Feature Selection Methods To Identify Biomarkers In A Stroke-Related Dataset, Thomas Clifford, Justin Bruce, Tayo Obafemi-Ajayi, John Matta Jul 2019

This paper applies machine learning feature selection techniques to the REGARDS stroke-related dataset to identify health-related biomarkers. A data-driven methodological framework is presented to evaluate multiple feature selection methods. In applying the framework, three classifiers are paired with two wrappers, and their performance is evaluated on diverse classification targets such as Current Smoker, Current Alcohol Use, and Deceased. Performance across the logistic regression, random forest, and naïve Bayes classifiers, as quantified by the area under the ROC curve (AUC) and by the selected features, was similar. However, significant differences were observed in running time. Performance of the selected features was …


Genotype Combinations Linked To Phenotype Subgroups In Autism Spectrum Disorders, Junya Zhao, Thy Nguyen, Jonathan Kopel, Perry B. Koob, Donald A. Adieroh, Tayo Obafemi-Ajayi Jul 2019

This paper investigates a computational model that allows for systematic comparison of phenotype data with genotype data (Single Nucleotide Polymorphisms, SNPs), using machine learning techniques to identify discriminant genotype markers associated with phenotypic subgroups. The proposed discriminant SNP identifier model is empirically evaluated using an Autism Spectrum Disorder (ASD) simplex sample. Six phenotype markers were selected to cluster the sample in a hexagonal lattice format, yielding five multidimensional subgroups based on extremities of the phenotype markers. The SNP selection model includes random subspace selection of SNPs in conjunction with feature selection algorithms to determine which set of SNPs were …