Open Access. Powered by Scholars. Published by Universities.®

Electrical and Computer Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 11 of 11

Full-Text Articles in Electrical and Computer Engineering

Meta-Icvi: Ensemble Validity Metrics For Concise Labeling Of Correct, Under- Or Over-Partitioning In Streaming Clustering, Niklas M. Melton, Sasha A. Petrenko, Donald C. Wunsch Jan 2024

Meta-Icvi: Ensemble Validity Metrics For Concise Labeling Of Correct, Under- Or Over-Partitioning In Streaming Clustering, Niklas M. Melton, Sasha A. Petrenko, Donald C. Wunsch

Electrical and Computer Engineering Faculty Research & Creative Works

Understanding the performance and validity of clustering algorithms is both challenging and crucial, particularly when clustering must be done online. Until recently, most validation methods have relied on batch calculation and have required considerable human expertise in their interpretation. Improving real-time performance and interpretability of cluster validation, therefore, continues to be an important theme in unsupervised learning. Building upon previous work on incremental cluster validity indices (iCVIs), this paper introduces the Meta- iCVI as a tool for explainable and concise labeling of partition quality in online clustering. Leveraging a time-series classifier and data-fusion techniques, the Meta- iCVI combines the outputs …


An Explainable And Statistically Validated Ensemble Clustering Model Applied To The Identification Of Traumatic Brain Injury Subgroups, Dacosta Yeboah, Louis Steinmeister, Daniel B. Hier, Bassam Hadi, Donald C. Wunsch, Gayla R. Olbricht, Tayo Obafemi-Ajayi Sep 2020

An Explainable And Statistically Validated Ensemble Clustering Model Applied To The Identification Of Traumatic Brain Injury Subgroups, Dacosta Yeboah, Louis Steinmeister, Daniel B. Hier, Bassam Hadi, Donald C. Wunsch, Gayla R. Olbricht, Tayo Obafemi-Ajayi

Electrical and Computer Engineering Faculty Research & Creative Works

We present a framework for an explainable and statistically validated ensemble clustering model applied to Traumatic Brain Injury (TBI). The objective of our analysis is to identify patient injury severity subgroups and key phenotypes that delineate these subgroups using varied clinical and computed tomography data. Explainable and statistically-validated models are essential because a data-driven identification of subgroups is an inherently multidisciplinary undertaking. In our case, this procedure yielded six distinct patient subgroups with respect to mechanism of injury, severity of presentation, anatomy, psychometric, and functional outcome. This framework for ensemble cluster analysis fully integrates statistical methods at several stages of …


Incremental Cluster Validity Indices For Online Learning Of Hard Partitions: Extensions And Comparative Study, Leonardo Enzo Brito Da Silva, Niklas Max Melton, Donald C. Wunsch Jan 2020

Incremental Cluster Validity Indices For Online Learning Of Hard Partitions: Extensions And Comparative Study, Leonardo Enzo Brito Da Silva, Niklas Max Melton, Donald C. Wunsch

Electrical and Computer Engineering Faculty Research & Creative Works

Validation is one of the most important aspects of clustering, particularly when the user is designing a trustworthy or explainable system. However, most clustering validation approaches require batch calculation. This is an important gap because of the value of clustering in real-time data streaming and other online learning applications. Therefore, interest has grown in providing online alternatives for validation. This paper extends the incremental cluster validity index (iCVI) family by presenting incremental versions of Calinski-Harabasz (iCH), Pakhira-Bandyopadhyay-Maulik (iPBM), WB index (iWB), Silhouette (iSIL), Negentropy Increment (iNI), Representative Cross Information Potential (irCIP), Representative Cross Entropy (irH), and Conn_Index (iConn_Index). This paper …


Genotype Combinations Linked To Phenotype Subgroups In Autism Spectrum Disorders, Junya Zhao, Thy Nguyen, Jonathan Kopel, Perry B. Koob, Donald A. Adieroh, Tayo Obafemi-Ajayi Jul 2019

Genotype Combinations Linked To Phenotype Subgroups In Autism Spectrum Disorders, Junya Zhao, Thy Nguyen, Jonathan Kopel, Perry B. Koob, Donald A. Adieroh, Tayo Obafemi-Ajayi

Electrical and Computer Engineering Faculty Research & Creative Works

This paper investigates a computational model that allows for systematic comparison of phenotype data with genotype (Single Nucleotide Polymorphisms (SNPs)) data based on machine learning techniques to identify discriminant genotype markers associated with the phenotypic subgroups. The proposed discriminant SNP identifier model is empirically evaluated using Autism Spectrum Disorder (ASD) simplex sample. Six phenotype markers were selected to cluster the sample in a hexagonal lattice format yielding five multidimensional subgroups based on extremities of the phenotype markers. The SNP selection model includes random subspace selection of SNPs in conjunction with feature selection algorithms to determine which set of SNPs were …


Applications Of Node-Based Resilience Graph Theoretic Framework To Clustering Autism Spectrum Disorders Phenotypes, John Matta, Junya Zhao, Gunes Ercal, Tayo Obafemi-Ajayi Dec 2018

Applications Of Node-Based Resilience Graph Theoretic Framework To Clustering Autism Spectrum Disorders Phenotypes, John Matta, Junya Zhao, Gunes Ercal, Tayo Obafemi-Ajayi

Electrical and Computer Engineering Faculty Research & Creative Works

With the growing ubiquity of data in network form, clustering in the context of a network, represented as a graph, has become increasingly important. Clustering is a very useful data exploratory machine learning tool that allows us to make better sense of heterogeneous data by grouping data with similar attributes based on some criteria. This paper investigates the application of a novel graph theoretic clustering method, Node-Based Resilience clustering (NBR-Clust), to address the heterogeneity of Autism Spectrum Disorder (ASD) and identify meaningful subgroups. The hypothesis is that analysis of these subgroups would reveal relevant biomarkers that would provide a better …


Node-Based Resilience Measure Clustering With Applications To Noisy And Overlapping Communities In Complex Networks, John Matta, Tayo Obafemi-Ajayi, Jeffrey Borwey, Koushik Sinha, Donald C. Wunsch, Gunes Ercal Aug 2018

Node-Based Resilience Measure Clustering With Applications To Noisy And Overlapping Communities In Complex Networks, John Matta, Tayo Obafemi-Ajayi, Jeffrey Borwey, Koushik Sinha, Donald C. Wunsch, Gunes Ercal

Electrical and Computer Engineering Faculty Research & Creative Works

This paper examines a schema for graph-theoretic clustering using node-based resilience measures. Node-based resilience measures optimize an objective based on a critical set of nodes whose removal causes some severity of disconnection in the network. Beyond presenting a general framework for the usage of node-based resilience measures for variations of clustering problems, we experimentally validate the usefulness of such methods in accomplishing the following: (i) clustering a graph in one step without knowing the number of clusters a priori; (ii) removing noise from noisy data; and (iii) detecting overlapping communities. We demonstrate that this clustering schema can be applied successfully …


Analysis Of Grapevine Gene Expression Data Using Node-Based Resilience Clustering, Jeffrey Dale, John Matta, Susanne Howard, Gunes Ercal, Wenping Qiu, Tayo Obafemi-Ajayi Jul 2018

Analysis Of Grapevine Gene Expression Data Using Node-Based Resilience Clustering, Jeffrey Dale, John Matta, Susanne Howard, Gunes Ercal, Wenping Qiu, Tayo Obafemi-Ajayi

Electrical and Computer Engineering Faculty Research & Creative Works

Powdery mildew is the most economically important disease of cultivated grapevines worldwide. In the agricultural community, there is a great need for better understanding of the complex genetic basis of powdery mildew (PM) resistance by delineating possible gene biomarkers associated with the plants' defense mechanisms. Machine learning techniques can be applied to analysis of gene expression data to aid knowledge discovery of disease fighting genes. In this work, we apply a data-driven computational model, utilizing a graph-based clustering algorithm - Node-Based Resilience Clustering (NBRClust), to analyze grapevine gene expression data to identify possible gene biomarkers associated with powdery mildew disease …


Hidden Markov Model With Information Criteria Clustering And Extreme Learning Machine Regression For Wind Forecasting, Dao Lam, Shuhui Li, Donald C. Wunsch Jan 2014

Hidden Markov Model With Information Criteria Clustering And Extreme Learning Machine Regression For Wind Forecasting, Dao Lam, Shuhui Li, Donald C. Wunsch

Electrical and Computer Engineering Faculty Research & Creative Works

This paper proposes a procedural pipeline for wind forecasting based on clustering and regression. First, the data are clustered into groups sharing similar dynamic properties. Then, data in the same cluster are used to train the neural network that predicts wind speed. For clustering, a hidden Markov model (HMM) and the modified Bayesian information criteria (BIC) are incorporated in a new method of clustering time series data. to forecast wind, a new method for wind time series data forecasting is developed based on the extreme learning machine (ELM). the clustering results improve the accuracy of the proposed method of wind …


Development And Implementation Of Optimized Energy-Delay Sub-Network Routing Protocol For Wireless Sensor Networks, Maciej Jan Zawodniok, Jagannathan Sarangapani, Steve Eugene Watkins, James W. Fonda Jan 2006

Development And Implementation Of Optimized Energy-Delay Sub-Network Routing Protocol For Wireless Sensor Networks, Maciej Jan Zawodniok, Jagannathan Sarangapani, Steve Eugene Watkins, James W. Fonda

Electrical and Computer Engineering Faculty Research & Creative Works

The development and implementation of the optimized energy-delay sub-network routing (OEDSR) protocol for wireless sensor networks (WSN) is presented. This ondemand routing protocol minimizes a novel link cost factor which is defined using available energy, end-to-end (E2E) delay and distance from a node to the base station (BS), along with clustering, to effectively route information to the BS. Initially, the nodes are either in idle or sleep mode, but once an event is detected, the nodes near the event become active and start forming sub-networks. Formation of the inactive network into a sub-network saves energy because only a portion of …


Survey Of Clustering Algorithms, Rui Xu, Donald C. Wunsch May 2005

Survey Of Clustering Algorithms, Rui Xu, Donald C. Wunsch

Electrical and Computer Engineering Faculty Research & Creative Works

Data analysis plays an indispensable role for understanding various phenomena. Cluster analysis, primitive exploration with little or no prior knowledge, consists of research developed across a wide variety of communities. The diversity, on one hand, equips us with many tools. On the other hand, the profusion of options causes confusion. We survey clustering algorithms for data sets appearing in statistics, computer science, and machine learning, and illustrate their applications in some benchmark data sets, the traveling salesman problem, and bioinformatics, a new field attracting intensive efforts. Several tightly related topics, proximity measure, and cluster validation, are also discussed.


An Industrial Application To Neural Networks To Reusable Design, Donald C. Wunsch, R. Escobedo, T. P. Caudell, S. D. G. Smith, G. C. Johnson Jan 1991

An Industrial Application To Neural Networks To Reusable Design, Donald C. Wunsch, R. Escobedo, T. P. Caudell, S. D. G. Smith, G. C. Johnson

Electrical and Computer Engineering Faculty Research & Creative Works

Summary form only given, as follows. The feasibility of training an adaptive resonance theory (ART-1) network to first cluster aircraft parts into families, and then to recall the most similar family when presented a new part has been demonstrated, ART-1 networks were used to adaptively group similar input vectors. The inputs to the network were generated directly from computer-aided designs of the parts and consist of binary vectors which represent bit maps of the features of the parts. This application, referred to as group technology, is of large practical value to industry, making it possible to avoid duplication of design …