Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Electrical and Computer Engineering

Missouri University of Science and Technology

2024

Clustering

Articles 1 - 2 of 2

Full-Text Articles in Engineering

Meta-Icvi: Ensemble Validity Metrics For Concise Labeling Of Correct, Under- Or Over-Partitioning In Streaming Clustering, Niklas M. Melton, Sasha A. Petrenko, Donald C. Wunsch Jan 2024

Meta-Icvi: Ensemble Validity Metrics For Concise Labeling Of Correct, Under- Or Over-Partitioning In Streaming Clustering, Niklas M. Melton, Sasha A. Petrenko, Donald C. Wunsch

Electrical and Computer Engineering Faculty Research & Creative Works

Understanding the performance and validity of clustering algorithms is both challenging and crucial, particularly when clustering must be done online. Until recently, most validation methods have relied on batch calculation and have required considerable human expertise in their interpretation. Improving real-time performance and interpretability of cluster validation, therefore, continues to be an important theme in unsupervised learning. Building upon previous work on incremental cluster validity indices (iCVIs), this paper introduces the Meta- iCVI as a tool for explainable and concise labeling of partition quality in online clustering. Leveraging a time-series classifier and data-fusion techniques, the Meta- iCVI combines the outputs …


Multiple Imputation For Robust Cluster Analysis To Address Missingness In Medical Data, Arnold Harder, Gayla R. Olbricht, Godwin Ekuma, Daniel B. Hier, Tayo Obafemi-Ajayi Jan 2024

Multiple Imputation For Robust Cluster Analysis To Address Missingness In Medical Data, Arnold Harder, Gayla R. Olbricht, Godwin Ekuma, Daniel B. Hier, Tayo Obafemi-Ajayi

Mathematics and Statistics Faculty Research & Creative Works

Cluster Analysis Has Been Applied To A Wide Range Of Problems As An Exploratory Tool To Enhance Knowledge Discovery. Clustering Aids Disease Subtyping, I.e. Identifying Homogeneous Patient Subgroups, In Medical Data. Missing Data Is A Common Problem In Medical Research And Could Bias Clustering Results If Not Properly Handled. Yet, Multiple Imputation Has Been Under-Utilized To Address Missingness, When Clustering Medical Data. Its Limited Integration In Clustering Of Medical Data, Despite The Known Advantages And Benefits Of Multiple Imputation, Could Be Attributed To Many Factors. This Includes Methodological Complexity, Difficulties In Pooling Results To Obtain A Consensus Clustering, Uncertainty Regarding …