Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Computer Sciences

Brigham Young University

Theses/Dissertations

2015

Cluster confidence

Articles 1 - 1 of 1

Full-Text Articles in Physical Sciences and Mathematics

Cvic: Cluster Validation Using Instance-Based Confidences, Dean M. Lebaron Nov 2015

Cvic: Cluster Validation Using Instance-Based Confidences, Dean M. Lebaron

Theses and Dissertations

As unlabeled data becomes increasingly available, the need for robust data mining techniques increases as well. Clustering is a common data mining tool which seeks to find related, independent patterns in data called clusters. The cluster validation problem addresses the question of how well a given clustering fits the data set. We present CVIC (cluster validation using instance-based confidences) which assigns confidence scores to each individual instance, as opposed to more traditional methods which focus on the clusters themselves. CVIC trains supervised learners to recreate the clustering, and instances are scored based on output from the learners which corresponds to …