Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

PDF

2006

Theses and Dissertations

Statistics and Probability

Algorithm evaluation

Articles 1 - 1 of 1

Full-Text Articles in Entire DC Network

Optimal Clustering: Genetic Constrained K-Means And Linear Programming Algorithms, Jianmin Zhao Jan 2006

Optimal Clustering: Genetic Constrained K-Means And Linear Programming Algorithms, Jianmin Zhao

Theses and Dissertations

Methods for determining clusters of data under- specified constraints have recently gained popularity. Although general constraints may be used, we focus on clustering methods with the constraint of a minimal cluster size. In this dissertation, we propose two constrained k-means algorithms: Linear Programming Algorithm (LPA) and Genetic Constrained K-means Algorithm (GCKA). Linear Programming Algorithm modifies the k-means algorithm into a linear programming problem with constraints requiring that each cluster have m or more subjects. In order to achieve an acceptable clustering solution, we run the algorithm with a large number of random sets of initial seeds, and choose the solution …