Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability

Theses and Dissertations

2006

Cluster analysis

Articles 1 - 1 of 1

Full-Text Articles in Physical Sciences and Mathematics

Optimal Clustering: Genetic Constrained K-Means And Linear Programming Algorithms, Jianmin Zhao Jan 2006

Optimal Clustering: Genetic Constrained K-Means And Linear Programming Algorithms, Jianmin Zhao

Theses and Dissertations

Methods for determining clusters of data under- specified constraints have recently gained popularity. Although general constraints may be used, we focus on clustering methods with the constraint of a minimal cluster size. In this dissertation, we propose two constrained k-means algorithms: Linear Programming Algorithm (LPA) and Genetic Constrained K-means Algorithm (GCKA). Linear Programming Algorithm modifies the k-means algorithm into a linear programming problem with constraints requiring that each cluster have m or more subjects. In order to achieve an acceptable clustering solution, we run the algorithm with a large number of random sets of initial seeds, and choose the solution …