Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

2002

PDF

Conference

Kernel density clustering

Articles 1 - 1 of 1

Full-Text Articles in Physical Sciences and Mathematics

Impact Of Data Transformation On The Performance Of Different Clustering Methods And Cluster Number Determination Statistics For Analyzing Gene Expression Profile Data, Guoping Shu, Beiyan Zeng, Deanne Wright, Oscar Smith Apr 2002

Impact Of Data Transformation On The Performance Of Different Clustering Methods And Cluster Number Determination Statistics For Analyzing Gene Expression Profile Data, Guoping Shu, Beiyan Zeng, Deanne Wright, Oscar Smith

Conference on Applied Statistics in Agriculture

We have assessed the impact of 13 different data transformation methods on the performance of four types of clustering methods (partitioning (K-mean), hierarchical distance (Average Linkage), multivariate normal mixture, and non-parametric kernel density) and four cluster number determination statistics (CNDS) (Pseudo F, Pseudo t2, Cubic Clustering Criterion (CCC), and Bayesian Information Criterion (BIC), using both simulated and real gene expression profile data. We found that Square Root, Cubic Root, and Spacing transformations have mostly positive impacts on the performance of the four types of clustering methods whereas Tukey's Bisquare and Interquantile Range have mostly negative impacts. The impacts …