Open Access. Powered by Scholars. Published by Universities.®

Computer Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Faculty and Research Publications

Performance evaluation (efficiency and effectiveness)

Articles 1 - 2 of 2

Full-Text Articles in Computer Sciences

Towards Solving Similarity Search Problems Using Fuzzy Concept For Multi-Dimensional Data, Yong Shi Jan 2009

Towards Solving Similarity Search Problems Using Fuzzy Concept For Multi-Dimensional Data, Yong Shi

Faculty and Research Publications

In this paper, we present continuous research on data analysis based on our previous work on similarity search problems. PanKNN[13] is a novel technique which explores the meaning of K nearest neighbors from a new perspective, redefines the distances between data points and a given query point Q, and efficiently and effectively select data points which are closest to Q. It can be applied in various data mining fields. In this paper, we applied the Fuzzy concept to improve the performance of PanKNN, targeting the better decision making for the calculation of the distance between a data …


A Dynamic Insertion Approach For Multi-Dimensional Data Using Index Structures, Yong Shi Jan 2009

A Dynamic Insertion Approach For Multi-Dimensional Data Using Index Structures, Yong Shi

Faculty and Research Publications

Nowadays large volumes of data with high dimensionality are being generated in many fields. Most existing indexing techniques degrade rapidly when dimensionality goes higher. A large amount of data sets are time related, and the existence of the obsolete data in the data sets may seriously degrade the data processing. In our previous work[7], we proposed ClusterTree+, a new indexing approach representing clusters generated by any existing clustering approach. It is a hierarchy of clusters and subclusters which incorporates the cluster representation into the index structure to achieve effective and efficient retrieval. It also has features from the …