Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Computer Sciences

University of Kentucky

University of Kentucky Master's Theses

Theses/Dissertations

2011

Term weighting scheme; Document Clustering; Information Retrieval; Page Ranking; Data Mining

Full-Text Articles in Physical Sciences and Mathematics

Cluster-Based Term Weighting And Document Ranking Models, Keerthiram Murugesan Jan 2011

A term weighting scheme measures the importance of a term in a collection. A document ranking model uses these term weights to compute the rank or score of a document in the collection. We present a series of cluster-based term weighting and document ranking models built on the TF-IDF and Okapi BM25 models. These models update inter-cluster and intra-cluster frequency components based on the generated clusters, and use those components, in addition to the term frequency and document frequency components, to weight the importance of a term. In this thesis, we …
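
For orientation, the two baseline models named in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the thesis's method: the IDF forms shown are common textbook variants, the defaults k1=1.2 and b=0.75 are conventional choices, and the helper cluster_frequencies is hypothetical. In particular, the abstract does not define the intra-cluster and inter-cluster frequency components, so the counts used here (documents within one cluster that contain the term, and the number of clusters in which the term occurs) are assumptions made only for illustration.

```python
import math


def tf_idf(term, doc, docs):
    """Classic TF-IDF weight: raw term frequency times log(N / df)."""
    tf = doc.count(term)
    df = sum(1 for d in docs if term in d)
    if tf == 0 or df == 0:
        return 0.0
    return tf * math.log(len(docs) / df)


def bm25_term(term, doc, docs, k1=1.2, b=0.75):
    """Okapi BM25 contribution of one term to a document's score.

    Uses the non-negative IDF variant log((N - df + 0.5) / (df + 0.5) + 1).
    """
    n = len(docs)
    tf = doc.count(term)
    df = sum(1 for d in docs if term in d)
    avgdl = sum(len(d) for d in docs) / n
    idf = math.log((n - df + 0.5) / (df + 0.5) + 1.0)
    length_norm = 1 - b + b * len(doc) / avgdl
    return idf * tf * (k1 + 1) / (tf + k1 * length_norm)


def cluster_frequencies(term, docs, labels, cluster):
    """Hypothetical cluster statistics (assumed definitions, not the thesis's).

    intra: number of documents in `cluster` that contain the term.
    inter: number of distinct clusters in which the term occurs.
    """
    intra = sum(1 for d, c in zip(docs, labels) if c == cluster and term in d)
    inter = len({c for d, c in zip(docs, labels) if term in d})
    return intra, inter


# Toy collection: each document is a list of tokens, with an assumed cluster label.
docs = [
    ["apple", "banana", "apple"],
    ["banana", "cherry"],
    ["cherry", "date"],
    ["apple", "date"],
]
labels = [0, 0, 1, 1]

print(tf_idf("apple", docs[0], docs))
print(bm25_term("apple", docs[0], docs))
print(cluster_frequencies("banana", docs, labels, cluster=0))
```

How the cluster statistics are combined with the term and document frequency components is the subject of the thesis itself and is not reproduced here.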