Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Computer Sciences

Masters Theses

Theses/Dissertations

2012

Clustering

Articles 1 - 1 of 1

Full-Text Articles in Entire DC Network

Semantic Preserving Text Tepresentation And Its Applications In Text Clustering, Michael Howard Jan 2012

Semantic Preserving Text Tepresentation And Its Applications In Text Clustering, Michael Howard

Masters Theses

Text mining using the vector space representation has proven to be an valuable tool for classification, prediction, information retrieval and extraction. The nature of text data presents several issues to these tasks, including large dimension and the existence of special polysemous and synonymous words. A variety of techniques have been devised to overcome these shortcomings, including feature selection and word sense disambiguation. Privacy preserving data mining is also an area of emerging interest. Existing techniques for privacy preserving data mining require the use of secure computation protocols, which often incur a greatly increased computational cost. In this paper, a generalization-based …