Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

All Capstone Projects

Theses/Dissertations

2015

Internet

Full-Text Articles in Physical Sciences and Mathematics

A Survey To Fix The Threshold And Implementation For Detecting Duplicate Web Documents, Manojreddy Bhimireddy, Krishna Pavan Gandi, Reuven Hicks, Bhargav Roy Veeramachaneni Oct 2015

All Capstone Projects

The rapid growth of information accessible on the World Wide Web has made automated tools a necessity for locating information resources of interest and for tracking and analyzing them. Web mining is the branch of data mining that deals with the analysis of the World Wide Web. It draws on concepts from several areas, including data mining, Internet technology and the World Wide Web, and, more recently, the Semantic Web. Web mining can be defined as the procedure of discovering hidden yet potentially beneficial knowledge from the data accessible in …