Open Access. Powered by Scholars. Published by Universities.®
Articles 1 - 1 of 1
Full-Text Articles in Engineering
Automatic Labeling Of Hidden Web Data Using Multi-Heuristics Annotator, Umamageswari Baskaran, R. Kalpana
Automatic Labeling Of Hidden Web Data Using Multi-Heuristics Annotator, Umamageswari Baskaran, R. Kalpana
Future Computing and Informatics Journal
Hidden web contains huge amount of high quality data which are not indexed to search engines. Hidden web refers to web pages which are generated dynamically by embedding backend data matching the search keywords, in server-side templates. They are created for human consumption and makes automated processing cumbersome since structured data is embedded within unstructured HTML tags. In order to enable machine processing, structured data must be detected, extracted and annotated. Many heuristic based approaches DeLa [1], MSAA [2] are available in the literature to perform automatic annotation. Most of these techniques fail if data values didn't contain labels present …