Open Access. Powered by Scholars. Published by Universities.®

Electrical and Computer Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

National Taiwan Ocean University

2010

Data extraction model

Articles 1 - 1 of 1

Full-Text Articles in Electrical and Computer Engineering

On Design Of Browser-Oriented Data Extraction System And The Plug-Ins, Jui-Yuan Su, Der-Johng Sun, I-Chen Wu, Lung-Pin Chen Apr 2010

On Design Of Browser-Oriented Data Extraction System And The Plug-Ins, Jui-Yuan Su, Der-Johng Sun, I-Chen Wu, Lung-Pin Chen

Journal of Marine Science and Technology

Web data extraction systems currently not only extract data on web pages but also need to navigate to the target correctly. Most traditional web data extraction systems extract URLs directly from web pages, and then access next pages using the extracted URLs. This data extraction approach is herein called the URL-oriented data extraction approach in this paper. However, currently, more and more web pages use script functions, such as JavaScript, to access next pages and may hide URLs inside these functions, making it difficult to extract URLs. In order to solve this problem, a new data extraction approach, named the …