Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

2006

Physical Sciences and Mathematics

Georgia State University

Web crawler

Articles 1 - 2 of 2

Full-Text Articles in Entire DC Network

A Domain Based Approach To Crawl The Hidden Web, Milan Pandya Dec 2006

Computer Science Theses

A great deal of research is being performed on indexing the Web. Increasingly sophisticated Web crawlers are being designed to search and index the Web faster. But all these traditional crawlers crawl only the part of the Web we call the “Surface Web”. They are unable to crawl the hidden portion of the Web. These traditional crawlers retrieve content only from surface Web pages, which are just a set of Web pages linked by hyperlinks, and ignore the hidden information. Hence, they ignore a tremendous amount of information hidden behind the search forms in Web pages. Most of the …


An Indexation And Discovery Architecture For Semantic Web Services And Its Application In Bioinformatics, Liyang Yu Jun 2006

Computer Science Theses

Recently, much research effort has been devoted to the discovery of relevant Web services. It is widely recognized that adding semantics to service descriptions is the solution to this challenge. Web services with explicit semantic annotation are called Semantic Web Services (SWS). This research proposes an indexation and discovery architecture for SWS, together with a prototype application in the area of bioinformatics. In this approach, an SWS repository is created and maintained by crawling both ontology-oriented UDDI registries and Web sites that host SWS. For a given service request, the proposed system invokes the matching algorithm and a candidate set …