Open Access. Powered by Scholars. Published by Universities.®

Computer Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

2009

Other Computer Engineering

Information retrieval

Articles 1 - 1 of 1

Full-Text Articles in Computer Engineering

Efficient Storage And Domain-Specific Information Discovery On Semistructured Documents, Fernando R. Farfan Nov 2009

Efficient Storage And Domain-Specific Information Discovery On Semistructured Documents, Fernando R. Farfan

FIU Electronic Theses and Dissertations

The increasing amount of available semistructured data demands efficient mechanisms to store, process, and search an enormous corpus of data to encourage its global adoption. Current techniques to store semistructured documents either map them to relational databases, or use a combination of flat files and indexes. These two approaches result in a mismatch between the tree-structure of semistructured data and the access characteristics of the underlying storage devices. Furthermore, the inefficiency of XML parsing methods has slowed down the large-scale adoption of XML into actual system implementations. The recent development of lazy parsing techniques is a major step towards improving …