Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Computer Sciences

University of Massachusetts Amherst

Selected Works

2004

Conditional random fields

Full-Text Articles in Physical Sciences and Mathematics

Table Extraction For Answer Retrieval, Xing Wei, Bruce Croft, Andrew McCallum, Jan 2004

Andrew McCallum

The ability to find tables and extract information from them is a necessary component of question answering and other information retrieval tasks. Documents often contain tables to communicate densely packed, multidimensional information. Tables do this by employing layout patterns to efficiently indicate fields and records in two-dimensional form. Their rich combination of formatting and content presents difficulties for traditional retrieval techniques. This paper describes techniques for extracting tables from text and retrieving answers from the extracted information. We compare machine learning (especially conditional random fields) and heuristic methods for table extraction. Our approach creates a cell document, which …
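
As a rough illustration of what a heuristic table-line detector might look like, the Python sketch below labels each line of plain text as TABLE or TEXT by counting the cells separated by wide whitespace gaps. It is not the method from the paper: the gap rule, the `min_cells` threshold, the function names, and the sample lines are all assumptions made up for this example. A conditional random field would instead learn such layout cues as features and label the lines jointly.

```python
import re

# Assumed separator rule: a tab, or a run of two or more spaces.
GAP = re.compile(r"\t+| {2,}")


def split_cells(line):
    """Split a line into cells on wide whitespace gaps (hypothetical rule)."""
    return [cell for cell in GAP.split(line.strip()) if cell]


def label_lines(lines, min_cells=3):
    """Label each line 'TABLE' if it has at least min_cells cells, else 'TEXT'."""
    return [
        "TABLE" if len(split_cells(line)) >= min_cells else "TEXT"
        for line in lines
    ]


# Made-up sample text, for illustration only.
sample = [
    "Rainfall by city is shown below.",
    "City      2001   2002",
    "Amherst   44.1   47.3",
    "Boston    42.5   40.9",
]
labels = label_lines(sample)
```

A per-line rule like this ignores context, such as whether neighboring lines share column alignment. That gap is one reason sequence models are a natural alternative for this task.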