Open Access. Powered by Scholars. Published by Universities.®

Databases and Information Systems Commons

Open Access. Powered by Scholars. Published by Universities.®

University of Nebraska - Lincoln

Other Computer Sciences

Table segmentation

Articles 1 - 1 of 1

Full-Text Articles in Databases and Information Systems

Segmenting Tables Via Indexing Of Value Cells By Table Headers, Sharad C. Seth, George Nagy Aug 2013

Segmenting Tables Via Indexing Of Value Cells By Table Headers, Sharad C. Seth, George Nagy

CSE Conference and Workshop Papers

Correct segmentation of a web table into its component regions is the essential first step to understanding tabular data. Our algorithmic solution to the segmentation problem relies on the property that strings defining row and column header paths uniquely index each data cell in the table. We segment the table using only “logical layout analysis” without resorting to any appearance features or natural language understanding. We start with a CSV table that preserves the 2- dimensional structure and contents of the original source table (e.g., an HTML table) but not font size, font weight, and color. The indexing property of …