Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Engineering

University of Nebraska - Lincoln

CSE Conference and Workshop Papers

Series

Category hierarchies

Full-Text Articles in Physical Sciences and Mathematics

Table Headers: An Entrance To The Data Mine, George Nagy, Sharad C. Seth Jan 2016

Algorithmic methods are demonstrated for information extraction from table header elements, including data categories and data hierarchies. The table headers are found with the Minimum Index Point Search algorithm. Applied to a random sample of 400 diverse tables with ground truth and 1120 tables without ground truth from international statistical data sites, the header-path alignment and header completion algorithms yield database-ready table content and configuration statistics.