Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Dissertations

1999

Data structures (Computer science).

Full-Text Articles in Physical Sciences and Mathematics

Automatic Document Classification And Extraction System (ADOCES), Xuhong Li, May 1999

Dissertations

Document processing is a critical element of office automation. Document image processing begins with the Optical Character Recognition (OCR) phase, followed by more complex processing for document classification and extraction. Document classification assigns an incoming document to one of a set of predefined document types. Document extraction takes the information pertinent to users from the content of a document and assigns it as the values of the “logical structure” of the document type. After classification and extraction, a paper document is therefore represented in its digital form instead of its original image file format, …
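
The two steps described in the abstract can be illustrated with a minimal sketch. It is not the method used in the dissertation: the keyword-overlap classifier, the regular-expression extractor, the `DOCUMENT_TYPES` table and every field name in it are assumptions made only for illustration. The input is taken to be plain text already produced by OCR.

```python
import re

# Hypothetical document types. Each one has a set of classification keywords
# and a "logical structure": field name -> regex with one capture group.
DOCUMENT_TYPES = {
    "memo": {
        "keywords": {"memo", "memorandum", "from", "to", "subject"},
        "fields": {
            "sender": r"^From:\s*(.+)$",
            "recipient": r"^To:\s*(.+)$",
            "subject": r"^Subject:\s*(.+)$",
        },
    },
    "invoice": {
        "keywords": {"invoice", "total", "due", "amount"},
        "fields": {
            "total": r"^Total:\s*(.+)$",
        },
    },
}


def classify(text):
    """Return the predefined type whose keywords overlap the text most, or None."""
    words = set(re.findall(r"[a-z]+", text.lower()))
    scores = {
        name: len(words & spec["keywords"])
        for name, spec in DOCUMENT_TYPES.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def extract(text, doc_type):
    """Fill the logical structure of doc_type with values found in the text."""
    values = {}
    for field_name, pattern in DOCUMENT_TYPES[doc_type]["fields"].items():
        match = re.search(pattern, text, flags=re.MULTILINE | re.IGNORECASE)
        values[field_name] = match.group(1).strip() if match else None
    return values


ocr_text = "MEMORANDUM\nFrom: Alice\nTo: Bob\nSubject: Budget review\nPlease see attached."
doc_type = classify(ocr_text)
record = extract(ocr_text, doc_type)
print(doc_type, record)
```

For the sample text, `classify` picks the hypothetical `memo` type and `extract` returns a dictionary keyed by that type's fields, which stands in for the digital representation of the paper document.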