Open Access. Powered by Scholars. Published by Universities.®

Computer Engineering Commons

Open Access. Powered by Scholars. Published by Universities.®

Electrical and Computer Engineering

PDF

Series

2005

BRC

Articles 1 - 1 of 1

Full-Text Articles in Computer Engineering

Text Degradations And Ocr Training, Elisa H. Barney Smith, Tim Andersen Jan 2005

Text Degradations And Ocr Training, Elisa H. Barney Smith, Tim Andersen

Electrical and Computer Engineering Faculty Publications and Presentations

Printing and scanning of text documents introduces degradations to the characters which can be modeled. Interestingly, certain combinations of the parameters that govern the degradations introduced by the printing and scanning process affect characters in such a way that the degraded characters have a similar appearance, while other degradations leave the characters with an appearance that is very different. It is well known that (generally speaking) a test set that more closely matches a training set will be recognized with higher accuracy than one that matches the training set less well. Likewise, classifiers tend to perform better on data sets …