Physical Sciences and Mathematics | Open Access Articles

Creating An Improved Version Using Noisy Ocr From Multiple Editions, David Wemhoener, Ismet Zeki, R. Manmatha

R. Manmatha

This paper evaluates an automated scheme for aligning and combining optical character recognition (OCR) output from three scans of a book to generate a composite version with fewer OCR errors. While there has been some previous work on aligning multiple OCR versions of the same scan, the scheme introduced in this paper does not require that scans be from the same copy of the book, or even the same edition. The three OCR outputs are combined using an algorithm which builds upon an technique which aligns two sequences at a time. In the algorithm a multiple sequence alignment of the …

Full-Text Articles in Physical Sciences and Mathematics

Creating An Improved Version Using Noisy Ocr From Multiple Editions, David Wemhoener, Ismet Zeki, R. Manmatha

R. Manmatha

On Influence Of Line Segmentation In Efficient Word Segmentation In Old Manuscripts, D. Fernández, J. Lladós, A. Fornés, R. Manmatha

R. Manmatha

A Framework For Manipulating And Searching Multiple Retrieval Types, Marc-Allen Cartright, Ethem F. Can, William Dabney, Jeff Dalton, Logan Giorda, Kriste Krstovski, Xiaoye Wu, Ismet Zeki Yalniz, James Allan, R. Manmatha, David Smith

R. Manmatha

Finding Translations In Scanned Book Collections, Ismet Zeki Yalniz, R. Manmatha

R. Manmatha

Partial Duplicate Detection For Large Book Collections, Ismet Zeki Yalniz, Ethem F. Can, R. Manmatha

R. Manmatha

A Novel Word Spotting Method Based On Recurrent Neural Networks, Volkmar Frinken, Andreas Fischer, R. Manmatha, Horst Bunke

R. Manmatha

An Efficient Framework For Searching Text In Noisy Document Images, Ismet Zeki Yalniz, R. Manmatha

R. Manmatha

A Fast Alignment Scheme For Automatic Ocr Evaluation Of Books, Ismet Zeki Yalniz, R. Manmatha

R. Manmatha

Mining Relational Structure From Millions Of Books, David A. Smith, R. Manmatha, James Allan

R. Manmatha

Blstm Neural Network Based Word Retrieval For Hindi Documents, Raman Jain, Volkmar Frinken, C. V. Jawahar, R. Manmatha

R. Manmatha

Nearest Neighbor Based Collection Ocr, Pramod Sankar K., C. V. Jawahar, R. Manmatha

R. Manmatha

Finding Words In Alphabet Soup: Inference On Freeform Character Recognition For Historical Scripts, Nicholas R. Howe, Shaolei Feng, R. Manmatha

R. Manmatha

A Discrete Direct Retrieval Model For Image And Video Retrieval, Shaolei Feng, R. Manmatha

R. Manmatha

A Hierarchical, Hmmbased Automatic Evaluation Of Ocr Accuracy For A Digital Library Of Books, Shaolei Feng, R. Manmatha

R. Manmatha

Combining Text And Audio-Visual Features In Video Indexing, Shih-Fu Chang, R. Manmatha, Tat-Seng Chua

R. Manmatha

Boosted Decision Trees For Word Recognition In Handwritten Document Retrieval, Nicholas R. Howe, Toni M. Rath, R. Manmatha

R. Manmatha

Joint Visualtext Modeling For Automatic Retrieval Of Multimedia Documents, G. Iyengar, P. Duygulu, S. Feng, P. Ircing, S. P. Khudanpur, D. Klakow, M. R. Krause, R. Manmatha, H. J. Nock, D. Petkova, B. Pytlik, P. Virga

R. Manmatha

Classification Models For Historical Manuscript Recognition, S. L. Feng, R. Manmatha

R. Manmatha

Statistical Models For Automatic Video Annotation And Retrieval, V. Lavrenko, S. L. Feng, R. Manmatha

R. Manmatha

A Scale Space Approach For Automatically Segmenting Words From Historical Handwritten Documents, R. Manmatha, Jamie L. Rothfeder

R. Manmatha

Holistic Word Recognition For Handwritten Historical Documents, Victor Lavrenko, Toni M. Rath, R. Manmatha

R. Manmatha

A Search Engine For Historical Manuscript Images, Toni M. Rath, R. Manmatha, Victor Lavrenko

R. Manmatha

An Inference Network Approach To Image Retrieval, Donald Metzler, R. Manmatha

R. Manmatha

Using Maximum Entropy For Automatic Image Annotation, Jiwoon Jeon, R. Manmatha

R. Manmatha

Server Selection Techniques For Distribution Information Retrieval, Yoshiya Kinuta, Brian Neil Levine, R. Manmatha

R. Manmatha

A Statistical Approach To Retrieving Historical Manuscript Images Without Recognition, Toni M. Rath, Victor Lavrenko, R. Manmatha

R. Manmatha

Retrieving Historical Manuscripts Using Shape, Toni M. Rath, Victor Lavrenko, R. Manmatha

R. Manmatha

Indexing Of Handwritten Historical Documents - Recent Progress, R. Manmatha, Toni M. Rath

R. Manmatha

Text Alignment With Handwritten Documents, E. Micah Kornfield, R. Manmatha, James Allan

R. Manmatha

R. Manmatha