1992, Proceedings of the IEEE
It is time for a major change of approach to character recognition research. The traditional approach, focusing on the correct classification of isolated characters, has been exhausted. The demonstration of the superiority of a new classification method under operational conditions requires large experimental facilities and databases beyond the resources of most researchers. In any case, even perfect classification of individual characters is insufficient for the conversion of complex archival documents to a useful computer-readable form. Many practical OCR tasks require integrated treatment of entire documents and well-organized typographic and domain-specific knowledge. New OCR systems should take advantage of the typographic uniformity of paragraphs or other layout components. They should also exploit the unavoidable interaction with human operators to improve themselves without explicit "training."
For four years, ISRI has conducted an annual test of optical character recognition (OCR) systems known as "page readers." These systems accept as input a bitmapped image of any document page, and attempt to identify the machine-printed characters on the page. In the annual test, we measure the accuracy of this process by comparing the text that is produced as output with the correct text. The goals of the test include:
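The core of such a test is comparing OCR output against the correct text. A minimal sketch of that comparison, assuming a Levenshtein-style character metric (the actual metrics in ISRI's test reports are defined by their own tools), might look like this:

```python
def edit_distance(a: str, b: str) -> int:
    # Classic dynamic-programming Levenshtein distance between two strings.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return prev[-1]

def character_accuracy(ground_truth: str, ocr_output: str) -> float:
    # Accuracy = (#characters - #errors) / #characters, in the style of
    # ISRI-type accuracy reports (a simplification, not their exact formula).
    n = len(ground_truth)
    return (n - edit_distance(ground_truth, ocr_output)) / n
```

For example, an output differing from an 11-character ground truth by one substitution scores 10/11 ≈ 0.91 character accuracy.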
Vol. 19 No. 2 FEBRUARY 2021 International Journal of Computer Science and Information Security (IJCSIS), 2021
This paper provides a comprehensive overview of OCR. Optical character recognition is the ability of a computer to collect and decipher handwritten input from documents, photos or other sources. Over the years, many researchers have paid attention to this topic and proposed many methods for solving it. This paper provides a historical view and a summary of the research done in this field.
DigItalia, 2020
Books printed before 1800 present major problems for OCR. One of the main obstacles is the lack of diversity of historical fonts in training data. The OCR-D project, consisting of book historians and computer scientists, aims to address this deficiency by focussing on three major issues. Our first target was to create a tool that identifies font groups automatically in images of historical documents. We concentrated on Gothic font groups that were commonly used in German texts printed in the 15th and 16th centuries: the well-known Fraktur and the lesser known Bastarda, Rotunda, Textura and Schwabacher. The tool was trained with 35,000 images and reaches an accuracy level of 98%. It can differentiate not only between the above-mentioned font groups but also Hebrew, Greek, Antiqua and Italic. It can also identify woodcut images and irrelevant data (book covers, empty pages, etc.). In a second step, we created an online training infrastructure (okralact), which allows for the use of various open-source OCR engines such as Tesseract, OCRopus, Kraken and Calamari. At the same time, it facilitates training of specific models for font groups. The high accuracy of the recognition tool paves the way for the unprecedented opportunity to differentiate between the fonts used by individual printers. With more training data and further adjustments, the tool could help to fill a major gap in historical research.
2003
Tools for Optical Character Recognition (OCR), commercially available today, provide different recognition rates depending on a number of factors. We analyse here the features of six of the most widely used "off-the-shelf" OCR software packages.
2005
We announce the availability of the UNLV/ISRI Analytic Tools for OCR Evaluation together with a large and diverse collection of scanned document images with the associated ground-truth text. This combination of tools and test data will allow anyone to conduct a meaningful test comparing the performance of competing page-reading algorithms. The value of this collection of software tools and test data is enhanced by knowledge of the past performance of several systems using exactly these tools and this data. These performance comparisons were published in previous ISRI Test Reports and are also provided. Another value is that the tools can be used to test the character accuracy of any page-reading OCR system for any language included in the Unicode standard. The paper concludes with a summary of the programs, test data, and documentation that is available and gives the URL where they can be located.
Lecture Notes in Computer Science, 2015
Optical Character Recognition (OCR) is a very extensive branch of pattern recognition. The existence of super effective software designed for omnifont text recognition, capable of handling multiple languages, creates an impression that all problems in this field have already been solved. Indeed, focus of research in the OCR domain has constantly been shifting from offline, typewritten, Latin character recognition towards Asiatic alphabets, handwritten scripts and online process. Still, however, it is difficult to come across an elaboration which would not only cover the topic of numerous feature extraction methods for printed, Latin derived, isolated characters conceptually, but which would also attempt to implement, compare and optimize them in an experimental way. This paper aims at closing this gap by thoroughly examining the performance of several statistical methods with respect to their recognition rate and time efficiency.
International Journal of Machine Learning and Computing, 2012
Optical Character Recognition, or OCR, is the electronic translation of handwritten, typewritten or printed text images into machine-encoded text. It is widely used to recognize and search text from electronic documents or to publish text on a website. The paper presents a survey of applications of OCR in different fields and further presents experimentation for three important applications: Captcha, institutional repositories and optical music character recognition. We make use of an enhanced image segmentation algorithm based on histogram equalization using genetic algorithms for optical character recognition. The paper will serve as a good literature survey for researchers starting to work in the field of optical character recognition.
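The genetic-algorithm optimization is the paper's contribution and is not reproduced here, but the underlying histogram-equalization step it builds on is standard and can be sketched in pure Python (assuming 8-bit grayscale input as a 2-D list of intensities):

```python
def equalize_histogram(gray):
    # gray: 2-D list of 8-bit intensities (0-255). Map each value through
    # the normalized cumulative histogram so the output spans the full
    # 0-255 range, improving contrast before segmentation.
    hist = [0] * 256
    for row in gray:
        for v in row:
            hist[v] += 1
    total = sum(hist)
    cdf, running = [0] * 256, 0
    for i, h in enumerate(hist):
        running += h
        cdf[i] = running
    cdf_min = next(c for c in cdf if c > 0)   # first non-zero CDF value
    lut = [round((c - cdf_min) / (total - cdf_min) * 255) for c in cdf]
    return [[lut[v] for v in row] for row in gray]
```

A low-contrast patch such as `[[10, 20], [20, 30]]` is stretched to `[[0, 170], [170, 255]]`, which is the kind of contrast gain that helps a subsequent thresholding or segmentation pass.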
International Journal of Advanced Trends in Computer Science and Engineering, 2020
The technology associated with character recognition has emerged as a vital technology in the era of the Fourth Industrial Revolution. Character recognition is developing as a core technology needed in various fields. It is performed by extracting characters from an image and recognizing the extracted characters. Character recognition technology has been continuously developed and, with the advent of the Fourth Industrial Revolution, is now used as a core technology in many places. This paper introduces the technology associated with character recognition and a program for character recognition.
2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), 2017
Intensive research is currently under way in the field of digitizing historical archives by converting scanned page images into searchable full text. anyOCR is a new OCR system which emphasizes the techniques required for digitizing a historical archive with high accuracy. It is open source, so the research community can easily apply the anyOCR system to the digitization of a historical archive. The anyOCR system can also be used for contemporary document images containing diverse, simple to complex, layouts. This paper describes the current state of the anyOCR system, its architecture, and its major features. The anyOCR system supports a complete document processing pipeline, including layout analysis, OCR model training and text line prediction, along with fast and interactive web-based services for layout and OCR error correction.
2005
This work proposes a new taxonomy and metric for classifying and counting the number of errors in the transcription performed by Optical Character Recognizers (OCRs). It also presents a comparative study on the performance of commercial OCR tools.
Optical Character Recognition (OCR) is one of the automatic identification techniques that fulfil automation needs in various applications. With OCR, a machine can read the information present in natural scenes or other materials in any form. Recognition of typed and printed characters is relatively uncomplicated because of their well-defined size and shape, whereas individual handwriting varies in these aspects, so a handwritten OCR system must learn these differences in order to recognize a character. In this paper, we discuss the various stages of text recognition, the classification of handwritten OCR systems according to text type, studies on Chinese and Arabic text recognition, and application-oriented recent research in OCR.
2008 The Eighth IAPR International Workshop on Document Analysis Systems, 2008
In this paper a complete OCR methodology for recognizing historical documents, either printed or handwritten, without any knowledge of the font is presented. The methodology consists of three steps: the first two refer to creating a database for training using a set of documents, while the third refers to recognition of new document images. First, a pre-processing step that includes image binarization and enhancement takes place. Second, a top-down segmentation approach is used to detect text lines, words and characters. A clustering scheme is then adopted to group characters of similar shape. This is a semi-automatic procedure, since the user can interact at any time to correct possible clustering errors and assign an ASCII label. After this step, a database is created for use in recognition. Finally, in the third step, the same segmentation approach is applied to every new document image, and recognition is based on the character database produced in the previous step.
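The grouping idea behind such a clustering scheme can be illustrated with a toy sketch under strong assumptions (binary glyphs normalized to equal size, a Hamming-style pixel distance, greedy assignment); the paper's actual clustering is more elaborate:

```python
def cluster_glyphs(glyphs, threshold):
    # glyphs: list of equal-length binary tuples (flattened character images).
    # Greedy agglomeration: assign each glyph to the first cluster whose
    # representative differs in at most `threshold` pixels, else start a
    # new cluster. A user would then label each cluster with an ASCII code.
    clusters = []  # each entry: (representative glyph, [member indices])
    for idx, g in enumerate(glyphs):
        for rep, members in clusters:
            if sum(a != b for a, b in zip(rep, g)) <= threshold:
                members.append(idx)
                break
        else:
            clusters.append((g, [idx]))
    return clusters
```

With a threshold of one pixel, two near-identical glyphs fall into one cluster and a dissimilar glyph starts its own, so a single manual label covers every instance of the same character shape.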
Proceedings IS&T/SPIE 20th Annual Symposium, 2008
2015
Mass digitization of historical documents is a challenging problem for optical character recognition (OCR) tools. Issues include noisy backgrounds and faded text due to aging, border/marginal noise, bleed-through, skewing, warping, as well as irregular fonts and page layouts. As a result, OCR tools often produce a large number of spurious bounding boxes (BBs) in addition to those that correspond to words in the document. This paper presents an iterative classification algorithm to automatically label BBs (i.e., as text or noise) based on their spatial distribution and geometry. The approach uses a rule-based classifier to generate initial text/noise labels for each BB, followed by an iterative classifier that refines the initial labels by incorporating information local to each BB: its spatial location, shape and size. When evaluated on a dataset containing over 72,000 manually-labeled BBs from 159 historical documents, the algorithm can classify BBs with 0.95 precision and 0.96 recall.
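The two-stage structure described above — rule-based initial labels, then iterative refinement from nearby boxes — can be sketched as follows. The thresholds and the single line-alignment rule are hypothetical placeholders; the paper's rule base and features are richer:

```python
def classify_boxes(boxes, rounds=5):
    # boxes: list of dicts with keys x, y, w, h (pixel units).
    # Stage 1: rule-based initial labels from size alone
    # (hypothetical thresholds, standing in for the paper's rule base).
    labels = ["text" if 2 <= b["h"] <= 100 and b["w"] >= 2 else "noise"
              for b in boxes]
    # Stage 2: iterative refinement. Neighbours are boxes whose vertical
    # centres lie within one box-height (a crude "same text line" test);
    # a majority vote among neighbours can flip a label.
    for _ in range(rounds):
        changed = False
        for i, b in enumerate(boxes):
            cy = b["y"] + b["h"] / 2
            neigh = [labels[j] for j, o in enumerate(boxes)
                     if j != i and abs((o["y"] + o["h"] / 2) - cy) < b["h"]]
            if neigh:
                vote = "text" if neigh.count("text") > len(neigh) / 2 else "noise"
                if vote != labels[i]:
                    labels[i], changed = vote, True
        if not changed:   # stop early once labels are stable
            break
    return labels
```

Three word-sized boxes sharing a baseline keep their "text" labels, while a one-pixel-tall speck near them stays "noise" because it has no aligned neighbours at its own scale.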
Abstract—This paper presents recent trends and tools used for feature extraction that help in efficient classification of handwritten alphabets. Numerous feature extraction models have been defined by different researchers in their respective dissertations. It is found that the use of the Euler number in addition to zoning increases the speed and the accuracy of the classifier, as it reduces the search space by dividing the character set into three groups.
ACM Computing Surveys, 2021
Optical character recognition (OCR) is one of the most popular techniques used for converting printed documents into machine-readable ones. While OCR engines can do well with modern text, their performance is unfortunately significantly reduced on historical materials. Additionally, many texts have already been processed by various out-of-date digitisation techniques. As a consequence, digitised texts are noisy and need to be post-corrected. This article clarifies the importance of enhancing quality of OCR results by studying their effects on information retrieval and natural language processing applications. We then define the post-OCR processing problem, illustrate its typical pipeline, and review the state-of-the-art post-OCR processing approaches. Evaluation metrics, accessible datasets, language resources, and useful toolkits are also reported. Furthermore, the work identifies the current trend and outlines some research directions of this field.
2021
Optical Character Recognition (OCR) is the process of converting images of text or handwritten text into a machine-understandable form. Simply put, OCR means recognizing characters and converting them into a computer-readable form. It is widely used as a form of data entry from original paper data sources such as banking papers or consultation papers, whether passport documents, invoices, statements, receipts, cards, mail or any number of printed records. It is a standard method of digitizing printed texts so that they can be electronically edited, searched, and stored more compactly. OCR is a field of research in pattern recognition, artificial intelligence and computer vision. It is the electronic translation of handwritten, typewritten or printed text into machine-encoded form, and is widely used to recognize and search text from documents or to publish text on a website. This document presents a review of Optical Character Recognition methods su...