With recognition, there are two important variables affecting success rates that are specific to a customer: errors and rejects. Errors are issues involved with false data recognition
This blog post, perhaps a slap in the face, is about why you can over clean scanned images, to the point where your recognition accuracy decreases...If the accuracy was better ( i.e. correct recognition and percentage of uncertainty decreased ), then it was implemented
This is especially important for forms processing / OCR applications in order to improve character recognition...This will increase the recognition rate of OCR
The reason for this are, these applications while almost always produce a result that is good for viewing, almost also ruin the fonts for recognition. Sad but true
This is what the scanning industry calls “Intelligent Document Recognition” or IDR