Ancient documents bleed-through evaluation and its application for predicting OCR error ratesReportar como inadecuado




Ancient documents bleed-through evaluation and its application for predicting OCR error rates - Descarga este documento en PDF. Documentación en PDF para descargar gratis. Disponible también para leer online.

1 LaBRI - Laboratoire Bordelais de Recherche en Informatique

Abstract : This article presents a way to evaluate the bleed-through defect on very old document images. We design measures to quantify and evaluate the verso ink bleeding through the paper onto the recto side. Measuring the bleed-through defect alows us to perform statistical analysis that are able to predict the feasibility of different post-scan tasks. In this article we choose to illustrate our measures by creating two OCR error rate predicting models based bleed-through evaluation. Two models are proposed, one for Abbyy FineReader ∗ which is a very power-full commercial OCR and OCRopus † which is sponsored by Google. Both prediction models appears to be very accurate when calculating various statistic indicators.





Autor: Vincent Rabeux - Nicholas Journet - Jean-Philippe Domenger -

Fuente: https://hal.archives-ouvertes.fr/



DESCARGAR PDF




Documentos relacionados