The document presents the a-i-pocoto system, which integrates automated and interactive OCR post-correction methods for improving the accuracy of OCR results on historical documents. It details the process of automatic post-correction using supervised machine learning, involving multiple OCRs and profiling for error detection, and describes an interactive tool known as Pocoto for manual corrections. Evaluation results indicate improvements in OCR word accuracy through various experimental setups, though certain steps like lexicon extension showed limited benefits and suggested training adjustments for improving decision-making in corrections.
Related topics: