This document explores the impact of crowdsourcing OCR improvements on retrievability bias, examining how OCR quality relates to document retrieval effectiveness. It discusses findings from a large-scale study involving historical newspapers, highlighting that correction of OCR errors can significantly enhance the retrieval of documents while lowering inequalities in retrieval scores. The study identifies both direct and indirect impacts of OCR error correction on retrieval outcomes and emphasizes the importance of query design in evaluating these effects.
Related topics: