The document presents a hierarchical deep supervised network (DSN) for binarizing degraded document images. DSN extends convolutional neural networks to extract hierarchical features and classify each pixel as text or background. It achieves state-of-the-art performance on several document image binarization datasets without pre- or post-processing. Subsequent research has adapted DSN to other document types, such as music scores and paychecks, and has worked to reduce its computational complexity.