Geometry Restoration and Dewarping of Camera-Captured Document Images

Abstract

This research focuses on developing a method for restoring the topology ofdigital images of paper documents captured by a camera, using algorithms fordetection, segmentation, geometry restoration, and dewarping. Our methodologyemploys deep learning (DL) for document outline detection, followed by computervision (CV) to create a topological 2D grid using cubic polynomialinterpolation and correct nonlinear distortions by remapping the image. Usingclassical CV methods makes the document topology restoration process moreefficient and faster, as it requires significantly fewer computationalresources and memory. We developed a new pipeline for automatic documentdewarping and reconstruction, along with a framework and annotated dataset todemonstrate its efficiency. Our experiments confirm the promise of ourmethodology and its superiority over existing benchmarks (including mobile appsand popular DL solutions, such as RectiNet, DocGeoNet, and DocTr++) bothvisually and in terms of document readability via Optical Character Recognition(OCR) and geometry restoration metrics. This paves the way for creatinghigh-quality digital copies of paper documents and enhancing the efficiency ofOCR systems. Project page: https://github.com/HorizonParadox/DRCCBI

Quick Read (beta)

loading the full paper ...