[Audio] Multimodal Learning and Applications Workshop DocRevive: A Unified Pipeline for Document Text Restoration Ayan Banerjee Kunal Purkayastha Josep Lladós Umapada Pal.
[Audio] Overview DocRevive restores missing and occluded text in documents through a unified pipeline of detection, understanding, and generative text editing. DocRevive 1.
[Audio] The Problem Challenges Diverse occlusion types and severities Missing text cannot be read by OCR Existing methods are task specific and limited Lack of comprehensive benchmark Solution We need a unified solution that detects, understands, and restores missing text accurately. DocRevive 2.
[Audio] Our Solution DocRevive Pipeline An end-to-end framework that restores occluded documents by combining detection, language understanding, and generative text editing. OCR & Line grouping Occlusion Detection Blank Extraction Missing Text Prediction Text Editing (diffusion) Composting & Cleanup YOLOv9 detects occluded regions Diffusion model edits text in style of document Detect text and organize into lines Find blank region based on the geometry RoBERTa predicts missing text from context Blend patches and clean occlusion patches One unified pipeline for accurate and consistent restoration DocRevive 3.
[Audio] Benchmark: OPRB Occluded Pages Restoration Benchmark: The first comprehensive dataset for document restoration with 6 occlusion classes and 3 severity levels. Black Ink Burnt Dust Scribble Stamp 22,368 images 289,536 oriented word boxes DocRevive 4.
[Audio] Performance Analysis DocRevive outperforms leading baselines across OCR and restoration metrics on the OPRB benchmark. Best OCR Accuracy and Lowest Edit Distance across all occlusion levels. DocRevive 5.
[Audio] Takeaway and Future Scope Impact Future Directions ➢ Restores readability of damaged and occluded documents. ➔ Extend to more degradation types and cross-domain generalization. ➢ Boosts OCR and downstream NLP performance. ➔ Improve multilingual and handwritten document restoration. ➢ Preserves historical, legal and archival content. ➔ Deploy in real-world systems for archives, libraries and enterprises. ➢ Enable scalable solutions for real-world applications. DocRevive brings us closer to fully readable, trustworthy digital documents - no matter how damaged. DocRevive 6.
DocRevive.