Abstract
Historical manuscripts suffer from various forms of degradation such as ink bleed, fading, and physical damage, which impairs their readability and usability. This study proposes a novel image restoration framework using Diffusion Probabilistic Models (DPMs) enhanced with OCR-guided finetuning. By integrating a two-stage architecture-comprising a NAFNet-based Initial Predictor and a Diffusion-Based De-noiser-this system restores both low- and high-frequency document details. Evaluations on benchmark datasets demonstrate that the proposed method outperforms traditional techniques in terms of PSNR, SSIM, and OCR accuracy, making it a scalable and robust solution for cultural heritage preservation.
| Original language | English |
|---|---|
| Title of host publication | 35th IEEE International Workshop on Machine Learning for Signal Processing |
| Subtitle of host publication | Signal Processing in the Age of Lorge Language Models, MLSP 2025 |
| Publisher | IEEE Computer Society |
| ISBN (Electronic) | 9798331570293 |
| DOIs | |
| Publication status | Published - 2025 |
| Externally published | Yes |
| Event | 35th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2025 - Istanbul, Turkey Duration: 31 Aug 2025 → 3 Sept 2025 |
Publication series
| Name | IEEE International Workshop on Machine Learning for Signal Processing, MLSP |
|---|---|
| ISSN (Print) | 2161-0363 |
| ISSN (Electronic) | 2161-0371 |
Conference
| Conference | 35th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2025 |
|---|---|
| Country/Territory | Turkey |
| City | Istanbul |
| Period | 31/08/25 → 3/09/25 |
Bibliographical note
Publisher Copyright:© 2025 IEEE.
Keywords
- deep learning
- diffusion models
- document enhancement
- Historical manuscripts
- image restoration
- OCR