TY - GEN
T1 - Text Edges Guided Network for Historical Document Super Resolution
AU - Madi, Boraq
AU - Alaasam, Reem
AU - El-Sana, Jihad
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2022/1/1
Y1 - 2022/1/1
N2 - Super-resolution aims to increase the resolution and the clarity of the details in low-resolution images, and document images are no exception. Although significant improvements have been achieved in super-resolution for different domains, historical document images have not been addressed well. Most of the current works in the text domain deal with modern fonts and rely on extracting prior semantic information from a recognizer to super-resolve images. The absence of a reliable handwritten recognizer for Arabic documents, where historical documents have a complex structure and overlapping parts, makes these text-domain works inapplicable. This paper presents a Text-Attention-ed Super Resolution GAN (TASR-GAN) to address this problem. The model deals with historical Arabic documents and does not rely on prior semantic information. Since our input domain documents, text edges are essential for quality and readability; thus, we introduce a new loss function called text edge loss. This loss function provides more attention and weight to text edge information and guides through optimization to super-resolve images with accurate small regions’ details and fine edges to improve image quality. Experiments on six Arabic manuscripts show that the proposed TASR achieves state-of-the-art performance in terms of PSNR/SSIM metrics and significantly improves the visual image quality, mainly the edges of small regions details, and eliminates artifacts noises. Also, a grid search experiment has been conducted to tune the best hyperparameters values for our text edge loss function.
AB - Super-resolution aims to increase the resolution and the clarity of the details in low-resolution images, and document images are no exception. Although significant improvements have been achieved in super-resolution for different domains, historical document images have not been addressed well. Most of the current works in the text domain deal with modern fonts and rely on extracting prior semantic information from a recognizer to super-resolve images. The absence of a reliable handwritten recognizer for Arabic documents, where historical documents have a complex structure and overlapping parts, makes these text-domain works inapplicable. This paper presents a Text-Attention-ed Super Resolution GAN (TASR-GAN) to address this problem. The model deals with historical Arabic documents and does not rely on prior semantic information. Since our input domain documents, text edges are essential for quality and readability; thus, we introduce a new loss function called text edge loss. This loss function provides more attention and weight to text edge information and guides through optimization to super-resolve images with accurate small regions’ details and fine edges to improve image quality. Experiments on six Arabic manuscripts show that the proposed TASR achieves state-of-the-art performance in terms of PSNR/SSIM metrics and significantly improves the visual image quality, mainly the edges of small regions details, and eliminates artifacts noises. Also, a grid search experiment has been conducted to tune the best hyperparameters values for our text edge loss function.
KW - Generative adversarial networks
KW - Historical handwritten documents
KW - Super-resolution
UR - http://www.scopus.com/inward/record.url?scp=85144433364&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-21648-0_2
DO - 10.1007/978-3-031-21648-0_2
M3 - Conference contribution
AN - SCOPUS:85144433364
SN - 9783031216473
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 18
EP - 33
BT - Frontiers in Handwriting Recognition - 18th International Conference, ICFHR 2022, Proceedings
A2 - Porwal, Utkarsh
A2 - Fornés, Alicia
A2 - Shafait, Faisal
PB - Springer Science and Business Media Deutschland GmbH
T2 - 18th International Conference on Frontiers in Handwriting Recognition, ICFHR 2022
Y2 - 4 December 2022 through 7 December 2022
ER -