Text line extraction using deep learning and minimal sub seams

Adi Azran, Alon Schclar, Raid Saabni

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Accurate text line extraction is a vital prerequisite for efficient and successful text recognition systems ranging from keywords/phrases searching to complete conversion to text. In many cases, the proposed algorithms target binary pre-processed versions of the image, which may cause insufficient results due to poor quality document images. Recently, more papers present solutions that work directly on gray-level images [1,2,7,12,15]. In this paper, we present a novel robust, and efficient algorithm to extract text-lines directly from gray-level document images. The proposed approach uses a combination of two variants of Convolutional Neural Network (CNNs), followed by minimal energy seam extraction. The first ConvNet is a modified version of the autoencoder used for biomedical image segmentation [8]. The second is a deep convolutional Neural Network, working on overlapping vertical slices of the original image. The two variants are combined to one neural net after re-attaching the resulting slices of the second net. The merged results of the two nets are used as a preprocessed image to obtain an energy map for a second phase. In the second step, we use the algorithm presented in [2], to track minimal energy sub-seams accumulated to perform a full local minimal/maximal separating and medial seam defining the text baselines and the text line regions. We have tested our approach on multi-lingual various datasets written at a range of image quality based on the ICDAR datasets.

Original languageEnglish
Title of host publicationDocEng 2021 - Proceedings of the 2021 ACM Symposium on Document Engineering
PublisherAssociation for Computing Machinery, Inc
ISBN (Electronic)9781450385961
DOIs
StatePublished - 16 Aug 2021
Externally publishedYes
Event21st ACM Symposium on Document Engineering, DocEng 2021 - Virtual, Online, Ireland
Duration: 24 Aug 202127 Aug 2021

Publication series

NameDocEng 2021 - Proceedings of the 2021 ACM Symposium on Document Engineering

Conference

Conference21st ACM Symposium on Document Engineering, DocEng 2021
Country/TerritoryIreland
CityVirtual, Online
Period24/08/2127/08/21

Keywords

  • convolutional neural networks
  • historical document image analysis
  • image processing
  • line extraction
  • local projection profile
  • minimal seams
  • seam carving
  • text line extraction

ASJC Scopus subject areas

  • Computer Science Applications
  • Information Systems
  • Software

Fingerprint

Dive into the research topics of 'Text line extraction using deep learning and minimal sub seams'. Together they form a unique fingerprint.

Cite this