Text line segmentation for gray scale historical document images

Abedelkadir Asi, Raid Saabni, Jihad El-Sana

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

20 Scopus citations

Abstract

In this paper we present a new approach for text line segmentation that works directly on gray-scale document images. Our algorithm constructs distance transform directly on the gray-scale images, which is used to compute two types of seams: medial seams and separating seams. A medial seam is a chain of pixels that crosses the text area of a text line and a separating seam is a path that passes between two consecutive rows. The medial seam determines a text line and the separating seams define the upper and lower boundaries of the text line. The medial and separating seams propagate according to energy maps, which are defined based on the constructed distance transform. We have performed various experimental results on different datasets and received encouraging results.

Original languageEnglish
Title of host publicationHIP'11 - Proceedings of the 2011 Workshop on Historical Document Imaging and Processing
Pages120-126
Number of pages7
DOIs
StatePublished - 25 Oct 2011
Event1st International Workshop on Historical Document Imaging and Processing, HIP'11, Held in Conjunction with ICDAR 2011 - Beijing, China
Duration: 16 Sep 201117 Sep 2011

Publication series

NameACM International Conference Proceeding Series

Conference

Conference1st International Workshop on Historical Document Imaging and Processing, HIP'11, Held in Conjunction with ICDAR 2011
Country/TerritoryChina
CityBeijing
Period16/09/1117/09/11

Keywords

  • dynamic programming
  • handwriting
  • line extraction
  • multilingual
  • seam carving
  • signed distance transform

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Text line segmentation for gray scale historical document images'. Together they form a unique fingerprint.

Cite this