Skip to main navigation Skip to search Skip to main content

Efficient recognition of machine printed Arabic text using partial segmentation and Hausdorff distance

  • Raid Saabni

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

There is an urgent need for reliable and efficient systems for off-line automatic reading of machine printed Arabic texts. A partial list of applications that may use such system includes searching and reading in scanned books and manuscript as a part of digital libraries; recognizing text on digitized maps, vehicle license plates, road signs and others. In this research we aim to contribute to the research of recognizing Arabic machine printed texts using a partial segmentation process and Hausdorff distance. The process analyses the layout of the image and segments it to words and Parts of Words (PAWs). The Stroke Width Transform (SWT) is used to calculate the size and the font in order to define a set of multi size sliding windows to search and identify characters within the given shape of a PAW. The process evaluates the similarity of the two sub images (character and sliding window) using Hausdorff distance. The top k - ranked candidates and their places within the PAW are recorded and used to generate a list of full PAWs images. In the next step elements of this list are matched to the given shape in a holistic manner. We have tested our approach using the APTI, the PATS- A01 data sets and a private collection of text images and encouraging results were obtained.

Original languageEnglish
Title of host publication6th International Conference on Soft Computing and Pattern Recognition, SoCPaR 2014
PublisherInstitute of Electrical and Electronics Engineers
Pages284-289
Number of pages6
ISBN (Electronic)9781479959341
DOIs
StatePublished - 12 Jan 2014
Externally publishedYes
Event6th International Conference on Soft Computing and Pattern Recognition, SoCPaR 2014 - Tunis, Tunisia
Duration: 11 Aug 201414 Aug 2014

Publication series

Name6th International Conference on Soft Computing and Pattern Recognition, SoCPaR 2014

Conference

Conference6th International Conference on Soft Computing and Pattern Recognition, SoCPaR 2014
Country/TerritoryTunisia
CityTunis
Period11/08/1414/08/14

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Arabic OCR
  • Hausdorff distance
  • Partial segmentation

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Vision and Pattern Recognition
  • Software

Fingerprint

Dive into the research topics of 'Efficient recognition of machine printed Arabic text using partial segmentation and Hausdorff distance'. Together they form a unique fingerprint.

Cite this