Document image binarization using "multi-scale" predefined filters

  • Raid M. Saabni

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

Abstract

Reading text or searching for keywords within a historical document is a very challenging task. One of the first steps of this task is binarization, in which the foreground (text, figures and drawings) is separated from the background. The quality of this step often determines whether the subsequent steps succeed or fail, so it is vital to the complete task of reading and analyzing the content of a document image. Historical document images are generally of poor quality because of their storage conditions and degradation over time, which mostly cause varying contrast, stains, dirt and ink seeping through from the reverse side. In this paper, we use banks of predefined anisotropic filters at different scales and orientations to develop a binarization method for degraded documents and manuscripts. Exploiting the fact that handwritten strokes may follow different scales and orientations, we use predefined filter banks with various scales, weights and orientations to seek a compact set of filters and weights that generates different layers of foreground and background. The results of convolving these filters locally with the gray-level image are weighted and accumulated to enhance the original image. Based on the different layers, seeds of components in the gray-level image and a learning process, we present an improved binarization algorithm that separates the background from the layers of foreground. Foreground layers that may be caused by seeping ink, degradation or other factors are separated from the real foreground in a second phase. Promising experimental results were obtained on the DIBCO2011, DIBCO2013 and H-DIBCO2016 data sets and on a collection of images taken from real historical documents.
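The filter-bank idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the kernel shape (a Gaussian along the stroke with a negated second derivative of a Gaussian across it), the uniform weights, the 3:1 elongation, the relative global threshold, and all function and parameter names (`anisotropic_kernel`, `convolve_same`, `binarize`, `rel_threshold`) are assumptions chosen for illustration. The abstract's learned weights, component seeds and second-phase separation of seeping ink are not modeled.

```python
import numpy as np

def anisotropic_kernel(sigma_along, sigma_across, theta):
    """Oriented, zero-mean line filter (assumed form, not taken from the paper)."""
    half = int(np.ceil(3 * max(sigma_along, sigma_across)))
    y, x = np.mgrid[-half:half + 1, -half:half + 1].astype(float)
    # Rotate coordinates: u runs along the stroke direction, v across it.
    u = x * np.cos(theta) + y * np.sin(theta)
    v = -x * np.sin(theta) + y * np.cos(theta)
    g = np.exp(-u**2 / (2 * sigma_along**2) - v**2 / (2 * sigma_across**2))
    k = (1 - v**2 / sigma_across**2) * g   # ridge profile across the stroke
    k -= k.mean()                          # flat regions give zero response
    return k / np.abs(k).sum()

def convolve_same(image, kernel):
    """'Same'-size 2D convolution via FFT, with edge-replicated borders."""
    kh, kw = kernel.shape
    padded = np.pad(image, ((kh // 2, kh // 2), (kw // 2, kw // 2)), mode="edge")
    shape = (padded.shape[0] + kh - 1, padded.shape[1] + kw - 1)
    spectrum = np.fft.rfft2(padded, shape) * np.fft.rfft2(kernel, shape)
    full = np.fft.irfft2(spectrum, shape)
    return full[kh - 1:kh - 1 + image.shape[0], kw - 1:kw - 1 + image.shape[1]]

def binarize(gray, scales=(1.0, 2.0), n_orient=6, weights=None, rel_threshold=0.5):
    """Return a boolean mask (True = foreground) for a gray image in [0, 1]."""
    ink = 1.0 - gray                       # dark strokes become bright ridges
    thetas = np.arange(n_orient) * np.pi / n_orient
    layers = []
    for s in scales:
        # One layer per scale: strongest response over all orientations.
        per_orient = [convolve_same(ink, anisotropic_kernel(3 * s, s, t))
                      for t in thetas]
        layers.append(np.max(per_orient, axis=0))
    layers = np.stack(layers)
    if weights is None:                    # uniform stand-in for learned weights
        weights = np.full(len(scales), 1.0 / len(scales))
    enhanced = np.tensordot(weights, layers, axes=1)  # weighted accumulation
    return enhanced > rel_threshold * enhanced.max()
```

Calling `binarize(gray)` on a gray-level image scaled to [0, 1] returns a mask of the same shape. Because each kernel is zero-mean, a uniformly lit background contributes nothing to the accumulated response, which is one plausible reason oriented filter banks suit unevenly degraded pages.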

Original language: English
Title of host publication: Ninth International Conference on Graphic and Image Processing, ICGIP 2017
Editors: Hui Yu, Junyu Dong
Publisher: SPIE
ISBN (Electronic): 9781510617414
DOIs
State: Published - 1 Jan 2018
Externally published: Yes
Event: 9th International Conference on Graphic and Image Processing, ICGIP 2017 - Qingdao, China
Duration: 14 Oct 2017 to 16 Oct 2017

Publication series

Name: Proceedings of SPIE - The International Society for Optical Engineering
Volume: 10615
ISSN (Print): 0277-786X
ISSN (Electronic): 1996-756X

Conference

Conference: 9th International Conference on Graphic and Image Processing, ICGIP 2017
Country/Territory: China
City: Qingdao
Period: 14/10/17 to 16/10/17

Keywords

  • "Multi-scale" filters
  • Ada-Boosting
  • Binarization
  • Document Image Analysis

ASJC Scopus subject areas

  • Electronic, Optical and Magnetic Materials
  • Condensed Matter Physics
  • Computer Science Applications
  • Applied Mathematics
  • Electrical and Electronic Engineering
