Case study in Hebrew character searching

Irina Rabaev, Ofer Biller, Jihad El-Sana, Klara Kedem, Itshak Dinstein

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

10 Scopus citations

Abstract

Searching for a letter or a word in historical documents is a practical challenge due to the various degradations present in such documents and the wide variance of handwriting. Searching in historical Hebrew documents is somewhat harder because of the high similarity among Hebrew characters. In order to determine the features and feature combinations appropriate for recognizing Hebrew script, we study a range of known features using a Dynamic Time Warping (DTW) algorithm. In addition, we describe a novel method for feature-based searching, which uses a number of models for the same character. This method is based on our original DTW algorithm, which can match a query character against fragments of several models of the same character. Consequently, we are not limited to any particular model of the character set. Application of this method leads to a significant improvement, even when using a small set of models.
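For readers unfamiliar with the technique named in the abstract, the following is a minimal sketch of classic DTW, plus a naive baseline that compares a query against several whole models of a character and keeps the closest one. It is not the authors' fragment-matching algorithm, whose details the abstract does not give. The function names `dtw_distance` and `best_model`, the use of one scalar feature per column, and the absolute-difference local cost are all assumptions made for illustration.

```python
from math import inf


def dtw_distance(query, model):
    """Classic DTW cost between two sequences of scalar feature values.

    Assumption: each element is one feature value per image column;
    the local cost is the absolute difference between two values.
    """
    n, m = len(query), len(model)
    # cost[i][j] = cheapest cumulative cost of aligning query[:i] with model[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(query[i - 1] - model[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # query element repeats a model element
                cost[i][j - 1],      # model element repeats a query element
                cost[i - 1][j - 1],  # one-to-one step
            )
    return cost[n][m]


def best_model(query, models):
    """Naive multi-model baseline (hypothetical helper, not the paper's method).

    `models` is a list of (label, sequence) pairs; the query is compared
    against each whole model and the closest (label, distance) is returned.
    """
    scored = ((label, dtw_distance(query, seq)) for label, seq in models)
    return min(scored, key=lambda pair: pair[1])
```

In this baseline each model is matched as a whole, so a query that resembles parts of several models is still forced onto a single one. The abstract describes the proposed method as removing exactly that restriction by matching fragments of several models.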

Original language: English
Title of host publication: Proceedings - 11th International Conference on Document Analysis and Recognition, ICDAR 2011
Pages: 1080-1084
Number of pages: 5
State: Published - 2 Dec 2011
Event: 11th International Conference on Document Analysis and Recognition, ICDAR 2011 - Beijing, China
Duration: 18 Sep 2011 - 21 Sep 2011

Publication series

Name: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
ISSN (Print): 1520-5363

Conference

Conference: 11th International Conference on Document Analysis and Recognition, ICDAR 2011
Country/Territory: China
City: Beijing
Period: 18/09/11 - 21/09/11

Keywords

  • Hebrew historical documents
  • character searching
  • dynamic time warping
  • variational method
  • word spotting

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
