TY - GEN
T1 - Case study in Hebrew character searching
AU - Rabaev, Irina
AU - Biller, Ofer
AU - El-Sana, Jihad
AU - Kedem, Klara
AU - Dinstein, Itshak
PY - 2011/12/2
Y1 - 2011/12/2
N2 - Searching for a letter or a word in historical documents is a practical challenge due to the various degradations present in such documents and the wide variance of handwriting. Searching in historical Hebrew documents is somewhat harder because of high similarities among Hebrew characters. In order to determine the features and their combinations appropriate for recognizing Hebrew script, we study a range of known features using a Dynamic Time Warping algorithm. In addition we describe a novel meth od for feature-based searching, which uses a number of models for the same character. This method is based on our original DTW algorithm that can match fragments of several models of the same character to match a query character. Consequently, we are not limited to any particular model of the character set. Application of this method leads to a significant improvement, even when using a small set of models.
AB - Searching for a letter or a word in historical documents is a practical challenge due to the various degradations present in such documents and the wide variance of handwriting. Searching in historical Hebrew documents is somewhat harder because of high similarities among Hebrew characters. In order to determine the features and their combinations appropriate for recognizing Hebrew script, we study a range of known features using a Dynamic Time Warping algorithm. In addition we describe a novel meth od for feature-based searching, which uses a number of models for the same character. This method is based on our original DTW algorithm that can match fragments of several models of the same character to match a query character. Consequently, we are not limited to any particular model of the character set. Application of this method leads to a significant improvement, even when using a small set of models.
KW - Hebrew historical documents
KW - character searching
KW - dynamic time warping
KW - variational method
KW - word spotting
UR - http://www.scopus.com/inward/record.url?scp=82355160630&partnerID=8YFLogxK
U2 - 10.1109/ICDAR.2011.218
DO - 10.1109/ICDAR.2011.218
M3 - Conference contribution
AN - SCOPUS:82355160630
SN - 9780769545202
T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
SP - 1080
EP - 1084
BT - Proceedings - 11th International Conference on Document Analysis and Recognition, ICDAR 2011
T2 - 11th International Conference on Document Analysis and Recognition, ICDAR 2011
Y2 - 18 September 2011 through 21 September 2011
ER -