TY - GEN
T1 - Robust text and drawing segmentation algorithm for historical documents
AU - Cohen, Rafi
AU - Asi, Abedelkadir
AU - Kedem, Klara
AU - El-Sana, Jihad
AU - Dinstein, Itshak
PY - 2013/12/23
Y1 - 2013/12/23
N2 - We present a method to segment historical document images into regions of different content. First, we segment text elements from non-text elements using a binarized version of the document. Then, we refine the segmentation of the non-text regions into drawings, background and noise. At this stage, spatial and color features are exploited to guarantee coherent regions in the final segmentation. Experiments show that the suggested approach achieves better segmentation quality with respect to other methods. We examine the segmentation quality on 252 pages of a historical manuscript, for which the suggested method achieves about 92% and 90% segmentation accuracy of drawings and text elements, respectively.
KW - CRF
KW - Historical documents
KW - Layout
KW - Segmentation
KW - Superpixel
UR - http://www.scopus.com/inward/record.url?scp=84890491334&partnerID=8YFLogxK
U2 - 10.1145/2501115.2501117
DO - 10.1145/2501115.2501117
M3 - Conference contribution
AN - SCOPUS:84890491334
SN - 9781450321150
T3 - ACM International Conference Proceeding Series
SP - 110
EP - 117
BT - HIP 2013 - Proceedings of the 2013 Workshop on Historical Document Imaging and Processing
T2 - 2nd International Workshop on Historical Document Imaging and Processing, HIP 2013
Y2 - 24 August 2013 through 24 August 2013
ER -