TY - GEN
T1 - Language-independent text lines extraction using seam carving
AU - Saabni, Raid
AU - El-Sana, Jihad
PY - 2011/12/2
Y1 - 2011/12/2
N2 - In this paper, we present a novel language-independent algorithm for extracting text-lines from handwritten document images. Our algorithm is based on the seam carving approach for content aware image resizing. We adopted the signed distance transform to generate the energy map, where extreme points indicate the layout of text-lines. Dynamic programming is then used to compute the minimum energy left-to-right paths (seams), which pass along the ''middle'' of the text-lines. Each path intersects a set of components, which determine the extracted text-line and estimate its hight. The estimated hight determines the text-line's region, which guides splitting touching components among consecutive lines. Unassigned components that fall within the region of a text-line are added to the components list of the line. The components between two consecutive lines are processed when the two lines are extracted and assigned to the closest text-line, based on the attributes of extracted lines, the sizes and positions of components. Our experimental results on Arabic, Chinese, and English historical documents show that our approach manage to separate multi-skew text blocks into lines at high success rates.
AB - In this paper, we present a novel language-independent algorithm for extracting text-lines from handwritten document images. Our algorithm is based on the seam carving approach for content aware image resizing. We adopted the signed distance transform to generate the energy map, where extreme points indicate the layout of text-lines. Dynamic programming is then used to compute the minimum energy left-to-right paths (seams), which pass along the ''middle'' of the text-lines. Each path intersects a set of components, which determine the extracted text-line and estimate its hight. The estimated hight determines the text-line's region, which guides splitting touching components among consecutive lines. Unassigned components that fall within the region of a text-line are added to the components list of the line. The components between two consecutive lines are processed when the two lines are extracted and assigned to the closest text-line, based on the attributes of extracted lines, the sizes and positions of components. Our experimental results on Arabic, Chinese, and English historical documents show that our approach manage to separate multi-skew text blocks into lines at high success rates.
KW - Dynamic programming
KW - Handwriting
KW - Line Extraction
KW - Multilingual
KW - Seam Carving
KW - Signed Distance Transform
UR - http://www.scopus.com/inward/record.url?scp=82355182464&partnerID=8YFLogxK
U2 - 10.1109/ICDAR.2011.119
DO - 10.1109/ICDAR.2011.119
M3 - Conference contribution
AN - SCOPUS:82355182464
SN - 9780769545202
T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
SP - 563
EP - 568
BT - Proceedings - 11th International Conference on Document Analysis and Recognition, ICDAR 2011
T2 - 11th International Conference on Document Analysis and Recognition, ICDAR 2011
Y2 - 18 September 2011 through 21 September 2011
ER -