TY - GEN
T1 - Unsupervised Learning of Text Line Segmentation by Differentiating Coarse Patterns
AU - Kurar Barakat, Berat
AU - Droby, Ahmad
AU - Saabni, Raid
AU - El-Sana, Jihad
N1 - Funding Information:
Acknowledgment. The authors would like to thank Gunes Cevik and Hamza Barakat for helping in data preparation. This research was partially supported by The Frankel Center for Computer Science at Ben-Gurion University.
Publisher Copyright:
© 2021, Springer Nature Switzerland AG.
PY - 2021/9/2
Y1 - 2021/9/2
N2 - Despite recent advances in the field of supervised deep learning for text line segmentation, unsupervised deep learning solutions are beginning to gain popularity. In this paper, we present an unsupervised deep learning method that embeds document image patches to a compact Euclidean space where distances correspond to a coarse text line pattern similarity. Once this space has been produced, text line segmentation can be easily implemented using standard techniques with the embedded feature vectors. To train the model, we extract random pairs of document image patches with the assumption that neighbour patches contain a similar coarse trend of text lines, whereas if one of them is rotated, they contain different coarse trends of text lines. Doing well on this task requires the model to learn to recognize the text lines and their salient parts. The benefit of our approach is zero manual labelling effort. We evaluate the method qualitatively and quantitatively on several variants of text line segmentation datasets to demonstrate its effectivity.
AB - Despite recent advances in the field of supervised deep learning for text line segmentation, unsupervised deep learning solutions are beginning to gain popularity. In this paper, we present an unsupervised deep learning method that embeds document image patches to a compact Euclidean space where distances correspond to a coarse text line pattern similarity. Once this space has been produced, text line segmentation can be easily implemented using standard techniques with the embedded feature vectors. To train the model, we extract random pairs of document image patches with the assumption that neighbour patches contain a similar coarse trend of text lines, whereas if one of them is rotated, they contain different coarse trends of text lines. Doing well on this task requires the model to learn to recognize the text lines and their salient parts. The benefit of our approach is zero manual labelling effort. We evaluate the method qualitatively and quantitatively on several variants of text line segmentation datasets to demonstrate its effectivity.
KW - Text line detection
KW - Text line extraction
KW - Text line segmentation
KW - Unsupervised deep learning
UR - http://www.scopus.com/inward/record.url?scp=85115306141&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-86331-9_34
DO - 10.1007/978-3-030-86331-9_34
M3 - Conference contribution
AN - SCOPUS:85115306141
SN - 9783030863302
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 523
EP - 537
BT - Document Analysis and Recognition – ICDAR 2021 - 16th International Conference, Proceedings
A2 - Lladós, Josep
A2 - Lopresti, Daniel
A2 - Uchida, Seiichi
PB - Springer Science and Business Media Deutschland GmbH
T2 - 16th International Conference on Document Analysis and Recognition, ICDAR 2021
Y2 - 5 September 2021 through 10 September 2021
ER -