• Ahmad Droby (Creator)
  • Daria Vasyutinsky Shapira (Creator)
  • Irina Rabaev (Creator)
  • Berat Kurar Bakarat (Creator)
  • Jihad El Sana (Creator)



The VML-HP-ext collection contains 715 page images excerpted from 171 different manuscripts covering 14 medieval writing Hebrew styles, accompanied by their hard and soft GT labels. 

 We also provide the official split of the VML-HP-ext into training, typical test, and blind test sets. 
The typical test set includes unseen pages of the manuscripts from the training set. While training and typical test sets are disjoint on the page level, they do share the same set of manuscripts. Therefore, we also provide the blind test set, which consists of manuscripts that do not appear in the training set. The blind test set imitates a real-life scenario, where scholars would like to obtain a classification for a previously unseen document.
Date made available2022

Cite this