Restoring Hebrew Diacritics Without a Dictionary

Elazar Gershuni, Yuval Pinter

Research output: Working paper/Preprint

Abstract

We demonstrate that it is feasible to diacritize Hebrew script without any human-curated resources other than plain diacritized text. We present NAKDIMON, a two-layer character-level LSTM that performs on par with much more complicated curation-dependent systems across a diverse array of modern Hebrew sources.
Original language: English
Publisher: arXiv
State: Published - 2021

Publication series

Name: arXiv preprint arXiv:2105.05209
