Improved Coding over Sets for DNA-Based Data Storage

Hengjia Wei, Moshe Schwartz

Research output: Contribution to journalArticlepeer-review

2 Scopus citations


Error-correcting codes over sets, with applications to DNA storage, are studied. The DNA-storage channel receives a set of sequences, and produces a corrupted version of the set, including sequence loss, symbol substitution, symbol insertion/deletion, and limited-magnitude errors in symbols. Various parameter regimes are studied. New bounds on code parameters are provided, which improve upon known bounds. New codes are constructed, at times matching the bounds up to lower-order terms or small constant factors.

Original languageEnglish
Pages (from-to)118-129
Number of pages12
JournalIEEE Transactions on Information Theory
Issue number1
StatePublished - 13 Oct 2021


  • DNA storage
  • Error-correcting codes
  • coding over sets

ASJC Scopus subject areas

  • Information Systems
  • Computer Science Applications
  • Library and Information Sciences


Dive into the research topics of 'Improved Coding over Sets for DNA-Based Data Storage'. Together they form a unique fingerprint.

Cite this