Abstract
This paper studies the problem of encoding messages into sequences that can be uniquely recovered from noisy observations of their substrings. The observed reads comprise consecutive substrings with a given minimum overlap. This coded reconstruction problem has applications in DNA storage. We consider both single-strand reconstruction codes and multi-strand reconstruction codes, where the message is encoded into a single strand or a set of multiple strands, respectively. Various parameter regimes are studied. New codes are constructed, some of whose rates asymptotically attain the upper bounds.
Original language | English |
---|---|
Pages (from-to) | 7757-7776 |
Number of pages | 20 |
Journal | IEEE Transactions on Information Theory |
Volume | 70 |
Issue number | 11 |
State | Published - 1 Jan 2024 |
Keywords
- DNA storage
- robust positioning sequences
- sequence (string) reconstruction
- substitution
- substring-distant sequences
ASJC Scopus subject areas
- Information Systems
- Computer Science Applications
- Library and Information Sciences