|
Problemy Peredachi Informatsii, 2022, Volume 58, Issue 2, Pages 12–23 DOI: https://doi.org/10.31857/S0555292322020028
(Mi ppi2365)
|
|
|
|
This article is cited in 4 scientific papers (total in 4 papers)
Coding Theory
On the maximum number of non-confusable strings evolving under short tandem duplications
M. Kovačević Faculty of Technical Sciences, University of Novi Sad, Novi Sad, Serbia
DOI:
https://doi.org/10.31857/S0555292322020028
Abstract:
The set of all q-ary strings that do not contain repeated substrings of length $\leqslant3$
(i.e., that do not contain substrings of the form aa, abab,andabcabc) constitutes a code cor
recting an arbitrary number of tandem-duplication mutations of length $\leqslant3$. In other words,
any two such strings are non-confusable in the sense that they cannot produce the same string
while evolving under tandem duplications of length $\leqslant3$. We demonstrate that this code is
asymptotically optimal in terms of rate, meaning that it represents the largest set of non-con
fusable strings up to subexponential factors. This result settles the zero-error capacity problem
for the last remaining case of tandem-duplication channels satisfying the “root-uniqueness”
property.
Keywords:
tandem duplication, tandem repeat, duplication error, repetition error, sticky in sertion, DNA storage, error correction, zero-error capacity, constrained code, square-free string.
Received: 04.10.2021 Revised: 20.04.2022 Accepted: 21.04.2022
Citation:
M. Kovačević, “On the maximum number of non-confusable strings evolving under short tandem duplications”, Probl. Peredachi Inf., 58:2 (2022), 12–23; Problems Inform. Transmission, 58:2 (2022), 111–121
Linking options:
https://www.mathnet.ru/eng/ppi2365 https://www.mathnet.ru/eng/ppi/v58/i2/p12
|
| Statistics & downloads: |
| Abstract page: | 133 | | Full-text PDF : | 5 | | References: | 31 | | First page: | 22 |
|