CaltechAUTHORS
  A Caltech Library Service

Noise and Uncertainty in String-Duplication Systems

Jain, Siddharth and Farnoud (Hassanzadeh), Farzad and Schwartz, Moshe and Bruck, Jehoshua (2017) Noise and Uncertainty in String-Duplication Systems. California Institute of Technology , Pasadena, CA. (Unpublished) https://resolver.caltech.edu/CaltechAUTHORS:20170119-133807104

[img] PDF - Submitted Version
See Usage Policy.

173kB

Use this Persistent URL to link to this item: https://resolver.caltech.edu/CaltechAUTHORS:20170119-133807104

Abstract

Duplication mutations play a critical role in the generation of biological sequences. Simultaneously, they have a deleterious effect on data stored using in-vivo DNA data storage. While duplications have been studied both as a sequence-generation mechanism and in the context of error correction, for simplicity these studies have not taken into account the presence of other types of mutations. In this work, we consider the capacity of duplication mutations in the presence of point-mutation noise, and so quantify the generation power of these mutations. We show that if the number of point mutations is vanishingly small compared to the number of duplication mutations of a constant length, the generation capacity of these mutations is zero. However, if the number of point mutations increases to a constant fraction of the number of duplications, then the capacity is nonzero. Lower and upper bounds for this capacity are also presented. Another problem that we study is concerned with the mismatch between code design and channel in data storage in the DNA of living organisms with respect to duplication mutations. In this context, we consider the uncertainty of such a mismatched coding scheme measured as the maximum number of input codewords that can lead to the same output.


Item Type:Report or Paper (Technical Report)
Related URLs:
URLURL TypeDescription
http://www.paradise.caltech.edu/papers/etr134.pdfAuthorReport
http://resolver.caltech.edu/CaltechAUTHORS:20170816-165117076Related ItemPublished Version
ORCID:
AuthorORCID
Jain, Siddharth0000-0002-9164-6119
Farnoud (Hassanzadeh), Farzad0000-0002-8684-4487
Schwartz, Moshe0000-0002-1449-0026
Bruck, Jehoshua0000-0001-8474-0812
Additional Information:This work was supported in part by the NSF Expeditions in Computing Program (The Molecular Programming Project).
Group:Parallel and Distributed Systems Group
Funders:
Funding AgencyGrant Number
NSFUNSPECIFIED
Other Numbering System:
Other Numbering System NameOther Numbering System ID
PARADISEetr134
Record Number:CaltechAUTHORS:20170119-133807104
Persistent URL:https://resolver.caltech.edu/CaltechAUTHORS:20170119-133807104
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:73557
Collection:CaltechPARADISE
Deposited By:INVALID USER
Deposited On:19 Jan 2017 21:44
Last Modified:18 Aug 2021 01:08

Repository Staff Only: item control page