Show simple item record

dc.contributor.authorWang, Qiwen
dc.contributor.authorJaggi, Sidharth
dc.contributor.authorMedard, Muriel
dc.contributor.authorCadambe, Viveck R
dc.contributor.authorSchwartz, Moshe
dc.date.accessioned2021-10-27T20:30:46Z
dc.date.available2021-10-27T20:30:46Z
dc.date.issued2017
dc.identifier.urihttps://hdl.handle.net/1721.1/136088
dc.description.abstract© 1963-2012 IEEE. The problem of one-way file synchronization, henceforth called 'file updates', is studied in this paper. Specifically, a client edits a file, where the edits are modeled by insertions and deletions (InDels). An old copy of the file is stored remotely at a data-centre, and is also available to the client. We consider the problem of throughput- and computationally-efficient communication from the client to the data-centre, to enable the data-centre to update its old copy to the newly edited file. Two models for the source files and edit patterns are studied: the random pre-edit sequence left-to-right random InDel (RPES-LtRRID) process, and the arbitrary pre-edit sequence arbitrary InDel (APES-AID) process. In both models, we consider the regime, in which the number of insertions and deletions is a small (but constant) fraction of the length of the original file. For both models, information-theoretic lower bounds on the best possible compression rates that enable file updates are derived (up to first order terms). Conversely, a simple compression algorithm using dynamic programming (DP) and entropy coding (EC), henceforth called DP-EC algorithm, achieves rates that are within constant additive gap (which diminishes as the alphabet size increases) to information-theoretic lower bounds for both models. For the RPES-LtRRID model, a dynamic-programming-run-length-compression (DP-RLC) algorithm is proposed, which achieves a compression rate matching the information-theoretic lower bound up to first order terms. Therefore, when the insertion and deletion probabilities are small (such that first order terms dominate), the achievable rate by DP-RLC is nearly optimal for the RPES-LtRRID model.
dc.language.isoen
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.relation.isversionof10.1109/TIT.2017.2705100
dc.rightsCreative Commons Attribution-Noncommercial-Share Alike
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/
dc.sourcearXiv
dc.titleFile Updates Under Random/Arbitrary Insertions And Deletions
dc.typeArticle
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.relation.journalIEEE Transactions on Information Theory
dc.eprint.versionOriginal manuscript
dc.type.urihttp://purl.org/eprint/type/JournalArticle
eprint.statushttp://purl.org/eprint/status/NonPeerReviewed
dc.date.updated2019-06-20T17:58:32Z
dspace.orderedauthorsWang, Q; Jaggi, S; Medard, M; Cadambe, VR; Schwartz, M
dspace.date.submission2019-06-20T17:58:33Z
mit.journal.volume63
mit.journal.issue10
mit.metadata.statusAuthority Work and Publication Information Needed


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record