Author(s)Demaine, Erik D.; Panchekha, Pavel; Wilson, David A.; Yang, Edward Z.
MetadataShow full item record
We consider the problem of merging individual text documents, motivated by the single-file merge algorithms of document-based version control systems. Abstracting away the merging of conflicting edits to an external conflict resolution function (possibly implemented by a human), we consider the efficient identification of conflicting regions. We show how to implement tree-based document representation to quickly answer a data structure inspired by the “blame” query of some version control systems. A “blame” query associates every line of a document with the revision in which it was last edited. Our tree uses this idea to quickly identify conflicting edits. We show how to perform a merge operation in time proportional to the sum of the logarithms of the shared regions of the documents, plus the cost of conflict resolution. Our data structure is functional and therefore confluently persistent, allowing arbitrary version DAGs as in real version-control systems. Our results rely on concurrent traversal of two trees with short circuiting when shared subtrees are encountered.
DepartmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Algorithms and Data Structures
Springer-Verlag Berlin Heidelberg
Demaine, Erik D., Pavel Panchekha, David A. Wilson, and Edward Z. Yang. “Blame Trees.” In Algorithms and Data Structures: 13th International Symposium, WADS 2013, London, ON, Canada, August 12-14, 2013. Proceedings (Lecture Notes in Computer Science; vol. 8037) (2013): 280–290.
Author's final manuscript