Uncertain version control in open collaborative editing of tree-structured documents

  • Authors:
  • M. Lamine Ba;Talel Abdessalem;Pierre Senellart

  • Affiliations:
  • Institut Mines--Té/lé/com/ Té/lé/com ParisTech/ CNRS LTCI, Paris, France;Institut Mines--Té/lé/com/ Té/lé/com ParisTech/ CNRS LTCI, Paris, France;Té/lé/com ParisTech &/ The University of Hong Kong, Paris &/ Hong Kong, France

  • Venue:
  • Proceedings of the 2013 ACM symposium on Document engineering
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In order to ease content enrichment, exchange, and sharing, web-scale collaborative platforms such as Wikipedia or Google Docs enable unbounded interactions between a large number of contributors, without prior knowledge of their level of expertise and reliability. Version control is then essential for keeping track of the evolution of the shared content and its provenance. In such environments, uncertainty is ubiquitous due to the unreliability of the sources, the incompleteness and imprecision of the contributions, the possibility of malicious editing and vandalism acts, etc. To handle this uncertainty, we use a probabilistic XML model as a basic component of our version control framework. Each version of a shared document is represented by an XML tree and the whole document, together with its different versions, is modeled as a probabilistic XML document. Uncertainty is evaluated using the probabilistic model and the reliability measure associated to each source, each contributor, or each editing event, resulting in an uncertainty measure on each version and each part of the document. We show that standard version control operations can be implemented directly as operations on the probabilistic XML model; efficiency with respect to deterministic version control systems is demonstrated on real-world datasets.