SOLOMON: seeking the truth via copying detection

  • Authors:
  • Xin Luna Dong;Laure Berti-Equille;Yifan Hu;Divesh Srivastava

  • Affiliations:
  • AT&T Labs-Research;Universite de Rennes;AT&T Labs-Research;AT&T Labs-Research

  • Venue:
  • Proceedings of the VLDB Endowment
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We live in the Information Era, with access to a huge amount of information from a variety of data sources. However, data sources are of different qualities, often providing conflicting, out-of-date and incomplete data. Data sources can also easily copy, reformat and modify data from other sources, propagating erroneous data. These issues make the identification of high quality information and sources non-trivial. We demonstrate the Solomon system, whose core is a module that detects copying between sources. We demonstrate that we can effectively detect copying relationship between data sources, leverage the results in truth discovery, and provide a user-friendly interface to facilitate users in identifying sources that best suit their information needs.