Public data integration with WebSmatch

  • Authors:
  • Remi Coletta;Emmanuel Castanier;Patrick Valduriez;Christian Frisch;DuyHoa Ngo;Zohra Bellahsene

  • Affiliations:
  • INRIA and LIRMM, Montpellier, France;INRIA and LIRMM, Montpellier, France;INRIA and LIRMM, Montpellier, France;Data Publica, Paris, France;INRIA and LIRMM, Montpellier, France;INRIA and LIRMM, Montpellier, France

  • Venue:
  • Proceedings of the First International Workshop on Open Data
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Integrating open data sources can yield high value information but raises major problems in terms of metadata extraction, data source integration and visualization of integrated data. In this paper, we describe WebSmatch, a flexible environment for Web data integration, based on a real, end-to-end data integration scenario over public data from Data Publica. WebSmatch supports the full process of importing, refining and integrating data sources and uses third party tools for high quality visualization. We use a typical scenario of public data integration which involves problems not solved by currents tools: poorly structured input data sources (XLS files) and rich visualization of integrated data.