On the Topology of the Web of Data

  • Authors: Markus Luczak-Rösch; Robert Tolksdorf
  • Affiliations: Freie Universität Berlin, Berlin, Germany (both authors)
  • Venue: Proceedings of the 24th ACM Conference on Hypertext and Social Media
  • Year: 2013

Abstract

The Web of Data consists of the openly accessible structured data on the Web. This includes the growing number of Linked Open Data data sets, but also the structured data embedded in Web pages. In this paper we address questions related to a unified definition of distinct data sets and to the factors that influence different network representations of structured Web data. The contributions are (1) an algorithm to generate a data set linking structure from the embedded structured data drawn from (a) the Billion Triples Challenge corpus, (b) the Web Data Commons corpus, and (c) the Sindice crawl, (2) a discussion of the issue of identifying distinct data sets in a generic fashion, and (3) a high-level visual abstraction of the current Web of Data topology.
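The abstract does not spell out the linking algorithm, so the following is only a minimal sketch of what generating a data set linking structure could look like, not the authors' method. It assumes that a data set can be identified by the host name of a URI (one possible answer to the data set identification question raised in contribution 2) and that the input is a stream of subject, predicate, object triples. The names `dataset_of` and `dataset_links` are hypothetical.

```python
from collections import Counter
from urllib.parse import urlparse


def dataset_of(term):
    """Map an RDF term to a data set identifier.

    Assumption: the host name of a URI stands in for its data set.
    Literals and malformed URIs yield None.
    """
    try:
        host = urlparse(term).netloc.lower()
    except ValueError:
        return None
    return host or None


def dataset_links(triples):
    """Count directed links between distinct data sets.

    `triples` is an iterable of (subject, predicate, object) strings.
    A link is counted when subject and object resolve to different
    data sets; the predicate is ignored in this sketch.
    """
    links = Counter()
    for subject, _predicate, obj in triples:
        source = dataset_of(subject)
        target = dataset_of(obj)
        if source and target and source != target:
            links[(source, target)] += 1
    return links
```

The resulting counter is a weighted, directed edge list that a graph library could take as input for visualization. Choosing a different identification rule, for example grouping by pay-level domain instead of full host name, changes the resulting network, which is the kind of sensitivity the paper sets out to discuss.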