RDFStats - An Extensible RDF Statistics Generator and Library

  • Authors:
  • Andreas Langegger;Wolfram Woss

  • Affiliations:
  • -;-

  • Venue:
  • DEXA '09 Proceedings of the 2009 20th International Workshop on Database and Expert Systems Application
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper RDFStats is introduced, which is a generator for statistics of RDF sources like SPARQL endpoints and RDF documents. RDFStats does not only provide a statistics generator, but also a powerful API for persisting and accessing statistics including several estimation functions that also support SPARQL filter-like expressions. For many Semantic Web applications like the Semantic Web Integrator and Query Engine (SemWIQ), which is currently developed at the University of Linz, detailed statistics about the contents of RDF data sources are very important. RDFStats has been primarily designed and implemented for the SemWIQ federator and optimizer, but it can also be used for other applications like linked data browsers, aggregators, or visualization tools. It is based on the popular Semantic Web framework Jena developed by HP Labs Bristol and can be easily extended and integrated into other applications.