Data summaries for on-demand queries over linked data
Proceedings of the 19th international conference on World wide web
Towards benefit-based RDF source selection for SPARQL queries
SWIM '12 Proceedings of the 4th International Workshop on Semantic Web Information Management
Data profiling for semantic web data
WISM'12 Proceedings of the 2012 international conference on Web Information Systems and Mining
LODStats --- an extensible framework for high-performance dataset analytics
EKAW'12 Proceedings of the 18th international conference on Knowledge Engineering and Knowledge Management
Binary RDF representation for publication and exchange (HDT)
Web Semantics: Science, Services and Agents on the World Wide Web
Hi-index | 0.00 |
In this paper RDFStats is introduced, which is a generator for statistics of RDF sources like SPARQL endpoints and RDF documents. RDFStats does not only provide a statistics generator, but also a powerful API for persisting and accessing statistics including several estimation functions that also support SPARQL filter-like expressions. For many Semantic Web applications like the Semantic Web Integrator and Query Engine (SemWIQ), which is currently developed at the University of Linz, detailed statistics about the contents of RDF data sources are very important. RDFStats has been primarily designed and implemented for the SemWIQ federator and optimizer, but it can also be used for other applications like linked data browsers, aggregators, or visualization tools. It is based on the popular Semantic Web framework Jena developed by HP Labs Bristol and can be easily extended and integrated into other applications.