The design and implementation of the redland RDF application framework
Proceedings of the 10th international conference on World Wide Web
RDFStats - An Extensible RDF Statistics Generator and Library
DEXA '09 Proceedings of the 2009 20th International Workshop on Database and Expert Systems Application
Discovering and Maintaining Links on the Web of Data
ISWC '09 Proceedings of the 8th International Semantic Web Conference
Streaming SPARQL extending SPARQL to process data streams
ESWC'08 Proceedings of the 5th European semantic web conference on The semantic web: research and applications
Querying RDF streams with C-SPARQL
ACM SIGMOD Record
EP-SPARQL: a unified language for event processing and stream reasoning
Proceedings of the 20th international conference on World wide web
LIMES: a time-efficient approach for large-scale link discovery on the web of data
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Generating a linked soccer dataset
Proceedings of the 9th International Conference on Semantic Systems
Web Semantics: Science, Services and Agents on the World Wide Web
Test-driven evaluation of linked data quality
Proceedings of the 23rd international conference on World wide web
Hi-index | 0.00 |
One of the major obstacles for a wider usage of web data is the difficulty to obtain a clear picture of the available datasets. In order to reuse, link, revise or query a dataset published on the Web it is important to know the structure, coverage and coherence of the data. In order to obtain such information we developed LODStats --- a statement-stream-based approach for gathering comprehensive statistics about datasets adhering to the Resource Description Framework (RDF). LODStats is based on the declarative description of statistical dataset characteristics. Its main advantages over other approaches are a smaller memory footprint and significantly better performance and scalability. We integrated LODStats with the CKAN dataset metadata registry and obtained a comprehensive picture of the current state of a significant part of the Data Web.