Linked Open Data (LOD) comprises an unprecedented volume of structured data on the Web. However, these datasets vary widely in quality, ranging from extensively curated datasets to crowdsourced or extracted data of often relatively low quality. We present a methodology for test-driven quality assessment of Linked Data, inspired by test-driven software development. We argue that vocabularies, ontologies and knowledge bases should be accompanied by a number of test cases that help ensure a basic level of quality. The methodology is based on a formalization of bad smells and data quality problems. This formalization employs SPARQL query templates, which are instantiated into concrete quality test case queries. Based on an extensive survey, we compile a comprehensive library of data quality test case patterns. We instantiate test cases automatically from schema constraints or semi-automatically enriched schemata, and allow the user to generate specific test case instantiations that are applicable to a schema or dataset. We provide an extensive evaluation of five LOD datasets, manual test case instantiation for five schemas and automatic test case instantiations for all available schemata registered with Linked Open Vocabularies (LOV). One of the main advantages of our approach is that domain-specific semantics can be encoded in the data quality test cases, making it possible to discover data quality problems beyond conventional quality heuristics.
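To make the idea of instantiating a SPARQL query template into a concrete test case query more tangible, here is a minimal sketch in Python. It is an illustration under stated assumptions, not the implementation described above: the `%%NAME%%` placeholder syntax, the pattern itself, the `instantiate` helper and the DBpedia property names used as bindings are all hypothetical choices made for this example.

```python
import re

# Hypothetical test case pattern: select every resource whose two property
# values violate an expected ordering. The %%NAME%% placeholder syntax is an
# assumption made for this sketch.
COMPARISON_PATTERN = """PREFIX dbo: <http://dbpedia.org/ontology/>
SELECT ?s WHERE {
  ?s %%P1%% ?v1 .
  ?s %%P2%% ?v2 .
  FILTER ( ?v1 %%OP%% ?v2 )
}"""

PLACEHOLDER = re.compile(r"%%(\w+)%%")


def instantiate(pattern, bindings):
    """Replace each %%NAME%% placeholder with its binding.

    Raises KeyError if the pattern contains a placeholder with no binding,
    so a partially instantiated query is never returned.
    """
    def substitute(match):
        name = match.group(1)
        if name not in bindings:
            raise KeyError(f"no binding for placeholder {name}")
        return bindings[name]

    return PLACEHOLDER.sub(substitute, pattern)


# Illustrative domain-specific binding: a birth date later than the death
# date of the same resource is reported as a violation.
test_query = instantiate(
    COMPARISON_PATTERN,
    {"P1": "dbo:birthDate", "P2": "dbo:deathDate", "OP": ">"},
)
print(test_query)
```

Each row returned by the instantiated query would be one violating resource, so an empty result corresponds to a passing test case. The same pattern can be reused with other bindings, which is how domain-specific semantics end up encoded in the tests rather than in the pattern.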