Towards designing an efficient crawling window to analysis and annotate changes in linked data sources

  • Authors:
  • David Urdiales-Nieto;José F. Aldana-Montes

  • Affiliations:
  • University of Malaga, Malaga, Spain;University of Malaga, Malaga, Spain

  • Venue:
  • Proceedings of the 1st International Workshop on Linked Web Data Management
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Today the popularity of data quality is increasing in linked data, and its changes are being annotated. Linked data consuming applications need to be aware of changes in a dataset. Changes such as update, remove or creation links may occur for a time so it is necessary to detect them to update local data dependencies where this annotation is made by detecting changes systems. Updated or removed links can be detected using a syntactic change similarity measure, and it is simply done using the Levenshtein distance measure. However, a specific event classification of updated event and removed event, which is created by developing detecting changes systems, does not exist based on content analysis. A Hash number comparison approach over each linked data resource is developed to create a more specific classification of the initial updated and removed event. It is computed over RDF triples and is used to enrich the resources, annotating the new classification of the initial updated event and removed event. Annotations on the modification time are made in linked data resource, and making an average time study about when these specific events change, and using a Crawling Window to analyze a portion of Linked Data graph could be improved the crawling techniques for a domain. In this paper a study based in depth of child nodes analyzed in Linked Data graph is made to estimate what is the Crawling Window design more efficient.