Annotation for the Deep Web

  • Authors:
  • Siegfried Handschuh;Raphael Volz;Steffen Staab

  • Affiliations:
  • University of Karlsruhe;University of Karlsruhe;University of Karlsruhe and Ontoprise GmbH

  • Venue:
  • IEEE Intelligent Systems
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

One of the core challenges of the Semantic Web is to create metadata by mass collaboration-by combining semantic content created by a large number of people. To attain this objective, researchers have developed several approaches to deal with the manual or semiautomatic creation of metadata from existing information. However, these approaches-as well as older ones that provide metadata-build on the assumption that the information sources under consideration are static, given as static HTML pages or as books in a library. Today, however, a large percentage of Web pages are not static documents. On the contrary, the majority of Web pages are dynamic. Estimates about the ratio of static to dynamic pages based on Web pages actually crawled by search engines typically conclude that dynamic Web pages outnumber static ones by 100 to 1. For dynamic Web pages (those that are generated from a database that contains a catalog of books), annotating every single page manually does not seem to be useful. Rather, it is better to annotate the database to reuse it for site-specific Semantic Web purposes. Deep annotation, an annotation process that can help with this problem, uses information proper, information structure, and information context to derive mappings between information structures.