Integrating information extraction and automatic hyperlinking

  • Authors:
  • Stephan Busemann;Witold Drożdżyński;Hans-Ulrich Krieger;Jakub Piskorski;Ulrich Schäfer;Hans Uszkoreit;Feiyu Xu

  • Affiliations:
  • German Research Center for Artificial Intelligence (DFKI GmbH), Saarbrücken, Germany;German Research Center for Artificial Intelligence (DFKI GmbH), Saarbrücken, Germany;German Research Center for Artificial Intelligence (DFKI GmbH), Saarbrücken, Germany;German Research Center for Artificial Intelligence (DFKI GmbH), Saarbrücken, Germany;German Research Center for Artificial Intelligence (DFKI GmbH), Saarbrücken, Germany;German Research Center for Artificial Intelligence (DFKI GmbH), Saarbrücken, Germany;German Research Center for Artificial Intelligence (DFKI GmbH), Saarbrücken, Germany

  • Venue:
  • ACL '03 Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics - Volume 2
  • Year:
  • 2003

Abstract

This paper presents a novel information system that integrates advanced information extraction technology with automatic hyperlinking. Extracted entities are mapped into a domain ontology that relates concepts to a selection of hyperlinks. For information extraction, we use SProUT, a generic platform for the development and use of multilingual text processing components. By combining finite-state and unification-based formalisms, the grammar formalism used in SProUT offers both processing efficiency and a high degree of declarativeness. The ExtraLink demo system showcases the extraction of relevant concepts from German texts in the tourism domain and offers direct connections to associated web documents on demand.
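
The mapping step described in the abstract (extracted entities, then ontology concepts, then hyperlinks) can be illustrated with a minimal sketch. It is not SProUT's or ExtraLink's actual interface: the `Entity` class, the two lookup tables, the `link_entities` function, the sample sentence, and the example.org URLs are all hypothetical names and data chosen for illustration, and the entities are assumed to have already been produced by an extraction component.

```python
from dataclasses import dataclass

# Hypothetical toy ontology: each concept is related to a selection of
# hyperlinks. The URLs are placeholders, not real tourism resources.
ONTOLOGY_LINKS = {
    "Museum": ["https://example.org/museums"],
    "Hotel": ["https://example.org/hotels"],
}

# Hypothetical mapping from extracted entity types to ontology concepts.
TYPE_TO_CONCEPT = {
    "museum": "Museum",
    "hotel": "Hotel",
}


@dataclass(frozen=True)
class Entity:
    """An entity as an extraction component might return it (assumed shape)."""
    text: str
    etype: str


def link_entities(entities):
    """Map each entity to its ontology concept and the concept's hyperlinks.

    Entities whose type has no concept in the ontology are skipped.
    """
    linked = []
    for entity in entities:
        concept = TYPE_TO_CONCEPT.get(entity.etype)
        if concept is None:
            continue  # no ontology concept, so nothing to link
        linked.append((entity, concept, ONTOLOGY_LINKS.get(concept, [])))
    return linked


# Made-up German sentence and hand-written extraction results.
text = "Das Stadtmuseum liegt neben dem Hotel Adler."
entities = [
    Entity("Stadtmuseum", "museum"),
    Entity("Hotel Adler", "hotel"),
    Entity("neben", "unknown"),
]

for entity, concept, links in link_entities(entities):
    print(entity.text, concept, links)
```

Keeping the type-to-concept table separate from the concept-to-links table mirrors the abstract's description that the ontology, rather than the extraction grammar, decides which hyperlinks are offered for a concept.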