Knowledge-based multilingual document analysis

  • Authors:
  • R. Basili;R. Catizone;L. Padro;M. T. Pazienza;G. Rigau;A. Setzer;N. Webb;F. Zanzotto

  • Affiliations:
  • University of Rome, Tor Vergata, Roma, Italy;University of Sheffield, Sheffield, UK;Universitat Politecnica de Catalunya, Barcelona, Spain;University of Rome, Tor Vergata, Roma, Italy;Universitat Politecnica de Catalunya, Barcelona, Spain;University of Sheffield, Sheffield, UK;University of Sheffield, Sheffield, UK;University of Rome, Tor Vergata, Roma, Italy

  • Venue:
  • SEMANET '02 Proceedings of the 2002 workshop on Building and using semantic networks - Volume 11
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

The growing availability of multilingual resources, like EuroWordnet, has recently inspired the development of large scale linguistic technologies, e.g. multilingual IE and Q&A, that were considered infeasible until a few years ago. In this paper a system for categorisation and automatic authoring of news streams in different languages is presented. In our system, a knowledge-based approach to Information Extraction is adopted as a support for hyperlinking. Authoring across documents in different languages is triggered by Named Entities and event recognition. The matching of events in texts is carried out by discourse processing driven by a large scale world model. This kind of multilingual analysis relies on a lexical knowledge base of nouns(i.e. the EuroWordnet Base Concepts) shared among English, Spanish and Italian lexicons. The impact of the design choices on the language independence and the possibilities it opens for automatic learning of the event hierarchy will be discussed.