Coupled hierarchical IR and stochastic models for surface information extraction

  • Authors:
  • Hugo Zaragoza;Patrick Gallinari

  • Affiliations:
  • LIP6, Université Pierre et Marie Curie, Paris Cedex;LIP6, Université Pierre et Marie Curie, Paris Cedex

  • Venue:
  • IRSG'98 Proceedings of the 20th Annual BCS-IRSG conference on Information Retrieval Research
  • Year:
  • 1998

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present in this paper a combination of Machine Learning based Information Retrieval (IR) techniques and stochastic language modelling in a hierarchical system that extracts surface information from text. At the lowest level of this hierarchy, documents and paragraphs are successively routed with IR techniques. At the top level, a stochastic language model extracts the most relevant phrases, and labels the type of information they contain. The approach and preliminary results are demonstrated on a subset of the MUC-6 Scenario Templates task.