Extracting and visualizing quotations from news wires

  • Authors:
  • Éric de La Clergerie;Benoît Sagot;Rosa Stern;Pascal Denis;Gaëlle Recourcé;Victor Mignot

  • Affiliations:
  • ALPAGE, INRIA Paris-Rocquencourt & Université Paris 7, Domaine de Voluceau - Rocquencourt, Le Chesnay Cedex - France;ALPAGE, INRIA Paris-Rocquencourt & Université Paris 7, Domaine de Voluceau - Rocquencourt, Le Chesnay Cedex - France;ALPAGE, INRIA Paris-Rocquencourt & Université Paris 7, Domaine de Voluceau - Rocquencourt, Le Chesnay Cedex - France;ALPAGE, INRIA Paris-Rocquencourt & Université Paris 7, Domaine de Voluceau - Rocquencourt, Le Chesnay Cedex - France;ALPAGE, INRIA Paris-Rocquencourt & Université Paris 7, Domaine de Voluceau - Rocquencourt, Le Chesnay Cedex - France;ALPAGE, INRIA Paris-Rocquencourt & Université Paris 7, Domaine de Voluceau - Rocquencourt, Le Chesnay Cedex - France

  • Venue:
  • LTC'09 Proceedings of the 4th conference on Human language technology: challenges for computer science and linguistics
  • Year:
  • 2009

Abstract

We introduce SAPIENS, a platform for extracting quotations from news wires, together with their author and context. The originality of SAPIENS is that it relies on a deep linguistic processing chain, which allows quotations to be extracted with wide coverage and under an extended definition, including quotations that are only partially quote-delimited verbatim transcripts. We describe the architecture of SAPIENS and how it was applied to a corpus of French news wires from the AFP news agency.