Summarizing speech by contextual reinforcement of important passages

  • Authors:
  • Ricardo Ribeiro;David Martins de Matos

  • Affiliations:
  • Instituto Universitário de Lisboa (ISCTE-IUL), Lisboa, Portugal and L2F, INESC-ID Lisboa, Lisboa, Portugal;Instituto Superior Técnico, Universidade Técnica de Lisboa, Lisboa, Portugal and L2F, INESC-ID Lisboa, Lisboa, Portugal

  • Venue:
  • PROPOR'12 Proceedings of the 10th international conference on Computational Processing of the Portuguese Language
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

We explore the use of contextual information of the same type, i.e., speech transcriptions, to assess the relevant content of a single information source. Our proposal consists in the use of topic-related additional information sources to contextualize the information of the main input source, improving the estimation of the most important passages. We analyse the impact of using as additional information both the full topic-related stories and just the passages from those stories that are closer to the passages of the input source to be summarized. A multi-document summarization framework, Latent Semantic Analysis (LSA), provides the means to assess the relevant content. To minimize the influence of speech-related problems, we explore several term weighting strategies. Evaluation is performed using an information-theoretic evaluation measure, the Jensen-Shannon divergence, that does not need reference summaries.