Self reinforcement for important passage retrieval

  • Authors:
  • Ricardo Ribeiro;Luís Marujo;David Martins de Matos;João P. Neto;Anatole Gershman;Jaime Carbonell

  • Affiliations:
  • ISCTE-IUL and INESC-ID Lisboa, Lisboa, Portugal;INESC ID Lisboa and CMU, Lisboa, Portugal;IST and INESC-ID Lisboa, Lisboa, Portugal;IST and INESC-ID Lisboa, Lisboa, Portugal;CMU, Pittsburgh, Pennsylvania, USA;CMU, Pittsburgh, Pennsylvania, USA

  • Venue:
  • Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In general, centrality-based retrieval models treat all elements of the retrieval space equally, which may reduce their effectiveness. In the specific context of extractive summarization (or important passage retrieval), this means that these models do not take into account that information sources often contain lateral issues, which are hardly as important as the description of the main topic, or are composed by mixtures of topics. We present a new two-stage method that starts by extracting a collection of key phrases that will be used to help centrality-as-relevance retrieval model. We explore several approaches to the integration of the key phrases in the centrality model. The proposed method is evaluated using different datasets that vary in noise (noisy vs clean) and language (Portuguese vs English). Results show that the best variant achieves relative performance improvements of about 31% in clean data and 18% in noisy data.