Finding related sentences in multiple documents for multidocument discourse parsing of Brazilian Portuguese texts

  • Authors:
  • Priscila Aleixo;Thiago Alexandre Salgueiro Pardo

  • Affiliations:
  • Universidade de São Paulo, São Carlos -- SP;Universidade de São Paulo, São Carlos -- SP

  • Venue:
  • Companion Proceedings of the XIV Brazilian Symposium on Multimedia and the Web
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Based on Cross-document Structure Theory (CST), we investigate the problem of finding related sentences from multiple documents on the same topic. We test some lexical similarity measures from related literature and improve them with language specific resources. The conclusions are that for Portuguese a different measure from English is the best one and that the knowledge resources we use affect the results in different ways.