Computer Evaluation of Indexing and Text Processing
Journal of the ACM (JACM)
The Theory and Practice of Discourse Parsing and Summarization
The Theory and Practice of Discourse Parsing and Summarization
Finding the WRITE Stuff: Automatic Identification of Discourse Structure in Student Essays
IEEE Intelligent Systems
PorTAL '02 Proceedings of the Third International Conference on Advances in Natural Language Processing
Adaptive duplicate detection using learnable string similarity measures
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Learning cross-document structural relationships using boosting
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
BLEU: a method for automatic evaluation of machine translation
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
An analysis of clarification dialogue for question answering
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
A common theory of information fusion from multiple text sources step one: cross-document structure
SIGDIAL '00 Proceedings of the 1st SIGdial workshop on Discourse and dialogue - Volume 10
Learning trees and rules with set-valued features
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Formalizing CST-based content selection operations
PROPOR'10 Proceedings of the 9th international conference on Computational Processing of the Portuguese Language
Hi-index | 0.00 |
Based on Cross-document Structure Theory (CST), we investigate the problem of finding related sentences from multiple documents on the same topic. We test some lexical similarity measures from related literature and improve them with language specific resources. The conclusions are that for Portuguese a different measure from English is the best one and that the knowledge resources we use affect the results in different ways.