An empirical study of information synthesis tasks

Authors:
Enrique Amigó;Julio Gonzalo;Víctor Peinado;Anselmo Peñas;Felisa Verdejo
Affiliations:
Universidad Nacional de Educación a Distancia, Madrid -- Spain;Universidad Nacional de Educación a Distancia, Madrid -- Spain;Universidad Nacional de Educación a Distancia, Madrid -- Spain;Universidad Nacional de Educación a Distancia, Madrid -- Spain;Universidad Nacional de Educación a Distancia, Madrid -- Spain
Venue:
ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Year:
2004

Citing 7
Cited 14

Foundations of statistical natural language processing

Foundations of statistical natural language processing
Creating and evaluating multi-document sentence extract summaries

Proceedings of the ninth international conference on Information and knowledge management
An evaluation corpus for temporal summarization

HLT '01 Proceedings of the first international conference on Human language technology research
BLEU: a method for automatic evaluation of machine translation

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Automatic evaluation of summaries using N-gram co-occurrence statistics

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Centroid-based summarization of multiple documents: sentence extraction, utility-based evaluation, and user studies

NAACL-ANLP-AutoSum '00 Proceedings of the 2000 NAACL-ANLPWorkshop on Automatic summarization - Volume 4
Examining the consensus between human summaries: initial experiments with factoid analysis

HLT-NAACL-DUC '03 Proceedings of the HLT-NAACL 03 on Text summarization workshop - Volume 5

Do summaries help?

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
QARLA: a framework for the evaluation of text summarization systems

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Answer extraction, semantic clustering, and extractive summarization for clinical question answering

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Using random walks for question-focused sentence retrieval

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Will pyramids built of nuggets topple over?

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
An exploration of the principles underlying redundancy-based factoid question answering

ACM Transactions on Information Systems (TOIS)
The role of information retrieval in answering complex questions

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Automatic summarization of MEDLINE citations for evidence-based medical treatment: A topic-oriented evaluation

Journal of Biomedical Informatics
Dimensionality reduction aids term co-occurrence based multi-document summarization

SumQA '06 Proceedings of the Workshop on Task-Focused Summarization and Question Answering
DUC 2005: evaluation of question-focused summarization systems

SumQA '06 Proceedings of the Workshop on Task-Focused Summarization and Question Answering
UNED at WebCLEF 2008: applying high restrictive summarization, low restrictive information retrieval and multilingual techniques

CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
Using semantic information to answer complex questions

Canadian AI'11 Proceedings of the 24th Canadian conference on Advances in artificial intelligence
Improving graph-based random walks for complex question answering using syntactic, shallow semantic and extended string subsequence kernels

Information Processing and Management: an International Journal
Degree centrality for semantic abstraction summarization of therapeutic studies

Journal of Biomedical Informatics

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes an empirical study of the "Information Synthesis" task, defined as the process of (given a complex information need) extracting, organizing and inter-relating the pieces of information contained in a set of relevant documents, in order to obtain a comprehensive, non redundant report that satisfies the information need.Two main results are presented: a) the creation of an Information Synthesis testbed with 72 reports manually generated by nine subjects for eight complex topics with 100 relevant documents each; and b) an empirical comparison of similarity metrics between reports, under the hypothesis that the best metric is the one that best distinguishes between manual and automatically generated reports. A metric based on key concepts overlap gives better results than metrics based on n-gram overlap (such as ROUGE) or sentence overlap.