Multidocument summarization: An added value to clustering in interactive retrieval

Authors:
Manuel J. Maña-López;Manuel De Buenaga;José M. Gómez-Hidalgo
Affiliations:
Universidad de Vigo, Huelva, Spain;Universidad Europea de Madrid, Madrid, Spain;Universidad Europea de Madrid, Madrid, Spain
Venue:
ACM Transactions on Information Systems (TOIS)
Year:
2004

Citing 24
Cited 15

Term-weighting approaches in automatic text retrieval

Information Processing and Management: an International Journal
Constructing literature abstracts by computer: techniques and prospects

Information Processing and Management: an International Journal - Special issue on natural language processing and information retrieval
Clustering algorithms

Information retrieval
Automatic text decomposition and structuring

Information Processing and Management: an International Journal
Reexamining the cluster hypothesis: scatter/gather on retrieval results

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Automatic text structuring and summarization

Information Processing and Management: an International Journal - Special issue: methods and tools for the automatic construction of hypertext
Advantages of query biased summaries in information retrieval

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Web document clustering: a feasibility demonstration

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Selecting text spans for document summaries: heuristics and metrics

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
New Methods in Automatic Extracting

Journal of the ACM (JACM)
Data mining: practical machine learning tools and techniques with Java implementations

Data mining: practical machine learning tools and techniques with Java implementations
Creating and evaluating multi-document sentence extract summaries

Proceedings of the ninth international conference on Information and knowledge management
Using clustering and classification approaches in interactive retrieval

Information Processing and Management: an International Journal - Special issue on interactivity at the text retrieval conference (TREC)
Evaluating document clustering for interactive information retrieval

Proceedings of the tenth international conference on Information and knowledge management
Information Retrieval

Information Retrieval
From E-Sex to E-Commerce: Web Search Changes

Computer
Model Selection in Unsupervised Learning with Applications To Document Clustering

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Implementation of the SMART Information Retrieval System

Implementation of the SMART Information Retrieval System
Using machine learning to improve information access

Using machine learning to improve information access
TextTiling: segmenting text into multi-paragraph subtopic passages

Computational Linguistics
An algorithm for one-page summarization of a long text based on thematic hierarchy detection

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Automatic Information Organization and Retrieval.

Automatic Information Organization and Retrieval.
Centroid-based summarization of multiple documents: sentence extraction, utility-based evaluation, and user studies

NAACL-ANLP-AutoSum '00 Proceedings of the 2000 NAACL-ANLPWorkshop on Automatic summarization - Volume 4
Multi-document summarization by visualizing topical content

NAACL-ANLP-AutoSum '00 Proceedings of the 2000 NAACL-ANLPWorkshop on Automatic summarization - Volume 4

Hoarding location-based data using clustering

Proceedings of the 4th ACM international workshop on Mobility management and wireless access
QCS: A system for querying, clustering and summarizing documents

Information Processing and Management: an International Journal
iSpreadRank: Ranking sentences for extraction-based summarization using feature weight propagation in the sentence similarity network

Expert Systems with Applications: An International Journal
Gather customer concerns from online product reviews - A text summarization approach

Expert Systems with Applications: An International Journal
Clustering of document collection - A weighting approach

Expert Systems with Applications: An International Journal
Genetic algorithm based multi-document summarization

PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
Sumstega: summarisation-based steganography methodology

International Journal of Information and Computer Security
Integrating Document Clustering and Multidocument Summarization

ACM Transactions on Knowledge Discovery from Data (TKDD)
Automatic summarization

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts of ACL 2011
The SINAMED and ISIS projects: applying text mining techniques to improve access to a medical digital library

ECDL'06 Proceedings of the 10th European conference on Research and Advanced Technology for Digital Libraries
Multi-document summarization based on BE-Vector clustering

CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
A new approach for cluster detection for large datasets with high dimensionality

DaWaK'05 Proceedings of the 7th international conference on Data Warehousing and Knowledge Discovery
Information retrieval from the web: an interactive paradigm

MIS'05 Proceedings of the 11th international conference on Advances in Multimedia Information Systems
On macro- and micro-level information in multiple documents and its influence on summarization

International Journal of Information Management: The Journal for Information Professionals
Summarising customer online reviews using a new text mining approach

International Journal of Business Information Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

A more and more generalized problem in effective information access is the presence in the same corpus of multiple documents that contain similar information. Generally, users may be interested in locating, for a topic addressed by a group of similar documents, one or several particular aspects. This kind of task, called instance or aspectual retrieval, has been explored in several TREC Interactive Tracks. In this article, we propose in addition to the classification capacity of clustering techniques, the possibility of offering a indicative extract about the contents of several sources by means of multidocument summarization techniques. Two kinds of summaries are provided. The first one covers the similarities of each cluster of documents retrieved. The second one shows the particularities of each document with respect to the common topic in the cluster. The document multitopic structure has been used in order to determine similarities and differences of topics in the cluster of documents. The system is independent of document domain and genre. An evaluation of the proposed system with users proves significant improvements in effectiveness. The results of previous experiments that have compared clustering algorithms are also reported.