A trainable document summarizer
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Stochastic complexity in learning
Journal of Computer and System Sciences - Special issue: 26th annual ACM symposium on the theory of computing & STOC'94, May 23–25, 1994, and second annual Europe an conference on computational learning theory (EuroCOLT'95), March 13–15, 1995
The use of MMR, diversity-based reranking for reordering documents and producing summaries
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Foundations of statistical natural language processing
Foundations of statistical natural language processing
The automatic construction of large-scale corpora for summarization research
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
New Methods in Automatic Extracting
Journal of the ACM (JACM)
Refining Initial Points for K-Means Clustering
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
X-means: Extending K-means with Efficient Estimation of the Number of Clusters
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Pattern Classification (2nd Edition)
Pattern Classification (2nd Edition)
Distribution of content words and phrases in text and language modelling
Natural Language Engineering
The rhetorical parsing of natural language texts
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Fast generation of abstracts from general domain text corpora by extracting relevant sentences
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
Query-relevant summarization using FAQs
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Using sentence-selection heuristics to rank text segments in TXTRACTOR
Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
The use of unlabeled data to improve supervised learning for text summarization
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
The diversity-based approach to open-domain text summarization
Information Processing and Management: an International Journal
Refining a divisive partitioning algorithm for unsupervised clustering
Design and application of hybrid intelligent systems
Supervised ranking in open-domain text summarization
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Combining optimal clustering and Hidden Markov models for extractive summarization
MultiSumQA '03 Proceedings of the ACL 2003 workshop on Multilingual summarization and question answering - Volume 12
A study for documents summarization based on personal annotation
HLT-NAACL-DUC '03 Proceedings of the HLT-NAACL 03 on Text summarization workshop - Volume 5
Summary in context: Searching versus browsing
ACM Transactions on Information Systems (TOIS)
One story, one flow: Hidden Markov Story Models for multilingual multidocument summarization
ACM Transactions on Speech and Language Processing (TSLP)
Pushing task relevant web links down to the desktop
WIDM '06 Proceedings of the 8th annual ACM international workshop on Web information and data management
Summarizing local context to personalize global web search
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Designing semantics-preserving cluster representatives for scientific input conditions
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Corpus and evaluation measures for multiple document summarization with multiple sources
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
An information delivery system with automatic summarization for mobile commerce
Decision Support Systems
CollabSum: exploiting multiple document clustering for collaborative single document summarizations
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Document concept lattice for text understanding and summarization
Information Processing and Management: an International Journal
User-model based personalized summarization
Information Processing and Management: an International Journal
Locality-Based pruning methods for web search
ACM Transactions on Information Systems (TOIS)
TSCAN: a novel method for topic summarization and content anatomy
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
GA, MR, FFNN, PNN and GMM based models for automatic text summarization
Computer Speech and Language
Generic Summarization Using Non-negative Semantic Variable
ICIC '08 Proceedings of the 4th international conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications - with Aspects of Theoretical and Methodological Issues
MedSearch: a specialized search engine for medical information retrieval
Proceedings of the 17th ACM conference on Information and knowledge management
Automatic generic document summarization based on non-negative matrix factorization
Information Processing and Management: an International Journal
A Comparative Study of Probabilistic Ranking Models for Chinese Spoken Document Summarization
ACM Transactions on Asian Language Information Processing (TALIP)
Enhancing diversity, coverage and balance for summarization through structure learning
Proceedings of the 18th international conference on World wide web
Single document summarization with document expansion
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Text summarization model based on the budgeted median problem
Proceedings of the 18th ACM conference on Information and knowledge management
Exploiting neighborhood knowledge for single document summarization and keyphrase extraction
ACM Transactions on Information Systems (TOIS)
APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
Chinese text automatic summarization based on affinity propagation cluster
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 1
Extracting multi-document summarization based on local topics
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 2
Exploiting novelty, coverage and balance for topic-focused multi-document summarization
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Multi-document summarization for terrorism information extraction
ISI'06 Proceedings of the 4th IEEE international conference on Intelligence and Security Informatics
Incorporating cross-document relationships between sentences for single document summarizations
ECDL'06 Proceedings of the 10th European conference on Research and Advanced Technology for Digital Libraries
An improved approach to extract document summaries based on popularity
DNIS'05 Proceedings of the 4th international conference on Databases in Networked Information Systems
Improving document summarization by incorporating social contextual information
AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
Large-margin learning of submodular summarization models
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Sentence clustering via projection over term clusters
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Semantics-based event-driven web news classification
ISPA'07 Proceedings of the 2007 international conference on Frontiers of High Performance Computing and Networking
Exploiting relevance, coverage, and novelty for query-focused multi-document summarization
Knowledge-Based Systems
Hi-index | 0.00 |
The paper presents a novel approach to unsupervised text summarization. The novelty lies in exploiting the diversity of concepts in text for summarization, which has not received much attention in the summarization literature. A diversity-based approach here is a principled generalization of Maximal Marginal Relevance criterion by Carbonell and Goldstein \cite{carbonell-goldstein98}.We propose, in addition, aninformation-centricapproach to evaluation, where the quality of summaries is judged not in terms of how well they match human-created summaries but in terms of how well they represent their source documents in IR tasks such document retrieval and text categorization.To find the effectiveness of our approach under the proposed evaluation scheme, we set out to examine how a system with the diversity functionality performs against one without, using the BMIR-J2 corpus, a test data developed by a Japanese research consortium. The results demonstrate a clear superiority of a diversity based approach to a non-diversity based approach.