Generic text summarization using relevance measure and latent semantic analysis

Authors:
Yihong Gong;Xin Liu
Affiliations:
NEC USA, San Jose, CA;NEC USA, San Jose, CA
Venue:
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Year:
2001

Abstract

In this paper, we propose two generic text summarization methods that create summaries by ranking and extracting sentences from the original documents. The first method uses standard IR methods to rank sentences by relevance, while the second uses latent semantic analysis to identify semantically important sentences. Both methods strive to select sentences that are highly ranked and different from each other, in an attempt to create a summary with wider coverage of the document's main content and less redundancy. The two methods are evaluated by comparing their outputs with manual summaries generated by three independent human evaluators. The evaluations also study the influence of different vector space model (VSM) weighting schemes on summarization performance. Finally, the causes of the large disparities among the evaluators' manual summaries are investigated, and human text summarization patterns are discussed.
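
The idea of selecting sentences that are both highly ranked and different from each other can be illustrated with a minimal sketch. The abstract does not give the scoring formula, so the details here are assumptions: relevance is taken to be the inner product of raw term-frequency vectors, and redundancy is reduced by removing the terms of each chosen sentence from the document before the next pick. The names `tokenize` and `summarize_by_relevance` are hypothetical, and the sketch covers only the relevance-based method, not the latent semantic analysis one.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase and keep alphabetic runs; a stand-in for real preprocessing."""
    return re.findall(r"[a-z]+", text.lower())


def summarize_by_relevance(sentences, k):
    """Greedy extractive sketch (assumed scoring, not the paper's exact formula).

    Repeatedly pick the sentence most relevant to the remaining document,
    then delete its terms from the document so later picks add new content.
    """
    sent_vecs = [Counter(tokenize(s)) for s in sentences]

    # Document vector: summed term frequencies over all sentences.
    doc_vec = Counter()
    for vec in sent_vecs:
        doc_vec.update(vec)

    def score(i):
        # Inner product of sentence i with the current document vector.
        return sum(tf * doc_vec[term] for term, tf in sent_vecs[i].items())

    remaining = set(range(len(sentences)))
    chosen = []
    while remaining and len(chosen) < k:
        # Iterate in index order so ties go to the earliest sentence.
        best = max(sorted(remaining), key=score)
        if score(best) == 0:
            break  # nothing left that adds uncovered terms
        chosen.append(best)
        remaining.discard(best)
        # Remove covered terms so near-duplicate sentences score low next round.
        for term in sent_vecs[best]:
            doc_vec.pop(term, None)

    # Return the selected sentences in their original document order.
    return [sentences[i] for i in sorted(chosen)]


if __name__ == "__main__":
    doc = [
        "Cats chase mice.",
        "Cats chase mice daily.",
        "Dogs bark loudly.",
        "Dogs bark.",
    ]
    for sentence in summarize_by_relevance(doc, 2):
        print(sentence)
```

In this toy document the first sentence has the second-highest initial score, yet it is skipped on the second pick because all of its terms were already covered by the first selection; that is the coverage-versus-redundancy trade-off the abstract describes.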