Headline generation based on statistical translation

Authors:
Michele Banko;Vibhu O. Mittal;Michael J. Witbrock
Affiliations:
Johns Hopkins University, Baltimore, MD;Just Research, Pittsburgh, PA;Lycos Inc., Waltham, MA
Venue:
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Year:
2000

Citing 13
Cited 40

Automatic text structuring and summarization

Information Processing and Management: an International Journal - Special issue: methods and tools for the automatic construction of hypertext
The decomposition of human-written summary sentences

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Information retrieval as statistical translation

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Ultra-summarization (poster abstract): a statistical approach to generating highly condensed non-extractive summaries

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Towards multidocument summarization by reformulation: progress and prospects

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Selecting text spans for document summaries: heuristics and metrics

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Problems in automatic abstracting

Communications of the ACM
Statistics-Based Summarization - Step One: Sentence Compression

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
The mathematics of statistical machine translation: parameter estimation

Computational Linguistics - Special issue on using large corpora: II
Generating natural language summaries from multiple on-line sources

Computational Linguistics - Special issue on natural language generation
A structured language model

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Exploiting syntactic structure for language modeling

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Improving summaries by revising them

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics

Mining the web for answers to natural language questions

Proceedings of the tenth international conference on Information and knowledge management
Summarization beyond sentence extraction: a probabilistic approach to sentence compression

Artificial Intelligence
Book Review, Inderjeet Mani: Automatic Summarization, John Benjamins Publishing Co., Amsterdam, The Netherlands, 2001, xi + 286 pp.

Machine Translation
A noisy-channel model for document compression

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Discovering "title-like" terms

Information Processing and Management: an International Journal
Induction of Word and Phrase Alignments for Automatic Document Summarization

Computational Linguistics
Lexical substitution as a task for WSD evaluation

WSD '02 Proceedings of the ACL-02 workshop on Word sense disambiguation: recent successes and future directions - Volume 8
Using thematic information in statistical headline generation

MultiSumQA '03 Proceedings of the ACL 2003 workshop on Multilingual summarization and question answering - Volume 12
Combining optimal clustering and Hidden Markov models for extractive summarization

MultiSumQA '03 Proceedings of the ACL 2003 workshop on Multilingual summarization and question answering - Volume 12
Hedge Trimmer: a parse-and-trim approach to headline generation

HLT-NAACL-DUC '03 Proceedings of the HLT-NAACL 03 on Text summarization workshop - Volume 5
Automatic summarising: The state of the art

Information Processing and Management: an International Journal
Abstractive headline generation using WIDL-expressions

Information Processing and Management: an International Journal
Multi-candidate reduction: Sentence compression as a tool for document summarization tasks

Information Processing and Management: an International Journal
Syntactic sentence compression in the biomedical domain: facilitating access to related articles

Information Retrieval
Generating succinct titles for web URLs

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Predicting the fluency of text with shallow structural features: case studies of machine translation and human-written text

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Generic title labeling for clustered documents

Expert Systems with Applications: An International Journal
A classification algorithm for predicting the structure of summaries

UCNLG+Sum '09 Proceedings of the 2009 Workshop on Language Generation and Summarisation
Document retrieval: shallow data, deep theories; historical reflections, potential directions

ECIR'03 Proceedings of the 25th European conference on IR research
Automatic generation of story highlights

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
How many words is a picture worth? Automatic caption generation for news images

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Title generation with quasi-synchronous grammar

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Multi-sentence compression: finding shortest paths in word graphs

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Structural features for predicting the linguistic quality of text: applications to machine translation, automatic summarization and human-authored text

Empirical methods in natural language generation
Opinion summarization with integer linear programming formulation for sentence extraction and ordering

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Learning predicate insertion rules for document abstracting

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
Multi-document summarization using link analysis based on rhetorical relations between sentences

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
Automatic keyphrase extraction by bridging vocabulary gap

CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Toward generic title generation for clustered documents

AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
Mining the interests of Chinese microbloggers via keyword extraction

Frontiers of Computer Science in China
A multi-layered summarization system for multi-media archives by understanding and structuring of chinese spoken documents

ISCSLP'06 Proceedings of the 5th international conference on Chinese Spoken Language Processing
Towards developing probabilistic generative models for reasoning with natural language representations

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
A simple word trigger method for social tag suggestion

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Topic tracking based on linguistic features

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
An approach to summarizing Bengali news documents

Proceedings of the International Conference on Advances in Computing, Communications and Informatics
A section title authoring tool for clinical guidelines

Proceedings of the 2012 ACM symposium on Document engineering
Just title it! (by an online application)

EACL '12 Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics
NOMIT: automatic titling by nominalizing

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
How can catchy titles be generated without loss of informativeness?

Expert Systems with Applications: An International Journal
A global model for concept-to-text generation

Journal of Artificial Intelligence Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

Extractive summarization techniques cannot generate document summaries shorter than a single sentence, something that is often required. An ideal summarization system would understand each document and generate an appropriate summary directly from the results of that understanding. A more practical approach to this problem results in the use of an approximation: viewing summarization as a problem analogous to statistical machine translation. The issue then becomes one of generating a target document in a more concise language from a source document in a more verbose language. This paper presents results on experiments using this approach, in which statistical models of the term selection and term ordering are jointly applied to produce summaries in a style learned from a training corpus.