A new probabilistic model for title generation

Authors:
Rong Jin;Alexander G. Hauptmann
Affiliations:
Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA
Venue:
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Year:
2002

Citing 7
Cited 6

Term-weighting approaches in automatic text retrieval

Information Processing and Management: an International Journal
A statistical approach to machine translation

Computational Linguistics
Ultra-summarization (poster abstract): a statistical approach to generating highly condensed non-extractive summaries

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Automatic title generation for EM

DL '00 Proceedings of the fifth ACM conference on Digital libraries
Information Retrieval

Information Retrieval
Advances in Automatic Text Summarization

Advances in Automatic Text Summarization
Learning to Select Good Title Words: An New Approach based on Reverse Information Retrieval

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning

Web-assisted annotation, semantic indexing and search of television and radio news

WWW '05 Proceedings of the 14th international conference on World Wide Web
Automatic labeling of multinomial topic models

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
The effect of title term suggestion on e-commerce sites

Proceedings of the 10th ACM workshop on Web information and data management
How many words is a picture worth? Automatic caption generation for news images

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Automatic headline generation using character cross-correlation

HLT-SS '11 Proceedings of the ACL 2011 Student Session
Comparing topiary-style approaches to headline generation

ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

Title generation is a complex task involving both natural language understanding and natural language synthesis. In this paper, we propose a new probabilistic model for title generation. Different from the previous statistical models for title generation, which treat title generation as a generation process that converts the 'document representation' of information directly into a 'title representation' of the same information, this model introduces a hidden state called 'information source' and divides title generation into two steps, namely the step of distilling the 'information source' from the observation of a document and the step of generating a title from the estimated 'information source'. In our experiment, the new probabilistic model outperforms the previous model for title generation in terms of both automatic evaluations and human judgments.