Statistical acquisition of content selection rules for natural language generation

Authors:
Pablo A. Duboue;Kathleen R. McKeown
Affiliations:
Columbia University;Columbia University
Venue:
EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Year:
2003

Citing 12
Cited 18

Text generation: using discourse strategies and focus constraints to generate natural language text

Text generation: using discourse strategies and focus constraints to generate natural language text
Extracting viewpoints from knowledge bases

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Planning text for advisory dialogues: capturing intentional and rhetorical information

Computational Linguistics
Developing and empirically evaluating robust explanation generators: the KNIGHT experiments

Computational Linguistics
Building a generation knowledge source using Internet-accessible newswire

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Producing biographical summaries: combining linguistic knowledge with corpus statistics

ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
A two-stage model for content determination

EWNLG '01 Proceedings of the 8th European workshop on Natural Language Generation - Volume 8
Knowledge acquisition for natural language generation

INLG '00 Proceedings of the first international conference on Natural language generation - Volume 14
Dealing with dependencies between content planning and surface realisation in a pipeline generation architecture

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
PROGENIE: biographical descriptions for intelligence analysis

ISI'03 Proceedings of the 1st NSF/NIJ conference on Intelligence and security informatics
From local to global coherence: a bottom-up approach to text planning

AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Learning trees and rules with set-valued features

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1

Tell me what you do and I'll tell you what you are: learning occupation-related activities for biographies

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Collective content selection for concept-to-text generation

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Automatic creation of domain templates

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Investigating content selection for language generation using machine learning

ENLG '09 Proceedings of the 12th European Workshop on Natural Language Generation
Individual and domain adaptation in sentence planning for dialogue

Journal of Artificial Intelligence Research
Natural language query recommendation in conversation systems

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Non-textual event summarization by applying machine learning to template-based language generation

UCNLG+Sum '09 Proceedings of the 2009 Workshop on Language Generation and Summarisation
Training a multilingual sportscaster: using perceptual context to learn language

Journal of Artificial Intelligence Research
Focused and aggregated search: a perspective from natural language generation

Information Retrieval
Computing EM-based alignments of routes and route directions as a basis for natural language generation

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Learning what to say and how to say it: Joint optimisation of spoken dialogue management and natural language generation

Computer Speech and Language
FootbOWL: using a generic ontology of football competition for planning match summaries

ESWC'11 Proceedings of the 8th extended semantic web conference on The semantic web: research and applications - Volume Part I
A framework for the automatic extraction of rules from online text

RuleML'2011 Proceedings of the 5th international conference on Rule-based reasoning, programming, and applications
Content selection from an ontology-based knowledge base for the generation of football summaries

ENLG '11 Proceedings of the 13th European Workshop on Natural Language Generation
Detecting interesting event sequences for sports reporting

ENLG '11 Proceedings of the 13th European Workshop on Natural Language Generation
Editorial: Occupation inference through detection and classification of biographical activities

Data & Knowledge Engineering
Content selection from semantic web data

INLG '12 Proceedings of the Seventh International Natural Language Generation Conference
Generating natural language descriptions from OWL ontologies: the natural OWL system

Journal of Artificial Intelligence Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

A Natural Language Generation system produces text using as input semantic data. One of its very first tasks is to decide which pieces of information to convey in the output. This task, called Content Selection, is quite domain dependent, requiring considerable re-engineering to transport the system from one scenario to another. In this paper, we present a method to acquire content selection rules automatically from a corpus of text and associated semantics. Our proposed technique was evaluated by comparing its output with information selected by human authors in unseen texts, where we were able to filter half the input data set without loss of recall.