Content selection from an ontology-based knowledge base for the generation of football summaries

Authors:
Nadjet Bouayad-Agha;Gerard Casamayor;Leo Wanner
Affiliations:
DTIC, University Pompeu, Fabra Barcelona, Spain;DTIC, University Pompeu, Fabra Barcelona, Spain;ICREA and DTIC, University Pompeu, Fabra Barcelona, Spain
Venue:
ENLG '11 Proceedings of the 13th European Workshop on Natural Language Generation
Year:
2011

Citing 16
Cited 2

Automated discourse generation using discourse structure relations

Artificial Intelligence - Special volume on natural language processing
BoosTexter: A Boosting-based Systemfor Text Categorization

Machine Learning - Special issue on information retrieval
Planning text for advisory dialogues: capturing intentional and rhetorical information

Computational Linguistics
ILEX: an architecture for a dynamic hypertext generation system

Natural Language Engineering
Statistical acquisition of content selection rules for natural language generation

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Collective content selection for concept-to-text generation

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Choosing the content of textual summaries of large time-series data sets

Natural Language Engineering
Natural language directed inference from ontologies

Artificial Intelligence
Automatic generation of textual summaries from neonatal intensive care data

Artificial Intelligence
An architecture for data-to-text systems

ENLG '07 Proceedings of the Eleventh European Workshop on Natural Language Generation
Investigating content selection for language generation using machine learning

ENLG '09 Proceedings of the 12th European Workshop on Natural Language Generation
Learning content selection rules for generating object descriptions in dialogue

Journal of Artificial Intelligence Research
A discourse-aware graph-based content-selection framework

INLG '10 Proceedings of the 6th International Natural Language Generation Conference
MARQUIS: GENERATION OF USER-TAILORED MULTILINGUAL AIR QUALITY BULLETINS

Applied Artificial Intelligence
Expressing OWL axioms by English sentences: dubious in theory, feasible in practice

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
FootbOWL: using a generic ontology of football competition for planning match summaries

ESWC'11 Proceedings of the 8th extended semantic web conference on The semantic web: research and applications - Volume Part I

Perspective-oriented generation of football match summaries: Old tasks, new challenges

ACM Transactions on Speech and Language Processing (TSLP)
Content selection from semantic web data

INLG '12 Proceedings of the Seventh International Natural Language Generation Conference

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present an approach to content selection that works on an ontology-based knowledge base developed independently from the task at hand, i.e., Natural Language Generation. Prior to content selection, a stage akin to signal analysis and data assessment used in the generation from numerical data is performed for identifying and abstracting patterns and trends, and identifying relations between individuals. This new information is modeled as an extended ontology on top of the domain ontology which is populated via inference rules. Content selection leverages the ontology-based description of the domain and is performed throughout the text planning at increasing levels of granularity. It includes a main topic selection phase that takes into account a simple user model, a set of heuristics, and semantic relations that link individuals of the KB. The heuristics are based on weights determined empirically by supervised learning on a corpus of summaries aligned with data. The generated texts are short football match summaries that take into account the user perspective.