Combining automatic acquisition of knowledge with machine learning approaches for multilingual temporal recognition and normalization

Authors:
E. Saquete;O. Ferrández;S. Ferrández;P. Martínez-Barco;R. Muñoz
Affiliations:
Natural Language Processing and Information System Group, Department of Software and Computing Systems, University of Alicante, Apartado de correos 99, 03080 Alicante, Spain;Natural Language Processing and Information System Group, Department of Software and Computing Systems, University of Alicante, Apartado de correos 99, 03080 Alicante, Spain;Natural Language Processing and Information System Group, Department of Software and Computing Systems, University of Alicante, Apartado de correos 99, 03080 Alicante, Spain;Natural Language Processing and Information System Group, Department of Software and Computing Systems, University of Alicante, Apartado de correos 99, 03080 Alicante, Spain;Natural Language Processing and Information System Group, Department of Software and Computing Systems, University of Alicante, Apartado de correos 99, 03080 Alicante, Spain
Venue:
Information Sciences: an International Journal
Year:
2008

Citing 33
Cited 5

Entropy of English text: experiments with humans and a machine learning system based on rough sets

Information Sciences: an International Journal - From rough sets to soft computing
Foundations of statistical natural language processing

Foundations of statistical natural language processing
Statistical Language Learning

Statistical Language Learning
Maximum entropy models for natural language ambiguity resolution

Maximum entropy models for natural language ambiguity resolution
A maximum entropy approach to named entity recognition

A maximum entropy approach to named entity recognition
Tagging English text with a probabilistic model

Computational Linguistics
TnT: a statistical part-of-speech tagger

ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Evaluation of the Automatic Multilinguality for Time Expression Resolution

DEXA '04 Proceedings of the Database and Expert Systems Applications, 15th International Workshop
Maximum Entropy Modeling: A Suitable Framework to Learn Context-Dependent Lexicon Models for Statistical Machine Translation

Machine Learning
Telling apart temporal locating adverbials and time-denoting expressions

TASIP '01 Proceedings of the workshop on Temporal and spatial information processing - Volume 13
A multilingual approach to annotating and extracting temporal information

TASIP '01 Proceedings of the workshop on Temporal and spatial information processing - Volume 13
Named Entity Extraction using AdaBoost

COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Introduction to the CoNLL-2002 shared task: language-independent named entity recognition

COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Introduction to the CoNLL-2003 shared task: language-independent named entity recognition

CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Automatic resolution rule assignment to multilingual Temporal Expressions using annotated corpora

TIME '06 Proceedings of the Thirteenth International Symposium on Temporal Representation and Reasoning
Event ordering using TERSEO system

Data & Knowledge Engineering - Special issue: Application of natural language to information systems (NLDB04)
Learning finite-state models for machine translation

Machine Learning
Splitting complex temporal questions for question answering systems

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Learning event durations from event descriptions

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
A maximum entropy word aligner for Arabic-English machine translation

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Learning to classify e-mail

Information Sciences: an International Journal
Combining data-driven systems for improving Named Entity Recognition

Data & Knowledge Engineering
Corpus-based semantic role approach in information retrieval

Data & Knowledge Engineering
Semantic passage segmentation based on sentence topics for question answering

Information Sciences: an International Journal
A Model-Based Learning Process for Modeling Coarticulation of Human Speech

IEICE - Transactions on Information and Systems
Applying Machine Learning to Chinese Entity Detection and Tracking

CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing
Combining knowledge- and corpus-based word-sense-disambiguation methods

Journal of Artificial Intelligence Research
The stages of event extraction

ARTE '06 Proceedings of the Workshop on Annotating and Reasoning about Time and Events
A pilot study on acquiring metric temporal constraints for events

ARTE '06 Proceedings of the Workshop on Annotating and Reasoning about Time and Events
Evaluating knowledge-based approaches to the multilingual extension of a temporal expression normalizer

ARTE '06 Proceedings of the Workshop on Annotating and Reasoning about Time and Events
Investigating the best configuration of HMM spanish pos tagger when minimum amount of training data is available

NLDB'05 Proceedings of the 10th international conference on Natural Language Processing and Information Systems
Automatic time expression labeling for english and chinese text

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Error bounds for convolutional codes and an asymptotically optimum decoding algorithm

IEEE Transactions on Information Theory

A scalable machine-learning approach for semi-structured named entity recognition

Proceedings of the 19th international conference on World wide web
Ontology learning from biomedical natural language documents using UMLS

Expert Systems with Applications: An International Journal
Automatic transformation from TIDES to TimeML annotation

Language Resources and Evaluation
BioOntoVerb: A top level ontology based framework to populate biomedical ontologies from texts

Knowledge-Based Systems
Extending enterprise service design knowledge using clustering

ICSOC'12 Proceedings of the 10th international conference on Service-Oriented Computing

Quantified Score

Hi-index	0.08

Visualization

Abstract

This paper presents an improvement in the temporal expression (TE) recognition phase of a knowledge based system at a multilingual level. For this purpose, the combination of different approaches applied to the recognition of temporal expressions are studied. In this work, for the recognition task, a knowledge based system that recognizes temporal expressions and had been automatically extended to other languages (TERSEO system) was combined with a system that recognizes temporal expressions using machine learning techniques. In particular, two different techniques were applied: maximum entropy model (ME) and hidden Markov model (HMM), using two different types of tagging of the training corpus: (1) BIO model tagging of literal temporal expressions and (2) BIO model tagging of simple patterns of temporal expressions. Each system was first evaluated independently and then combined in order to: (a) analyze if the combination gives better results without increasing the number of erroneous expressions in the same percentage and (b) decide which machine learning approach performs this task better. When the TERSEO system is combined with the maximum entropy approach the best results for F-measure (89%) are obtained, improving TERSEO recognition by 4.5 points and ME recognition by 7.