Semantic and syntactic features for Dutch coreference resolution

Authors:
Iris Hendrickx;Veronique Hoste;Walter Daelemans
Affiliations:
CNTS, Language Technology Group, University of Antwerp, Antwerp, Belgium;Language and Translation Technology Team, University College Ghent, Ghent, Belgium;CNTS, Language Technology Group, University of Antwerp, Antwerp, Belgium
Venue:
CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
Year:
2008

Abstract

We investigate the effect of encoding additional semantic and syntactic information sources in a classification-based machine learning approach to coreference resolution for Dutch. We experiment with both a memory-based learning approach and a maximum entropy modeling method. As an alternative to external lexical resources, such as the low-coverage Dutch EuroWordNet, we evaluate automatically generated semantic clusters as an information source. We compare these clusters, which group together semantically similar nouns, to two EuroWordNet-based semantic features that encode synonym and hypernym relations between nouns. The syntactic function of the anaphor and the antecedent in the sentence can be an important clue for resolving coreferential relations. As a baseline approach, we encode syntactic information predicted by a memory-based shallow parser in a set of features. We contrast these shallow-parse-based features with features encoding richer syntactic information from a dependency parser. We show that using both the additional semantic information and the additional syntactic information leads to a small but significant performance improvement of our coreference resolution approach.
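
To make the classification-based setup concrete, the following is a minimal sketch of how pairs of mentions might be turned into feature vectors that combine a semantic cluster feature with syntactic function features. It is an illustration under stated assumptions, not the authors' implementation: the `Mention` fields, the `CLUSTERS` table and its cluster ids, the feature names, and the pairing strategy are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Mention:
    head: str      # head noun of the noun phrase
    function: str  # syntactic function label, e.g. "subject" or "object"
    sentence: int  # index of the sentence containing the mention


# Hypothetical cluster table mapping head nouns to cluster ids.
# The entries are toy values chosen for illustration, not data from the paper.
CLUSTERS: Dict[str, int] = {"president": 7, "minister": 7, "auto": 12, "wagen": 12}


def pair_features(antecedent: Mention, anaphor: Mention) -> Dict[str, object]:
    """Build one feature dictionary for a candidate antecedent/anaphor pair."""
    a = CLUSTERS.get(antecedent.head.lower())
    b = CLUSTERS.get(anaphor.head.lower())
    return {
        # Semantic feature: do both head nouns fall in the same cluster?
        "same_cluster": a is not None and a == b,
        # Syntactic features: the function of each mention in its sentence.
        "antecedent_function": antecedent.function,
        "anaphor_function": anaphor.function,
        "sentence_distance": anaphor.sentence - antecedent.sentence,
    }


def make_instances(mentions: List[Mention]) -> List[Dict[str, object]]:
    """Pair every mention with each mention that precedes it in the text."""
    return [
        pair_features(mentions[i], mentions[j])
        for j in range(len(mentions))
        for i in range(j)
    ]
```

Each resulting dictionary would be passed to a classifier (a memory-based learner or a maximum entropy model in the setting described above) that decides whether the two mentions are coreferent. A head noun missing from the cluster table simply yields `same_cluster = False`, which is one simple way to handle the coverage gaps of a lexical resource.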