Learning dictionaries for information extraction by multi-level bootstrapping
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Class-Based Construction of a Verb Lexicon
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
An empirically based system for processing definite descriptions
Computational Linguistics
A non-projective dependency parser
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Verb class disambiguation using informative priors
Computational Linguistics
Inducing a semantically annotated lexicon via EM-based clustering
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Corpus-based identification of non-anaphoric noun phrases
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Acquiring lexical generalizations from corpora: a case study for diathesis alternations
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Identifying anaphoric and non-anaphoric noun phrases to improve coreference resolution
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
High-precision identification of discourse new and unique noun phrases
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 2
A bootstrapping method for learning semantic lexicons using extraction pattern contexts
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Infinitely Imbalanced Logistic Regression
The Journal of Machine Learning Research
Design of a multi-lingual, parallel-processing statistical parsing engine
HLT '02 Proceedings of the second international conference on Human Language Technology Research
A general feature space for automatic verb classification
Natural Language Engineering
Modern Multivariate Statistical Techniques: Regression, Classification, and Manifold Learning
Modern Multivariate Statistical Techniques: Regression, Classification, and Manifold Learning
NP animacy identification for anaphora resolution
Journal of Artificial Intelligence Research
The WEKA data mining software: an update
ACM SIGKDD Explorations Newsletter
Improving verb clustering with automatically acquired selectional preferences
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Variable selection in logistic regression: the British English dative alternation
ESSLLI'08/09 Proceedings of the 2008 international conference on Interfaces: explorations in logic, language and computation
SWITCHBOARD: telephone speech corpus for research and development
ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1
Query difficulty prediction for contextual image retrieval
ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
Economically organized hierarchies in WordNet and the Oxford English Dictionary
Cognitive Systems Research
Hi-index | 0.00 |
In this article, we automatically create two large and richly annotated data sets for studying the English dative alternation. With an intrinsic and an extrinsic evaluation, we address the question of whether such data sets that are obtained and enriched automatically are suitable for linguistic research, even if they contain errors. The extrinsic evaluation consists of building logistic regression models with these data sets. We conclude that the automatic approach for detecting instances of the dative alternation still needs human intervention, but that it is indeed possible to annotate the instances with features that are syntactic, semantic and discourse-related in nature. Only the automatic classification of the concreteness of nouns is problematic.