Mining syntactically annotated corpora with XQuery

Authors:
Gosse Bouma; Geert Kloosterman
Affiliations:
University of Groningen, The Netherlands; University of Groningen, The Netherlands
Venue:
LAW '07 Proceedings of the Linguistic Annotation Workshop
Year:
2007

Citing 5

Discovery of inference rules for question-answering
Natural Language Engineering

Question answering passage retrieval using dependency relations
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval

Designing and Evaluating an XPath Dialect for Linguistic Queries
ICDE '06 Proceedings of the 22nd International Conference on Data Engineering

A shortest path dependency kernel for relation extraction
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing

Towards an alternative implementation of NXT's query language via XQuery
NLPXML '06 Proceedings of the 5th Workshop on NLP and XML: Multi-Dimensional Markup in Natural Language Processing

Cited 2

Parsed corpora for linguistics
ILCL '09 Proceedings of the EACL 2009 Workshop on the Interaction between Linguistics and Computational Linguistics: Virtuous, Vicious or Vacuous?

Using large-scale parser output to guide grammar development
GEAF '09 Proceedings of the 2009 Workshop on Grammar Engineering Across Frameworks

Abstract

This paper presents a uniform approach to data extraction from syntactically annotated corpora encoded in XML. XQuery, which incorporates XPath, has been designed as a query language for XML. The combination of XPath and XQuery offers flexibility and expressive power, while corpus-specific functions can be added to reduce the complexity of individual extraction tasks. We illustrate our approach using examples from dependency treebanks for Dutch.
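
As a rough illustration of the idea in the abstract, the sketch below extracts verb-argument pairs from a small dependency tree encoded in XML. It is not the authors' code and it is not XQuery: it uses the limited XPath subset of Python's standard `xml.etree.ElementTree` as a stand-in. The XML layout (nested `node` elements with `rel`, `cat`, `pos`, `root` and `word` attributes), the sample sentence, and the function names `head_root` and `verb_argument_pairs` are all assumptions made for this example. `head_root` plays the role of a corpus-specific helper that hides the details of the annotation so that the extraction query itself stays short.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified dependency tree for the Dutch sentence
# "De hond ziet de kat" ("The dog sees the cat"). The element and
# attribute names are assumptions made for this sketch.
SAMPLE = """
<tree>
  <node rel="top" cat="smain">
    <node rel="su" cat="np">
      <node rel="det" pos="det" root="de" word="De"/>
      <node rel="hd" pos="noun" root="hond" word="hond"/>
    </node>
    <node rel="hd" pos="verb" root="zie" word="ziet"/>
    <node rel="obj1" cat="np">
      <node rel="det" pos="det" root="de" word="de"/>
      <node rel="hd" pos="noun" root="kat" word="kat"/>
    </node>
  </node>
</tree>
"""


def head_root(node):
    """Corpus-specific helper (hypothetical): lemma of a node's lexical head.

    A lexical node carries its lemma in the `root` attribute; for a phrasal
    node we follow the daughter labelled rel="hd" until a lexical node is found.
    """
    if node.get("root") is not None:
        return node.get("root")
    head = node.find("node[@rel='hd']")
    return head_root(head) if head is not None else None


def verb_argument_pairs(tree, rel):
    """Return (verb lemma, dependent head lemma) pairs for relation `rel`."""
    pairs = []
    for clause in tree.iter("node"):
        # A clause qualifies if its head daughter is a verb.
        verb = clause.find("node[@rel='hd'][@pos='verb']")
        if verb is None:
            continue
        # Collect the sister nodes that bear the requested relation.
        for dependent in clause.findall(f"node[@rel='{rel}']"):
            pairs.append((verb.get("root"), head_root(dependent)))
    return pairs


tree = ET.fromstring(SAMPLE)
print(verb_argument_pairs(tree, "su"))
print(verb_argument_pairs(tree, "obj1"))
```

The split between a generic traversal and a small annotation-aware helper mirrors the division the abstract describes: path expressions select the nodes, while the helper keeps knowledge about the annotation scheme in one place.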