Towards a modular data model for multi-layer annotated corpora
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Adding nesting structure to words
Journal of the ACM (JACM)
Molecular event extraction from link grammar parse trees
BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task
Recent advances in a feature-rich framework for treebank annotation
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Towards an alternative implementation of NXT's query language via XQuery
NLPXML '06 Proceedings of the 5th Workshop on NLP and XML: Multi-Dimensional Markup in Natural Language Processing
Mining syntactically annotated corpora with XQuery
LAW '07 Proceedings of the Linguistic Annotation Workshop
Journal of Logic, Language and Information
System for querying syntactically annotated corpora
ACLDemos '09 Proceedings of the ACL-IJCNLP 2009 Software Demonstrations
Fast query for large treebanks
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Efficient indexing and querying over syntactically annotated trees
Proceedings of the VLDB Endowment
Hi-index | 0.00 |
Linguistic research and natural language processing employ large repositories of ordered trees. XML, a standard ordered tree model, and XPath, its associated language, are natural choices for linguistic data and queries. However, several important expressive features required for linguistic queries are missing or hard to express in XPath. In this paper, we motivate and illustrate these features with a variety of linguistic queries. Then we propose extensions to XPath to support linguistic queries, and design an efficient query engine based on a novel labeling scheme. Experiments demonstrate that our language is not only sufficiently expressive for linguistic trees but also efficient for practical usage.