In this paper we discuss current methods for representing corpora annotated at multiple levels of linguistic organization (so-called multi-level or multi-layer corpora). Taking five approaches that are representative of current practice in this area, we discuss the commonalities and differences between them, focusing on their underlying data models. The goal of the paper is to identify the common concerns in multi-layer corpus representation and processing, so as to lay a foundation for a unifying, modular data model.