Building a large annotated corpus of English: the penn treebank
Computational Linguistics - Special issue on using large corpora: II
The FrameNet tagset for frame-semantic and syntactic coding of predicate-argument structure
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Towards a resource for lexical semantics: a large German corpus with extensive semantic annotation
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Automatic annotation for all semantic layers in FrameNet
EACL '06 Proceedings of the Eleventh Conference of the European Chapter of the Association for Computational Linguistics: Posters & Demonstrations
Identifying and analyzing Brazilian Portuguese complex predicates
MWE '11 Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World
Corpus-Based acquisition of support verb constructions for portuguese
PROPOR'12 Proceedings of the 10th international conference on Computational Processing of the Portuguese Language
Learning to detect english and hungarian light verb constructions
ACM Transactions on Speech and Language Processing (TSLP) - Special issue on multiword expressions: From theory to practice and use, part 1
Hi-index | 0.00 |
We present an annotation scheme for the annotation of complex predicates, understood as constructions with more than one lexical unit, each contributing part of the information normally associated with a single predicate. We discuss our annotation guidelines of four types of complex predicates, and the treatment of several difficult cases, related to ambiguity, overlap and coordination. We then discuss the process of marking up the Portuguese CINTIL corpus of 1M tokens (written and spoken) with a new layer of information regarding complex predicates. We also present the outcomes of the annotation work and statistics on the types of CPs that we found in the corpus.