Assessing agreement on classification tasks: the kappa statistic
Computational Linguistics
Message Understanding Conference-6: a brief history
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Fine grained classification of named entities
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
The GENIA corpus: an annotated research abstract corpus in molecular biology domain
HLT '02 Proceedings of the second international conference on Human Language Technology Research
Inter-coder agreement for computational linguistics
Computational Linguistics
Complex linguistic annotation --- no easy way out!: a case from Bangla and Hindi POS labeling tasks
ACL-IJCNLP '09 Proceedings of the Third Linguistic Annotation Workshop
Towards a methodology for named entities annotation
ACL-IJCNLP '09 Proceedings of the Third Linguistic Annotation Workshop
Agile corpus annotation in practice: an overview of manual and automatic annotation of CVs
LAW IV '10 Proceedings of the Fourth Linguistic Annotation Workshop
Heuristic methods for reducing errors of geographic named entities learned by bootstrapping
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Recall-oriented learning of named entities in Arabic Wikipedia
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Tree representations in probabilistic models for extended named entities detection
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
An improved corpus of disease mentions in PubMed citations
BioNLP '12 Proceedings of the 2012 Workshop on Biomedical Natural Language Processing
LAW VI '12 Proceedings of the Sixth Linguistic Annotation Workshop
Hi-index | 0.00 |
Within the framework of the construction of a fact database, we defined guidelines to extract named entities, using a taxonomy based on an extension of the usual named entities definition. We thus defined new types of entities with broader coverage including substantive-based expressions. These extended named entities are hierarchical (with types and components) and compositional (with recursive type inclusion and metonymy annotation). Human annotators used these guidelines to annotate a 1.3M word broadcast news corpus in French. This article presents the definition and novelty of extended named entity annotation guidelines, the human annotation of a global corpus and of a mini reference corpus, and the evaluation of annotations through the computation of inter-annotator agreements. Finally, we discuss our approach and the computed results, and outline further work.