Annotating a Japanese text corpus with predicate-argument and coreference relations

Authors:
Ryu Iida;Mamoru Komachi;Kentaro Inui;Yuji Matsumoto
Affiliations:
Nara Institute of Science and Technology, Ikoma, Nara, Japan;Nara Institute of Science and Technology, Ikoma, Nara, Japan;Nara Institute of Science and Technology, Ikoma, Nara, Japan;Nara Institute of Science and Technology, Ikoma, Nara, Japan
Venue:
LAW '07 Proceedings of the Linguistic Annotation Workshop
Year:
2007

Citing 10
Cited 11

Class-Based Construction of a Verb Lexicon

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
A machine learning approach to coreference resolution of noun phrases

Computational Linguistics - Special issue on computational anaphora resolution
A model-theoretic coreference scoring scheme

MUC6 '95 Proceedings of the 6th conference on Message understanding
Improving machine learning approaches to coreference resolution

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Anaphora resolution by antecedent identification followed by anaphoricity determination

ACM Transactions on Asian Language Information Processing (TALIP)
The Proposition Bank: An Annotated Corpus of Semantic Roles

Computational Linguistics
Learning to resolve bridging references

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Exploiting syntactic patterns as clues in zero-anaphora resolution

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
A logic-based semantic approach to recognizing textual entailment

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
What is coreference, and what should coreference annotation be?

CorefApp '99 Proceedings of the Workshop on Coreference and its Applications

Zero-anaphora resolution by learning rich syntactic pattern features

ACM Transactions on Asian Language Information Processing (TALIP)
A Japanese predicate argument structure analysis using decision lists

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Discriminative approach to predicate-argument structure analysis with zero-anaphora resolution

ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
Accurate learning for Chinese function tags from minimal features

ACLstudent '09 Proceedings of the ACL-IJCNLP 2009 Student Research Workshop
Capturing salience with a trainable cache model for zero-anaphora resolution

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Enhancing the Japanese WordNet

ALR7 Proceedings of the 7th Workshop on Asian Language Resources
Supervised noun phrase coreference research: the first fifteen years

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Beyond NomBank: a study of implicit arguments for nominal predicates

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Predicate argument structure analysis using transformation-based learning

ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
A cross-lingual ILP solution to zero anaphora resolution

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Semantic role labeling of implicit arguments for nominal predicates

Computational Linguistics

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we discuss how to annotate coreference and predicate-argument relations in Japanese written text. There have been research activities for building Japanese text corpora annotated with coreference and predicate-argument relations as are done in the Kyoto Text Corpus version 4.0 (Kawahara et al., 2002) and the GDA-Tagged Corpus (Hasida, 2005). However, there is still much room for refining their specifications. For this reason, we discuss issues in annotating these two types of relations, and propose a new specification for each. In accordance with the specification, we built a large-scaled annotated corpus, and examined its reliability. As a result of our current work, we have released an annotated corpus named the NAIST Text Corpus1, which is used as the evaluation data set in the coreference and zero-anaphora resolution tasks in Iida et al. (2005) and Iida et al. (2006).