Automatic extraction of rules for anaphora resolution of Japanese zero pronouns from aligned sentence pairs

  • Authors:
  • Hiromi Nakaiwa

  • Affiliations:
  • NTT Communication Science Laboratories, Yokosuka-shi, Kanagawa-ken, Japan

  • Venue:
  • ANARESOLUTION '97 Proceedings of a Workshop on Operational Factors in Practical, Robust Anaphora Resolution for Unrestricted Texts
  • Year:
  • 1997

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper proposes a method to extract rules for anaphora resolution of Japanese zero pronouns from aligned sentence pairs. The method focuses on the characteristics of Japanese and English in which both the language families and the distribution of zero pronouns are very different. In this method, zero pronouns in the Japanese sentence and the English translation equivalents of their antecedents are extracted from Japanese and English aligned sentence pairs. Then resolution rules for Japanese zero pronouns are automatically extracted using the pairs of Japanese zero pronouns and translation equivalents of their antecedents in English and equivalent word/phrase pairs which were extracted from the aligned sentence pairs, based on the syntactic and semantic structure of the Japanese sentence. This method was implemented in the Japanese-to-English machine translation system, ALT-J/E. The evaluation showed that, for 371 zero pronouns with deictic reference in a sentence set for the evaluation of Japanese-to-English machine translation systems, the rules which were created automatically from Japanese and English aligned sentence pairs correctly resolved 99.2% of zero pronouns in a window test and 87.6% of zero pronouns in a blind test.