Treebank annotation for formal semantics research

  • Authors:
  • Alastair Butler;Ruriko Otomo;Zhen Zhou;Kei Yoshimoto

  • Affiliations:
  • PRESTO, Japan Science and Technology Agency, Japan,Center for the Advancement of Higher Education, Tohoku University, Japan;Center for the Advancement of Higher Education, Tohoku University, Japan;Graduate School of International Cultural Studies, Tohoku University, Japan;Center for the Advancement of Higher Education, Tohoku University, Japan,Graduate School of International Cultural Studies, Tohoku University, Japan

  • Venue:
  • JSAI-isAI'12 Proceedings of the 2012 international conference on New Frontiers in Artificial Intelligence
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper motivates and describes treebank annotation for Japanese and English following a scheme adapted from the Annotation manual for the Penn Historical Corpora and the PCEEC (Santorini 2010). The purpose of this annotation is to create a syntactic base from which meaning representations can be built automatically on a corpus linguistics scale (thousands of examples). Advantages of the adopted annotation scheme are highlighted. Most notably, marking clause level functional information is essential for deterministically building meaning representations beyond the predicate-argument structure level. Also an internal syntax where phrasal categories are fundamentally similar is of great assistance. Finally, the paper demonstrates how scope information is simple to add when bracketed syntactic structure is inherently flat.