Efficient sentence retrieval based on syntactic structure

  • Authors:
  • Ichikawa Hiroshi;Hakoda Keita;Hashimoto Taiichi;Tokunaga Takenobu

  • Affiliations:
  • Tokyo Institute of Technology;Tokyo Institute of Technology;Tokyo Institute of Technology;Tokyo Institute of Technology

  • Venue:
  • COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
  • Year:
  • 2006

Abstract

This paper proposes an efficient method of sentence retrieval based on syntactic structure. Collins proposed the Tree Kernel to calculate structural similarity. However, structural retrieval based on the Tree Kernel is not practicable because the index table it requires becomes impractically large. We propose two more efficient algorithms that approximate the Tree Kernel: Tree Overlapping and Subpath Set. These algorithms are more efficient than the Tree Kernel because their indexes can be built with practical computational resources. Experiments comparing the three algorithms showed that structural retrieval with Tree Overlapping and Subpath Set was 100 times and 1,000 times faster, respectively, than retrieval with the Tree Kernel.
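
To make the similarity measures named in the abstract more concrete, here is a minimal sketch in Python. The tree kernel part follows the well-known recursive formulation of the Collins and Duffy kernel, which counts the tree fragments two parse trees share. The subpath part is only an assumed reading of "Subpath Set": it treats a subpath as the label sequence along any contiguous downward path and scores two trees by the number of subpaths they have in common. The abstract does not define the measure, so the paper's exact definition may differ. The nested-tuple tree encoding and all function names are hypothetical choices made for illustration, and Tree Overlapping is not sketched here.

```python
from itertools import product

# A tree is a nested tuple (label, child, ...); a word is a plain string.
# This encoding is an assumption made for the sketch, not the paper's format.

def label(node):
    return node[0] if isinstance(node, tuple) else node

def nodes(tree):
    """Yield every internal node of the tree."""
    if isinstance(tree, tuple):
        yield tree
        for child in tree[1:]:
            yield from nodes(child)

def production(node):
    """Grammar rule at a node: parent label followed by child labels."""
    return (node[0],) + tuple(label(c) for c in node[1:])

def common_fragments(n1, n2, lam=1.0):
    """Weighted count of shared tree fragments rooted at n1 and n2."""
    if production(n1) != production(n2):
        return 0.0
    if all(isinstance(c, str) for c in n1[1:]):  # preterminal node
        return lam
    score = lam
    for c1, c2 in zip(n1[1:], n2[1:]):
        if isinstance(c1, tuple) and isinstance(c2, tuple):
            score *= 1.0 + common_fragments(c1, c2, lam)
    return score

def tree_kernel(t1, t2, lam=1.0):
    """Sum the shared-fragment counts over all pairs of nodes."""
    return sum(common_fragments(a, b, lam)
               for a, b in product(nodes(t1), nodes(t2)))

def subpaths(tree):
    """Label sequences along every contiguous downward path (assumed definition)."""
    found = set()

    def walk(node, open_paths):
        # Extend each path that ends at the parent, and start a new path here.
        extended = [p + (label(node),) for p in open_paths] + [(label(node),)]
        found.update(extended)
        if isinstance(node, tuple):
            for child in node[1:]:
                walk(child, extended)

    walk(tree, [])
    return found

def subpath_similarity(t1, t2):
    """Number of subpaths the two trees share."""
    return len(subpaths(t1) & subpaths(t2))

t1 = ("S", ("NP", ("N", "dogs")), ("VP", ("V", "bark")))
t2 = ("S", ("NP", ("N", "cats")), ("VP", ("V", "bark")))

print(tree_kernel(t1, t2))         # 10.0
print(subpath_similarity(t1, t2))  # 15
```

The design difference relevant to retrieval is visible even in this toy version: the kernel compares every pair of nodes at query time, whereas a set of subpaths can be computed once per sentence and stored, so candidate sentences could in principle be looked up through an ordinary inverted index keyed by subpath.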