MAXLCA: a new query semantic model for XML keyword search

  • Authors:
  • Ning Gao;Zhi-Hong Deng;Jia-Jian Jiang;HANG YU

  • Affiliations:
  • School of Electronic Engineering and Computer Science, Peking University;School of Electronic Engineering and Computer Science, Peking University;School of Electronic Engineering and Computer Science, Peking University;School of Electronic Engineering and Computer Science, Peking University

  • Venue:
  • Journal of Web Engineering
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Keyword search enables web users to easily access XML data without understanding the complex data schemas. However, the ambiguity of keyword search makes it arduous to select qualified data nodes matching keywords. To address this challenge in XML datasets whose documents have a relatively low average size, we present a new keyword query semantic model: MAXimal Lowest Common Ancestor (MAXLCA). MAXLCA can effectively avoid false negative problem observed in ELCA, SLCA and XSeek. Furthermore, we construct an algorithm GMAX for MAXLCA-based queries that is proved efficient in evaluations. Experiments on INEX show that the search engine using MAXLCA and GMAX outperforms in all three comparative criteria: effective, efficient and processing scalability.