Identifying relevant matches with NOT semantics over XML documents

  • Authors:
  • Rung-Ren Lin;Ya-Hui Chang;Kun-Mao Chao

  • Affiliations:
  • Department of Computer Science and Information Engineering, National Taiwan Ocean University, Keelung, Taiwan;Department of Computer Science and Engineering, National Taiwan Ocean University, Keelung, Taiwan;Department of Computer Science and Information Engineering and Graduate Institute of Biomedical Electronics and Bioinformatics and Graduate Institute of Networking and Multimedia, National Taiwan ...

  • Venue:
  • DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications - Volume Part I
  • Year:
  • 2011

Abstract

Keyword search over XML documents has been widely studied in recent years. It allows users to retrieve relevant data from XML documents without learning complicated query languages. SLCA (smallest lowest common ancestor)-based keyword search is a common mechanism for locating the desired LCAs for the given query keywords, but conventional SLCA-based keyword search supports AND-only semantics. In this paper, we extend SLCA keyword search to a more general case, in which the keyword query can be an arbitrary combination of AND, OR, and NOT operators. We further define the query result based on the monotonicity and consistency properties, and propose an efficient algorithm to compute the SLCAs and the relevant matches. Because the keyword query becomes more complex, we also discuss the variations of the monotonicity and consistency properties in our framework. Finally, experimental results, evaluated in terms of processing time, scalability, precision, and recall, show that the proposed algorithm runs efficiently and returns reasonable query results.
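
For readers unfamiliar with the term, the conventional AND-only SLCA notion mentioned in the abstract can be illustrated with a small sketch. This is not the algorithm proposed in the paper; it is a naive, brute-force illustration under stated assumptions: each XML node is identified by a Dewey label (a tuple of child positions from the root), and each keyword comes with the list of nodes that match it. The function names `lca` and `slca` and the sample labels are hypothetical.

```python
from itertools import product


def lca(a, b):
    """Lowest common ancestor of two Dewey labels: their longest common prefix."""
    prefix = []
    for x, y in zip(a, b):
        if x != y:
            break
        prefix.append(x)
    return tuple(prefix)


def is_proper_ancestor(a, d):
    """True if label a is a strict prefix of label d."""
    return len(a) < len(d) and d[:len(a)] == a


def slca(match_lists):
    """Naive AND-only SLCA.

    match_lists holds one list of Dewey labels per query keyword.
    Returns the LCAs that cover every keyword and have no proper
    descendant that also covers every keyword.
    """
    if not match_lists or any(not matches for matches in match_lists):
        return []
    # Collect the LCA of every combination of one match per keyword.
    candidates = set()
    for combo in product(*match_lists):
        node = combo[0]
        for other in combo[1:]:
            node = lca(node, other)
        candidates.add(node)
    # Keep only the "smallest" candidates (those with no candidate below them).
    return sorted(
        c for c in candidates
        if not any(is_proper_ancestor(c, d) for d in candidates)
    )


# Hypothetical match lists for two keywords in a small document tree.
xml_matches = [(0, 0, 1), (0, 1, 0)]
search_matches = [(0, 0, 2), (0, 2, 0)]
result = slca([xml_matches, search_matches])
```

In this sample, the node labeled `(0, 0)` contains a match for both keywords, so the root `(0,)` is discarded as a non-smallest LCA. The enumeration is exponential in the number of keywords and serves only to make the definition concrete; handling OR and NOT operators, as the paper does, requires a definition of the query result that this sketch does not attempt.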