Greedy Cached Query Rewriting in Content-Oriented XML Web Engine

  • Authors:
  • Jun Gao; Tang Shi Wei; Dongqing Yang; Tengjiao Wang

  • Venue:
  • IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
  • Year:
  • 2002

Abstract

As XML emerges as the standard for information representation and exchange on the Web, it is becoming the primary choice in many e-commerce applications, and XML querying in web applications is receiving increasing attention. Unlike traditional web search, an XML query directly returns the needed information, and its engine caches query results rather than physical web pages. Such a query is also called a Content-Oriented Web query. In an XML query engine, using the cached material to answer a user query is more difficult than in a traditional web search engine, because the task reduces to answering semi-structured queries using views, which is essentially an NP problem. Traditional algorithms take time exponential in the size of the query and views, and they do not take into account information generated by the specific application. This paper proposes a novel method for the XML query engine environment that uses information available in the engine as heuristic knowledge to prune the exponential space of query rewriting plans. Theoretical analysis shows that the algorithm runs in time polynomial in the size of the query and view definitions. Preliminary results from our prototype, COMMIX, show that the algorithm is valid and efficient.
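
To make the idea of greedy, heuristic-guided rewriting concrete, the following is a minimal sketch and not the algorithm from the paper, whose details the abstract does not give. It assumes a simplified model in which a query is a set of path expressions and each cached view records the paths whose results it holds. The names `CachedView`, `greedy_rewrite`, and the `hits` statistic are hypothetical and chosen for illustration only. Instead of enumerating every combination of views, the sketch repeatedly picks the single view that covers the most still-unanswered paths, so the work stays polynomial in the number of views and paths.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CachedView:
    name: str
    paths: frozenset  # path expressions whose results this cached view holds
    hits: int = 0     # hypothetical usage statistic kept by the engine


def greedy_rewrite(query_paths, views):
    """Pick cached views greedily; return (chosen view names, uncovered paths)."""
    remaining = set(query_paths)
    chosen = []
    candidates = list(views)
    while remaining and candidates:
        # Heuristic: most newly covered paths first, then most cache hits.
        best = max(candidates, key=lambda v: (len(v.paths & remaining), v.hits))
        if not best.paths & remaining:
            break  # no cached view helps with what is left
        chosen.append(best.name)
        remaining -= best.paths
        candidates.remove(best)
    return chosen, remaining


# Illustrative data only; the paths and view names are made up.
views = [
    CachedView("v_books", frozenset({"/shop/book/title", "/shop/book/price"}), hits=12),
    CachedView("v_titles", frozenset({"/shop/book/title"}), hits=40),
    CachedView("v_authors", frozenset({"/shop/book/author"}), hits=3),
]
query = {"/shop/book/title", "/shop/book/price", "/shop/book/isbn"}
plan, missing = greedy_rewrite(query, views)
print(plan, sorted(missing))
```

In this toy run the plan uses only `v_books`, and the `isbn` path stays uncovered, so it would have to be fetched from the original source. A greedy choice like this can miss the best combination of views; that is the usual trade-off for avoiding an exponential search.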