Fast ELCA computation for keyword queries on XML data

Authors:
Rui Zhou;Chengfei Liu;Jianxin Li
Affiliations:
Swinburne University of Technology, Melbourne, VIC, Australia;Swinburne University of Technology, Melbourne, VIC, Australia;Swinburne University of Technology, Melbourne, VIC, Australia
Venue:
Proceedings of the 13th International Conference on Extending Database Technology
Year:
2010

Citing 21
Cited 18

Querying XML Documents Made Easy: Nearest Concept Queries

Proceedings of the 17th International Conference on Data Engineering
Proximity Search in Databases

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
XRANK: ranked keyword search over XML documents

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
DBXplorer: A System for Keyword-Based Search over Relational Databases

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Keyword Searching and Browsing in Databases using BANKS

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Efficient keyword search for smallest LCAs in XML databases

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Bidirectional expansion for keyword search on graph databases

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Effective keyword search in relational databases

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Multiway SLCA-based keyword search in XML data

Proceedings of the 16th international conference on World Wide Web
Spark: top-k keyword query in relational databases

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
BLINKS: ranked keyword searches on graphs

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Identifying meaningful return information for XML keyword search

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Discover: keyword search in relational databases

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
XSEarch: a semantic search engine for XML

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Schema-free XQuery

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Effective keyword search for valuable lcas over xml documents

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Efficient LCA based keyword search in XML data

EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Keyword proximity search in complex data graphs

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Reasoning and identifying relevant matches for XML keyword search

Proceedings of the VLDB Endowment
Retrieving meaningful relaxed tightest fragments for XML keyword search

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Keyword search in databases: the power of RDBMS

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data

Relevant answers for XML keyword search: a skyline approach

WISE'10 Proceedings of the 11th international conference on Web information systems engineering
Improving the performance of identifying contributors for XML keyword search

ACM SIGMOD Record
Adaptive and effective keyword search for XML

PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
K-graphs: selecting top-k data sources for XML keyword queries

DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part I
Processing keyword search on XML: a survey

World Wide Web
Top-K data source selection for keyword queries over multiple XML data sources

Journal of Information Science
MAXLCA: a new query semantic model for XML keyword search

Journal of Web Engineering
MALEX: a MAp-like exploration model on XML database

KEYS '12 Proceedings of the Third International Workshop on Keyword Search on Structured Data
Fast result enumeration for keyword queries on XML data

DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part I
Top-Down SLCA computation based on list partition

DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part I
Efficiently identifying contributors for XML keyword search

DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part I
XML filtering with XPath expressions containing parent and ancestor axes

Information Sciences: an International Journal
An extended compact TVP index for finding top-k nearest neighbors over XML data tree

WISE'12 Proceedings of the 13th international conference on Web Information Systems Engineering
ELCA evaluation for keyword search on probabilistic XML data

World Wide Web
Top-down keyword query processing on XML data

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Semantics-based keyword search over XML and relational databases

Proceedings of the Fourth Symposium on Information and Communication Technology
Efficient query processing for XML keyword queries based on the IDList index

The VLDB Journal — The International Journal on Very Large Data Bases
XML keyword search with promising result type recommendations

World Wide Web

Quantified Score

Hi-index	0.00

Visualization

Abstract

Keyword search is integrated in many applications on account of the convenience to convey users' query intention. Recently, answering keyword queries on XML data has drawn the attention of web and database communities, because the success of this research will relieve users from learning complex XML query languages, such as XPath/XQuery, and/or knowing the underlying schema of the queried XML data. As a result, information in XML data can be discovered much easier. To model the result of answering keyword queries on XML data, many LCA (lowest common ancestor) based notions have been proposed. In this paper, we focus on ELCA (Exclusive LCA) semantics, which is first proposed by Guo et al. and afterwards named by Xu and Papakonstantinou. We propose an algorithm named Hash Count to find ELCAs efficiently. Our analysis shows the complexity of Hash Count algorithm is O(kd|S1|), where k is the number of keywords, d is the depth of the queried XML document and |S1| is the frequency of the rarest keyword. This complexity is the best result known so far. We also evaluate the algorithm on a real DBLP dataset, and compare it with the state-of-the-art algorithms. The experimental results demonstrate the advantage of Hash Count algorithm in practice.