Automatically generating structured queries in XML keyword search

Authors:
Felipe Da C. Hummel;Altigran S. Da Silva;Mirella M. Moro;Alberto H. F. Laender
Affiliations:
Departamento de Ciência da Computação, Universidade Federal do Amazonas, Manaus, Brazil;Departamento de Ciência da Computação, Universidade Federal do Amazonas, Manaus, Brazil;Departamento de Ciência da Computação, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil;Departamento de Ciência da Computação, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
Venue:
INEX'10 Proceedings of the 9th international conference on Initiative for the evaluation of XML retrieval: comparative evaluation of focused retrieval
Year:
2010

Citing 21
Cited 1

XRANK: ranked keyword search over XML documents

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
DBXplorer: A System for Keyword-Based Search over Relational Databases

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Efficient keyword search for smallest LCAs in XML databases

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
NaLIX: an interactive natural language interface for querying XML

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Keyword Proximity Search in XML Trees

IEEE Transactions on Knowledge and Data Engineering
LABRADOR: Efficiently publishing relational databases on the web by using keyword-based query interfaces

Information Processing and Management: an International Journal
Multiway SLCA-based keyword search in XML data

Proceedings of the 16th international conference on World Wide Web
Identifying meaningful return information for XML keyword search

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Discover: keyword search in relational databases

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
BANKS: browsing and keyword searching in relational databases

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
XSEarch: a semantic search engine for XML

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Efficient IR-style keyword search over relational databases

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Schema-free XQuery

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Objectrank: authority-based keyword search in databases

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
FleDEx: flexible data exchange

Proceedings of the 9th annual ACM international workshop on Web information and data management
Effective keyword search for valuable lcas over xml documents

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
An effective and versatile keyword search engine on heterogenous data sources

Proceedings of the VLDB Endowment
SUITS: Faceted User Interface for Constructing Structured Queries from Keywords

DASFAA '09 Proceedings of the 14th International Conference on Database Systems for Advanced Applications
A Probabilistic Retrieval Model for Semistructured Data

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
SPARK: A Keyword Search Engine on Relational Databases

ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
An X-ray on web-available XML schemas

ACM SIGMOD Record

Estimating structural relevance of XML elements through language model

Proceedings of the 10th Conference on Open Research Areas in Information Retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we present a novel method for automatically deriving structured XML queries from keyword-based queries and show how it was applied to the experimental tasks proposed for the INEX 2010 data-centric track. In our method, called StruX, users specify a schema-independent unstructured keyword-based query and it automatically generates a top-k ranking of schemaaware queries based on a target XML database. Then, one of the top ranked structured queries can be selected, automatically or by a user, to be executed by an XML query engine. The generated structured queries are XPath expressions consisting of an entity path (e.g., dblp/article) and predicates (e.g., /dblp/article[author="john" and title="xml"]). We use the concept of entity, commonly adopted in the XML keyword search literature, to define suitable root nodes for the query results. Also, StruX uses IR techniques to determine in which elements a term is more likely to occur.