Machine learning for online query relaxation

Authors:
Ion Muslea
Affiliations:
SRI International, Menlo Park, CA
Venue:
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Year:
2004

Citing 19
Cited 16

SEAVE: a mechanism for verifying user presuppositions in query systems

ACM Transactions on Information Systems (TOIS)
Providing Quality Responses with Natural Language Interfaces: The Null Value Problem

IEEE Transactions on Software Engineering
C4.5: programs for machine learning

C4.5: programs for machine learning
Query answering via cooperative data inference

Journal of Intelligent Information Systems
CoBase: a scalable and extensible cooperative information system

Journal of Intelligent Information Systems - Special issue on intelligent integration of information
An error-based conceptual clustering method for providing approximate query answers

Communications of the ACM - Electronic supplement to the December issue
Flexible queries over semistructured data

PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
XIRQL: a query language for information retrieval in XML documents

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Modern Information Retrieval

Modern Information Retrieval
VisDB: Database Exploration Using Multidimensional Visualization

IEEE Computer Graphics and Applications
FLEX: A Tolerant and Cooperative User Interface to Databases

IEEE Transactions on Knowledge and Data Engineering
Cooperative Answering through Controlled Query Relaxation

IEEE Expert: Intelligent Systems and Their Applications
Tree Pattern Relaxation

EDBT '02 Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology
Generalization and a Framework for Query Modification

Proceedings of the Sixth International Conference on Data Engineering
Using Type Inference and Induced Rules to Provide Intensional Answers

Proceedings of the Seventh International Conference on Data Engineering
Cooperative Responses to Boolean Queries

Proceedings of the First International Conference on Data Engineering
Evaluating Top-k Selection Queries

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Adding Relevance to XML

Selected papers from the Third International Workshop WebDB 2000 on The World Wide Web and Databases
Query relaxation for xml model

Query relaxation for xml model

Relaxing join and selection queries

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
LACO: A location-aware cooperative query system for securely personalized services

Expert Systems with Applications: An International Journal
Empty versus overabundant answers to flexible relational queries

Fuzzy Sets and Systems
Wildcards for lightweight information integration in virtual desktops

Proceedings of the 17th ACM conference on Information and knowledge management
Answering approximate queries over autonomous web databases

Proceedings of the 18th international conference on World wide web
Supporting queries with imprecise constraints

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Online query relaxation via Bayesian causal structures discovery

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Incremental controlled relaxation of failing flexible queries

Journal of Intelligent Information Systems
Cooperative answering to flexible queries via a tolerance relation

ISMIS'08 Proceedings of the 17th international conference on Foundations of intelligent systems
An effective query relaxation solution for the deep web

APWeb'08 Proceedings of the 10th Asia-Pacific web conference on Progress in WWW research and development
QRelX: generating meaningful queries that provide cardinality assurance

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Search space reduction for an efficient handling of empty answers in database flexible querying

KES'10 Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part I
Approximating query answering on RDF databases

World Wide Web
Flexible query answering in data cubes

DaWaK'05 Proceedings of the 7th international conference on Data Warehousing and Knowledge Discovery
A method based on query caching and predicate substitution for the treatment of failing database queries

ICCBR'10 Proceedings of the 18th international conference on Case-Based Reasoning Research and Development
Taxonomy-Based Fragmentation for Anti-instantiation in Distributed Databases

UCC '13 Proceedings of the 2013 IEEE/ACM 6th International Conference on Utility and Cloud Computing

Quantified Score

Hi-index	0.01

Visualization

Abstract

In this paper we provide a fast, data-driven solution to the failing query problem: given a query that returns an empty answer, how can one relax the query's constraints so that it returns a non-empty set of tuples? We introduce a novel algorithm, loqr, which is designed to relax queries that are in the disjunctive normal form and contain a mixture of discrete and continuous attributes. loqr discovers the implicit relationships that exist among the various domain attributes and then uses this knowledge to relax the constraints from the failing query.In a first step, loqr uses a small, randomly-chosen subset of the target database to learn a set of decision rules that predict whether an attribute's value satisfies the constraints in the failing query; this query-driven operation is performed online for each failing query. In the second step, loqr uses nearest-neighbor techniques to find the learned rule that is the most similar to the failing query; then it uses the attributes' values from this rule to relax the failing query's constraints. Our experiments on six application domains show that loqr is both robust and fast: it successfully relaxes more than 95% of the failing queries, and it takes under a second for processing queries that consist of up to 20 attributes (larger queries of up to 93 attributes are processed in several seconds).