This paper addresses the problem of evaluating ranked top-k queries with expensive predicates. As major DBMSs now all support expensive user-defined predicates for Boolean queries, we believe such support for ranked queries will be even more important. First, ranked queries often need to model user-specific concepts of preference, relevance, or similarity, which call for dynamic user-defined functions. Second, middleware systems must incorporate external predicates for integrating autonomous sources that are typically accessible only by per-object queries. Third, fuzzy joins are inherently expensive, as they are essentially user-defined operations that dynamically associate multiple relations. These predicates, being dynamically defined or externally accessed, cannot rely on index mechanisms to provide zero-time sorted output, and instead require a per-object probe to evaluate. The current standard sort-merge framework for ranked queries cannot handle such predicates efficiently, because it must probe all objects completely before sorting and merging them to produce the top-k answers. To minimize expensive probes, we thus develop the formal principle of "necessary probes," which determines whether a probe is absolutely required. We then propose Algorithm MPro which, by implementing the principle, is provably optimal with minimal probe cost. Further, we show that MPro scales well and can be easily parallelized. Our experiments using both a real-estate benchmark database and synthetic datasets show that MPro enables significant probe reduction and can be orders of magnitude faster than the standard scheme using complete probing.
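
The idea of probing only when necessary can be illustrated with a minimal sketch. It is not the paper's MPro implementation, and everything specific in it is an assumption made for illustration: scores lie in [0, 1], the combining function is monotonic (here min), one cheap score is already known for every object, and the names top_k_lazy, cheap_score, and predicates are hypothetical. Each object carries an upper bound on its final score, obtained by treating every unprobed predicate as 1.0. Objects sit in a priority queue ordered by that bound, and an expensive predicate is evaluated only for the object currently at the head of the queue.

```python
import heapq
from typing import Callable, List, Sequence, Tuple


def top_k_lazy(
    objects: Sequence[str],
    cheap_score: Callable[[str], float],
    predicates: Sequence[Callable[[str], float]],
    k: int,
    combine: Callable[[List[float]], float] = min,
) -> Tuple[List[Tuple[str, float]], int]:
    """Return the top-k (object, score) pairs and the number of probes used.

    Illustrative sketch only: assumes scores in [0, 1] and a monotonic
    `combine`, so replacing unknown scores with 1.0 yields an upper bound.
    """
    probes = 0
    heap = []  # entries: (-upper_bound, insertion_index, object, known_scores)
    for idx, obj in enumerate(objects):
        known = [cheap_score(obj)]
        upper = combine(known + [1.0] * len(predicates))
        heapq.heappush(heap, (-upper, idx, obj, known))

    results: List[Tuple[str, float]] = []
    while heap and len(results) < k:
        neg_upper, idx, obj, known = heapq.heappop(heap)
        n_probed = len(known) - 1
        if n_probed == len(predicates):
            # Fully probed and still at the head: its exact score is at least
            # every other object's upper bound, so it is a safe answer.
            results.append((obj, -neg_upper))
            continue
        # Probe the next expensive predicate for the head object only.
        known = known + [predicates[n_probed](obj)]
        probes += 1
        remaining = len(predicates) - n_probed - 1
        upper = combine(known + [1.0] * remaining)
        heapq.heappush(heap, (-upper, idx, obj, known))
    return results, probes


# Hypothetical data: one cheap score and one expensive predicate per object.
cheap = {"a": 0.9, "b": 0.8, "c": 0.3, "d": 0.2}
costly = {"a": 0.5, "b": 0.85, "c": 0.9, "d": 0.9}
answers, probe_count = top_k_lazy(
    list(cheap), cheap.get, [costly.get], k=1
)
print(answers, probe_count)
```

On this toy input the sketch probes only objects a and b before returning b, whereas complete probing would evaluate the expensive predicate on all four objects. Objects c and d are never probed because their upper bounds (0.3 and 0.2) already fall below the exact score of the answer.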