Text is a pervasive information type, and many applications must query text sources in addition to structured data. This paper studies query processing in a system that loosely integrates an extensible database system with a text retrieval system. We focus on a class of conjunctive queries that include joins between text and structured data, as well as selections over both types of data. We adapt techniques from distributed query processing and introduce a novel class of probing-based join methods that is especially useful for joins with text systems, and we present a cost model for the alternative query processing methods. Experimental results confirm the utility of these methods. Because the additional techniques enlarge the space of query plans, we describe an optimization algorithm for searching this extended space. The techniques described in this paper also apply to other types of external data managers that are loosely integrated with a database system.
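To make the idea of a join between structured data and a text system more concrete, the following minimal Python sketch contrasts two strategies. It is an illustration under stated assumptions, not the paper's algorithms: `ToyTextSystem`, `tuple_at_a_time_join`, `probe_and_cache_join`, and the sample data are all hypothetical names introduced here. The second function uses a simple probe-and-cache scheme (one search per distinct join value) as a stand-in for the general intuition behind probing, namely reducing the number of calls sent to the external text system.

```python
class ToyTextSystem:
    """Hypothetical stand-in for an external text retrieval system."""

    def __init__(self, docs):
        self.docs = docs      # maps doc_id -> document text
        self.calls = 0        # number of searches issued (a crude cost measure)

    def search(self, *terms):
        """Return ids of documents containing every term as a whole word."""
        self.calls += 1
        return {
            doc_id
            for doc_id, text in self.docs.items()
            if all(t.lower() in text.lower().split() for t in terms)
        }


def tuple_at_a_time_join(rows, column, text_system, keyword):
    """Issue one text search per relational tuple."""
    result = []
    for row in rows:
        for doc_id in sorted(text_system.search(row[column], keyword)):
            result.append((row["id"], doc_id))
    return result


def probe_and_cache_join(rows, column, text_system, keyword):
    """Issue one text search per distinct join value and reuse the answer.

    Assumption: this caching scheme is a simplification chosen for
    illustration; it is not claimed to be the method from the paper.
    """
    cache = {}
    result = []
    for row in rows:
        value = row[column]
        if value not in cache:
            # Empty answers are cached too, so later tuples with a
            # non-matching value cost no further calls.
            cache[value] = sorted(text_system.search(value, keyword))
        for doc_id in cache[value]:
            result.append((row["id"], doc_id))
    return result


# Hypothetical sample data.
DOCS = {
    1: "smith writes about query optimization",
    2: "jones surveys text retrieval",
    3: "smith and jones on query processing",
}
ROWS = [
    {"id": 10, "author": "smith"},
    {"id": 11, "author": "smith"},
    {"id": 12, "author": "lee"},
    {"id": 13, "author": "lee"},
]
```

In this toy setting both functions return the same join result, but the second issues one search per distinct author rather than one per tuple. A cost model of the kind the abstract mentions would weigh such call counts against the work done inside the database system.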