Efficient IR-style keyword search over relational databases

Authors:
Vagelis Hristidis;Luis Gravano;Yannis Papakonstantinou
Affiliations:
UC, San Diego;Columbia University;UC, San Diego
Venue:
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Year:
2003

Citing 12
Cited 150

Automatic text processing

Automatic text processing
Preemptive priority-based scheduling: an appropriate engineering approach

Advances in real-time systems
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Integrating keyword search into XML query processing

Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Optimal aggregation algorithms for middleware

PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
PREFER: a system for the efficient execution of multi-parametric ranked queries

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Supporting Ranked Boolean Similarity Queries in MARS

IEEE Transactions on Knowledge and Data Engineering
Proximity Search in Databases

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Supporting Incremental Join Queries on Ranked Inputs

Proceedings of the 27th International Conference on Very Large Data Bases
XRANK: ranked keyword search over XML documents

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Keyword Searching and Browsing in Databases using BANKS

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Discover: keyword search in relational databases

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases

Incorporating Updates in Domain Indexes: Experiences with Oracle Spatial R-trees

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Rank-aware query optimization

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Querying web metadata: Native score management and text support in databases

ACM Transactions on Database Systems (TODS)
Guiding queries to information sources with InfoBeacons

Proceedings of the 5th ACM/IFIP/USENIX international conference on Middleware
Efficient Inverted Lists and Query Algorithms for Structured Value Ranking in Update-Intensive Relational Databases

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Adaptive stream filters for entity-based queries with non-value tolerance

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Bidirectional expansion for keyword search on graph databases

VLDB '05 Proceedings of the 31st international conference on Very large data bases
The SphereSearch engine for unified ranked retrieval of heterogeneous XML and web documents

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Quality-driven approximate methods for integrating GIS data

Proceedings of the 13th annual ACM international workshop on Geographic information systems
Improving intranet search-engines using context information from databases

Proceedings of the 14th ACM international conference on Information and knowledge management
Report on the DB/IR panel at SIGMOD 2005

ACM SIGMOD Record
Performance of query processing implementations in ranking-based text retrieval systems using inverted indices

Information Processing and Management: an International Journal
Principles of dataspace systems

Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Efficiently linking text documents with relevant structured information

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
NUITS: a novel user interface for efficient keyword search over databases

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Probabilistic information retrieval approach for ranking of database query results

ACM Transactions on Database Systems (TODS)
A system for query-specific document summarization

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
An efficient location update mechanism for continuous queries over moving objects

Information Systems
LABRADOR: Efficiently publishing relational databases on the web by using keyword-based query interfaces

Information Processing and Management: an International Journal
Spark: top-k keyword query in relational databases

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Effective keyword-based selection of relational databases

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
BLINKS: ranked keyword searches on graphs

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Keyword search on relational data streams

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Towards keyword-driven analytical processing

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
CLASCN: candidate network selection for efficient top-k keyword queries over databases

Journal of Computer Science and Technology
Effective keyword search for valuable lcas over xml documents

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Efficient keyword search over virtual XML views

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Authority-based keyword search in databases

ACM Transactions on Database Systems (TODS)
Synthesizing structured text from logical database subsets

EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Efficiently enumerating results of keyword search over data graphs

Information Systems
CSV: visualizing and mining cohesive subgraphs

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
SQAK: doing more with keywords

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
EASE: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
A graph method for keyword-based selection of the top-K databases

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Keyword proximity search in complex data graphs

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Utilization of intelligent agents for supporting citizens in their access to e-government services

Web Intelligence and Agent Systems
Research on personal dataspace management

Proceedings of the 2nd SIGMOD PhD workshop on Innovative database research
Augmenting Data Retrieval with Information Retrieval Techniques by Using Word Similarity

NLDB '08 Proceedings of the 13th international conference on Natural Language and Information Systems: Applications of Natural Language to Information Systems
Query Planning for Searching Inter-dependent Deep-Web Databases

SSDBM '08 Proceedings of the 20th international conference on Scientific and Statistical Database Management
Progressive Ranking for Efficient Keyword Search over Relational Databases

BNCOD '08 Proceedings of the 25th British national conference on Databases: Sharing Data, Information and Knowledge
Relaxation in text search using taxonomies

Proceedings of the VLDB Endowment
Keyword query cleaning

Proceedings of the VLDB Endowment
Keyword search on external memory data graphs

Proceedings of the VLDB Endowment
An effective and versatile keyword search engine on heterogenous data sources

Proceedings of the VLDB Endowment
Retune: Retrieving and Materializing Tuple Units for Effective Keyword Search over Relational Databases

ER '08 Proceedings of the 27th International Conference on Conceptual Modeling
The Research on the Algorithms of Keyword Search in Relational Database

Advanced Web and NetworkTechnologies, and Applications
Answering aggregate keyword queries on relational databases using minimal group-bys

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Data clouds: summarizing keyword search results over structured data

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Finding frequent co-occurring terms in relational keyword search

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Efficient keyword search over virtual XML views

The VLDB Journal — The International Journal on Very Large Data Bases
Hierarchical result views for keyword queries over relational databases

Proceedings of the First International Workshop on Keyword Search on Structured Data
Query segmentation using conditional random fields

Proceedings of the First International Workshop on Keyword Search on Structured Data
Do we mean the same?: disambiguation of extracted keyword queries for database search

Proceedings of the First International Workshop on Keyword Search on Structured Data
Combining keyword search and forms for ad hoc querying of databases

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Keyword search in databases: the power of RDBMS

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Efficient type-ahead search on relational data: a TASTIER approach

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Answering web queries using structured data sources

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Benchmarking Fulltext Search Performance of RDF Stores

ESWC 2009 Heraklion Proceedings of the 6th European Semantic Web Conference on The Semantic Web: Research and Applications
Keyword search over relational tables and streams

ACM Transactions on Database Systems (TODS)
Efficient IR-Style Search over Web Services

CAiSE '09 Proceedings of the 21st International Conference on Advanced Information Systems Engineering
A practical method for browsing a relational database using a standard search engine

Integrated Computer-Aided Engineering - Selected papers from the IEEE Conference on Information Reuse and Integration (IRI), July 13-15, 2008
Information discovery across multiple streams

Information Sciences: an International Journal
SAIL: Structure-aware indexing for effective and progressive top-k keyword search over XML documents

Information Sciences: an International Journal
XKMis: effective and efficient keyword search in XML databases

IDEAS '09 Proceedings of the 2009 International Database Engineering & Applications Symposium
Efficient keyword proximity search using a frontier-reduce strategy based on d-distance graph index

IDEAS '09 Proceedings of the 2009 International Database Engineering & Applications Symposium
Hermes: Data Web search on a pay-as-you-go integration infrastructure

Web Semantics: Science, Services and Agents on the World Wide Web
MING: mining informative entity relationship subgraphs

Proceedings of the 18th ACM conference on Information and knowledge management
Finding and ranking compact connected trees for effective keyword proximity search in XML documents

Information Systems
Structured search result differentiation

Proceedings of the VLDB Endowment
Subspace Discovery for Promotion: A Cell Clustering Approach

DS '09 Proceedings of the 12th International Conference on Discovery Science
Cluster-Based Exploration for Effective Keyword Search over Semantic Datasets

ER '09 Proceedings of the 28th International Conference on Conceptual Modeling
EasyKSORD: A Platform of Keyword Search Over Relational Databases

WISM '09 Proceedings of the International Conference on Web Information Systems and Mining
PerK: personalized keyword search in relational databases through preferences

Proceedings of the 13th International Conference on Extending Database Technology
WSXplorer: searching for desired web services

CAiSE'07 Proceedings of the 19th international conference on Advanced information systems engineering
Study on efficiency and effectiveness of KSORD

APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
Efficient keyword search over data-centric XML documents

APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
Graph-based concept identification and disambiguation for enterprise search

Proceedings of the 19th international conference on World wide web
ITREKS: keyword search over relational database by indexing tuple relationship

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
QuickCN: a combined approach for efficient keyword search over databases

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Structured annotations of web queries

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
DivQ: diversification for keyword search over structured databases

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Web services discovery and rank: An information retrieval approach

Future Generation Computer Systems
Structural consistency: enabling XML keyword search to eliminate spurious results consistently

The VLDB Journal — The International Journal on Very Large Data Bases
Querying Wikipedia documents and relationships

Procceedings of the 13th International Workshop on the Web and Databases
WikiAnalytics: disambiguation of keyword search results on highly heterogeneous structured data

Procceedings of the 13th International Workshop on the Web and Databases
Structured data retrieval using cover density ranking

Proceedings of the 2nd International Workshop on Keyword Search on Structured Data
Editorial: BioDB: An ontology-enhanced information system for heterogeneous biological information

Data & Knowledge Engineering
FACeTOR: cost-driven exploration of faceted query results

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
A framework for evaluating database keyword search strategies

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Accessing semi-structured databases: a survey

Proceedings of the 1st International Conference on Intelligent Semantic Web-Services and Applications
Efficient continuous top-k keyword search in relational databases

WAIM'10 Proceedings of the 11th international conference on Web-age information management
An effective 3-in-1 keyword search method over heterogeneous data sources

Information Systems
Semantic-distance based evaluation of ranking queries over relational databases

Journal of Intelligent Information Systems
Ten thousand SQLs: parallel keyword queries computing

Proceedings of the VLDB Endowment
Toward scalable keyword search over relational data

Proceedings of the VLDB Endowment
A novel keyword search paradigm in relational databases: Object summaries

Data & Knowledge Engineering
XRCJ: supporting keyword search in XML and relation co-occurrence

WAIM'10 Proceedings of the 2010 international conference on Web-age information management
Scalable keyword search on large data streams

The VLDB Journal — The International Journal on Very Large Data Bases
Providing built-in keyword search capabilities in RDBMS

The VLDB Journal — The International Journal on Very Large Data Bases
Context-sensitive document ranking

Journal of Computer Science and Technology
Facet discovery for structured web search: a query-log mining approach

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Sharing work in keyword search over databases

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Exploiting correlation to rank database query results

DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications: Part II
A path-oriented RDF index for keyword search query processing

DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part II
Efficient fuzzy full-text type-ahead search

The VLDB Journal — The International Journal on Very Large Data Bases
Keyword query cleaning with query logs

WAIM'11 Proceedings of the 12th international conference on Web-age information management
Automatically generating structured queries in XML keyword search

INEX'10 Proceedings of the 9th international conference on Initiative for the evaluation of XML retrieval: comparative evaluation of focused retrieval
Keyword-Driven SPARQL Query Generation Leveraging Background Knowledge

WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
DC proposal: automatically transforming keyword queries to SPARQL on large-scale knowledge bases

ISWC'11 Proceedings of the 10th international conference on The semantic web - Volume Part II
Finding relevant information of certain types from enterprise data

Proceedings of the 20th ACM international conference on Information and knowledge management
Index structures and top-k join algorithms for native keyword search databases

Proceedings of the 20th ACM international conference on Information and knowledge management
Ranking support for keyword search on structured data using relevance models

Proceedings of the 20th ACM international conference on Information and knowledge management
Learning to rank results in relational keyword search

Proceedings of the 20th ACM international conference on Information and knowledge management
Database and information retrieval techniques for XML

ASIAN'05 Proceedings of the 10th Asian Computing Science conference on Advances in computer science: data management on the web
An approach towards automatic workflow composition through information retrieval

Proceedings of the 15th Symposium on International Database Engineering & Applications
Size-l object summaries for relational keyword search

Proceedings of the VLDB Endowment
REX: explaining relationships between entity pairs

Proceedings of the VLDB Endowment
PreCN: preprocessing candidate networks for efficient keyword search over databases

WISE'06 Proceedings of the 7th international conference on Web Information Systems
An enhanced search interface for information discovery from digital libraries

ECDL'06 Proceedings of the 10th European conference on Research and Advanced Technology for Digital Libraries
Expressiveness and performance of full-text search languages

EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology
Si-SEEKER: ontology-based semantic search over databases

KSEM'06 Proceedings of the First international conference on Knowledge Science, Engineering and Management
Language models for keyword search over data graphs

Proceedings of the fifth ACM international conference on Web search and data mining
Comprehensible answers to précis queries

CAiSE'06 Proceedings of the 18th international conference on Advanced Information Systems Engineering
TreeCluster: clustering results of keyword search over databases

WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
Logical data independence reconsidered (extended abstract)

ISMIS'05 Proceedings of the 15th international conference on Foundations of Intelligent Systems
Combining query translation with query answering for efficient keyword search

ESWC'10 Proceedings of the 7th international conference on The Semantic Web: research and Applications - Volume Part II
Usability of keyword-driven schema-agnostic search: a comparative study of keyword search, faceted search, query completion and result completion

ESWC'10 Proceedings of the 7th international conference on The Semantic Web: research and Applications - Volume Part II
Auto-completion of underspecified SQL queries

ER'06 Proceedings of the 25th international conference on Conceptual Modeling
Pattern-based query answering

EDBT'06 Proceedings of the 2006 international conference on Current Trends in Database Technology
Interactive predicate suggestion for keyword search on RDF graphs

ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II
ColumbuScout: towards building local search engines over large databases

SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
iSearch: an interpretation based framework for keyword search in relational databases

KEYS '12 Proceedings of the Third International Workshop on Keyword Search on Structured Data
STRUCT: incorporating contextual information for English query search on relational databases

KEYS '12 Proceedings of the Third International Workshop on Keyword Search on Structured Data
KESOSD: keyword search over structured data

KEYS '12 Proceedings of the Third International Workshop on Keyword Search on Structured Data
Scalable top-k keyword search in relational databases

DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part II
Pay-as-You-Go ranking of schema mappings using query logs

DILS'12 Proceedings of the 8th international conference on Data Integration in the Life Sciences
Ranking the answer trees of graph search by both structure and content

Proceedings of the 1st Joint International Workshop on Entity-Oriented and Semantic Search
3SEPIAS: A Semi-Structured Search Engine for Personal Information in dAtaspace System

Information Sciences: an International Journal
Interpreting keyword queries over web knowledge bases

Proceedings of the 21st ACM international conference on Information and knowledge management
Predicting the effectiveness of keyword queries on databases

Proceedings of the 21st ACM international conference on Information and knowledge management
Pragmatic correlation analysis for probabilistic ranking over relational data

Expert Systems with Applications: An International Journal
Database Keyword Search: A Perspective from Optimization

WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
Efficient query construction for large scale data

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Retrieving documents with mathematical content

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Efficient parsing-based search over structured data

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Answering Top-k Keyword Queries on Relational Databases

International Journal of Information Retrieval Research
k-nearest keyword search in RDF graphs

Web Semantics: Science, Services and Agents on the World Wide Web
Probabilistic query rewriting for efficient and effective keyword search on graph data

Proceedings of the VLDB Endowment
Supporting keyword search in product database: a probabilistic approach

Proceedings of the VLDB Endowment
Generating SPARQL queries using templates

Web Intelligence and Agent Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Applications in which plain text coexists with structured data are pervasive. Commercial relational database management systems (RDBMSs) generally provide querying capabilities for text attributes that incorporate state-of-the-art information retrieval (IR) relevance ranking strategies, but this search functionality requires that queries specify the exact column or columns against which a given list of keywords is to be matched. This requirement can be cumbersome and inflexible from a user perspective: good answers to a keyword query might need to be "assembled" -in perhaps unforeseen ways- by joining tuples from multiple relations. This observation has motivated recent research on free-form keyword search over RDBMSs. In this paper, we adapt IR-style document-relevance ranking strategies to the problem of processing free-form keyword queries over RDBMSs. Our query model can handle queries with both AND and OR semantics, and exploits the sophisticated single-column text-search functionality often available in commercial RDBMSs. We develop query-processing strategies that build on a crucial characteristic of IR-style keyword search: only the few most relevant matches -according to some definition of "relevance"- are generally of interest. Consequently, rather than computing all matches for a keyword query, which leads to inefficient executions, our techniques focus on the top-k matches for the query, for moderate values of k. A thorough experimental evaluation over real data shows the performance advantages of our approach.