Keyword proximity search in complex data graphs

Authors:
Konstantin Golenberg;Benny Kimelfeld;Yehoshua Sagiv
Affiliations:
The Hebrew University of Jerusalem, Jerusalem, Israel;The Hebrew University of Jerusalem, Jerusalem, Israel;The Hebrew University of Jerusalem, Jerusalem, Israel
Venue:
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Year:
2008

Abstract

In keyword search over data graphs, an answer is a nonredundant subtree that includes the given keywords. An algorithm for enumerating answers is presented within an architecture that has two main components: an engine that generates a set of candidate answers and a ranker that evaluates their scores. To be effective, the engine must have three fundamental properties: it should not miss relevant answers, it has to be efficient, and it must generate the answers in an order that is highly correlated with the desired ranking. It is shown that none of the existing systems has implemented an engine that has all of these properties. In contrast, this paper presents an engine that generates all the answers with provable guarantees. Experiments show that the engine performs well in practice. It is also shown how to adapt this engine to queries under the OR semantics. In addition, this paper presents a novel approach for implementing rankers designed to eliminate redundancy. Essentially, an answer is ranked according to its individual properties (relevancy) and its intersection with the answers that have already been presented to the user. Within this approach, experiments with specific rankers are described.
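
The redundancy-aware ranking idea in the abstract can be illustrated with a small sketch. This is not the paper's ranker: the `Answer` class, the Jaccard overlap measure, the `penalty` weight, and the greedy selection loop are all assumptions chosen for illustration. The sketch only shows how a score might combine an answer's own relevance with its intersection with answers already presented.

```python
from dataclasses import dataclass
from typing import FrozenSet, List


@dataclass(frozen=True)
class Answer:
    """Hypothetical candidate answer: node ids of a subtree plus a relevance score."""
    nodes: FrozenSet[str]
    relevance: float


def overlap(a: Answer, b: Answer) -> float:
    """Jaccard overlap of the two node sets (an assumed redundancy measure)."""
    union = a.nodes | b.nodes
    return len(a.nodes & b.nodes) / len(union) if union else 0.0


def rerank(candidates: List[Answer], penalty: float = 0.5) -> List[Answer]:
    """Greedily order candidates by relevance minus a redundancy penalty.

    At each step, a candidate's adjusted score is its relevance minus
    `penalty` times its largest overlap with any answer already presented.
    """
    remaining = list(candidates)
    presented: List[Answer] = []
    while remaining:
        def adjusted(ans: Answer) -> float:
            redundancy = max((overlap(ans, p) for p in presented), default=0.0)
            return ans.relevance - penalty * redundancy

        best = max(remaining, key=adjusted)
        remaining.remove(best)
        presented.append(best)
    return presented


# Usage: b is highly relevant but nearly duplicates a, so c is shown before b.
a = Answer(frozenset({"n1", "n2", "n3"}), 1.0)
b = Answer(frozenset({"n1", "n2", "n3", "n4"}), 0.9)
c = Answer(frozenset({"n7", "n8"}), 0.7)
ordered = rerank([a, b, c], penalty=0.5)
```

With `penalty=0.0` the sketch falls back to ordering by relevance alone, which makes the effect of the redundancy term easy to compare.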