The data warehouse toolkit: practical techniques for building dimensional data warehouses
The data warehouse toolkit: practical techniques for building dimensional data warehouses
Developing time-oriented database applications in SQL
Developing time-oriented database applications in SQL
The SDSS skyserver: public access to the sloan digital sky server data
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
An Approach to Integrating Query Refinement in SQL
EDBT '02 Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology
DBXplorer: A System for Keyword-Based Search over Relational Databases
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Keyword Searching and Browsing in Databases using BANKS
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Effective keyword search in relational databases
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
BLINKS: ranked keyword searches on graphs
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Discover: keyword search in relational databases
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
NaLIX: A generic natural language search environment for XML data
ACM Transactions on Database Systems (TODS)
Précis: from unstructured keywords as queries to structured databases as answers
The VLDB Journal — The International Journal on Very Large Data Bases
SQAK: doing more with keywords
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Do we mean the same?: disambiguation of extracted keyword queries for database search
Proceedings of the First International Workshop on Keyword Search on Structured Data
Keyword search in databases: the power of RDBMS
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Summarizing relational databases
Proceedings of the VLDB Endowment
SnipSuggest: context-aware autocompletion for SQL
Proceedings of the VLDB Endowment
Evaluating evidences for keyword query disambiguation in entity centric database search
DEXA'10 Proceedings of the 21st international conference on Database and expert systems applications: Part II
Keyword search over relational databases: a metadata approach
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Processing keyword search on XML: a survey
World Wide Web
Data-thirsty business analysts need SODA: search over data warehouse
Proceedings of the 20th ACM international conference on Information and knowledge management
The Credit Suisse Meta-data Warehouse
ICDE '12 Proceedings of the 2012 IEEE 28th International Conference on Data Engineering
Reverse engineering complex join queries
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Hi-index | 0.00 |
The purpose of data warehouses is to enable business analysts to make better decisions. Over the years the technology has matured and data warehouses have become extremely successful. As a consequence, more and more data has been added to the data warehouses and their schemas have become increasingly complex. These systems still work great in order to generate pre-canned reports. However, with their current complexity, they tend to be a poor match for non tech-savvy business analysts who need answers to ad-hoc queries that were not anticipated. This paper describes the design, implementation, and experience of the SODA system (Search over DAta Warehouse). SODA bridges the gap between the business needs of analysts and the technical complexity of current data warehouses. SODA enables a Google-like search experience for data warehouses by taking keyword queries of business users and automatically generating executable SQL. The key idea is to use a graph pattern matching algorithm that uses the metadata model of the data warehouse. Our results with real data from a global player in the financial services industry show that SODA produces queries with high precision and recall, and makes it much easier for business users to interactively explore highly-complex data warehouses.