We propose a dynamic faceted search system for discovery-driven analysis of data that has both textual content and structured attributes. Given a keyword query, the system dynamically selects a small set of "interesting" attributes and presents aggregates on them to the user. As in prior work on OLAP exploration, we define "interestingness" as how surprising an aggregated value is relative to a given expectation. We make two new contributions: a novel "navigational" expectation that is particularly useful in the context of faceted search, and a novel interestingness measure based on a judicious application of p-values. A user survey indicates that the new expectation and interestingness measure are quite effective. We develop an efficient dynamic faceted search system by improving a popular open source engine, Solr. Our system uses compressed bitmaps to cache the posting lists of an inverted index, and a novel directory structure called a bitset tree for fast bitset intersection. A comprehensive experimental study on large real data sets shows that our engine performs 2 to 3 times faster than Solr.
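To make the idea of p-value-based interestingness concrete, the following is a minimal sketch, not the measure defined in the paper. It assumes that each facet value comes with an expected rate (supplied by whatever expectation is in use) and scores the observed count in the query result with an exact binomial tail test. The function names, the choice of a binomial model, and the two-sided correction are all illustrative assumptions.

```python
from math import comb


def binomial_pmf(i: int, n: int, p: float) -> float:
    """Probability of exactly i successes in n trials with success rate p."""
    return comb(n, i) * p**i * (1 - p) ** (n - i)


def surprise_p_value(observed: int, result_size: int, expected_rate: float) -> float:
    """Two-sided p-value for an observed facet count (assumed binomial model).

    A small p-value means the count deviates strongly from the expectation,
    in either direction, and is therefore treated as more "interesting".
    """
    upper = sum(binomial_pmf(i, result_size, expected_rate)
                for i in range(observed, result_size + 1))
    lower = sum(binomial_pmf(i, result_size, expected_rate)
                for i in range(0, observed + 1))
    # Double the smaller tail and cap at 1 (a common two-sided convention).
    return min(1.0, 2 * min(upper, lower))


def rank_facet_values(observed_counts: dict, expected_rates: dict,
                      result_size: int) -> list:
    """Return (facet value, p-value) pairs, most surprising first."""
    scored = [
        (value, surprise_p_value(count, result_size, expected_rates[value]))
        for value, count in observed_counts.items()
    ]
    return sorted(scored, key=lambda pair: pair[1])
```

For example, calling `rank_facet_values({"2008": 30, "2009": 11}, {"2008": 0.10, "2009": 0.10}, 100)` places "2008" first, because 30 hits out of 100 is far from the expected 10, while 11 hits is not. Using a p-value rather than a raw difference means a deviation is weighed against the size of the result set, so small result sets do not produce spurious "surprises".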