Range query estimation with data skewness for top-k retrieval

  • Authors:
  • Anteneh Ayanso;Paulo B. Goes;Kumar Mehta

  • Affiliations:
  • Department of Finance, Operations, and Information Systems, Goodman School of Business, Brock University, 500 Glenridge Avenue, St. Catharines, ON L2S 3A1, Canada;Department of Management Information Systems, Eller College of Management, University of Arizona, 1130 E. Helen Street, Tucson, AZ 85721, USA;Department of Decision Science and MIS, School of Management, George Mason University, 4400 University Drive, Fairfax, VA 22030, USA

  • Venue:
  • Decision Support Systems
  • Year:
  • 2014

Quantified Score

Hi-index 0.00

Visualization

Abstract

Top-k querying can significantly improve the performance of web-based business intelligence applications such as price comparison and product recommendation systems. Top-k retrieval involves finding a limited number of records in a relational database that are most similar to user-specified attribute-value pairs. This paper extends the cost-based query-mapping method for top-k retrieval by incorporating data skewness in range estimation. Experiments on real world and synthetic multi-attribute data sets show that incorporating data skewness provides a robust performance across different types of data sets, query sets, distance functions, and histograms.