Tablerank: a ranking algorithm for table search and retrieval

  • Authors:
  • Ying Liu;Kun Bai;Prasenjit Mitra;C. Lee Giles

  • Affiliations:
  • College of Information Sciences & Technology, Pennsylvania State University, University Park, PA;College of Information Sciences & Technology, Pennsylvania State University, University Park, PA;College of Information Sciences & Technology, Pennsylvania State University, University Park, PA;College of Information Sciences & Technology, Pennsylvania State University, University Park, PA

  • Venue:
  • AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Tables are ubiquitous in web pages and scientific documents. With the explosive development of the web, tables have become a valuable information repository. Therefore, effectively and efficiently searching tables becomes a challenge. Existing search engines do not provide satisfactory search results largely because the current ranking schemes are inadequate for table search and automatic table understanding and extraction are rather difficult in general. In this work, we design and evaluate a novel table ranking algorithm-TableRank to improve the performance of our table search engine Table-Seer. Given a keyword based table query, TableRank facilities TableSeer to return the most relevant tables by tailoring the classic vector space model. TableRank adopts an innovative term weighting scheme by aggregating multiple weighting factors from three levels: term, table and document. The experimental results show that our table search engine out-performs existing search engines on table search. In addition, incorporating multiple weighting factors can significantly improve the ranking results.