CMIC at INEX 2007: Book Search Track

  • Authors:
  • Walid Magdy;Kareem Darwish

  • Affiliations:
  • Cairo Microsoft Innovation Center, Abou Rawash, Egypt;Cairo Microsoft Innovation Center, Abou Rawash, Egypt

  • Venue:
  • Focused Access to XML Documents
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

With massive book digitization efforts underway, the need for effective retrieval of books and pages in books is an important problem. This paper describes our submissions to the INEX 2007 Book Search track. We explored using book specific features such as table of content and index pages and headers along with non-book specific features. Our results show that indexing the entire contents of books and headers provided the most effective retrieval strategy.