Identifying Story and Preview Images in News Web Pages

  • Authors:
  • Jianying Hu; Amit Bagga

  • Affiliations:
  • -;-

  • Venue:
  • ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 2
  • Year:
  • 2003

Abstract

The World Wide Web provides an increasingly powerful and popular publication mechanism. Web documents often contain a large number of images serving various different purposes. This paper focuses on images that are associated with a story or preview to a story. Such images often accompany the key content on a web page, thus their identification is important for applications such as web page summarization and mobile access. We present a novel algorithm for automatic identification of story/preview images which combines features extracted from both the image itself and the surrounding text. The effectiveness of this algorithm is demonstrated by experimental results on over 1500 images collected from 25 news web sites.