Identifying persons in news article images based on textual analysis

  • Authors:
  • Choochart Haruechaiyasak;Chaianun Damrongrat

  • Affiliations:
  • Human Language Technology Laboratory, National Electronics and Computer Technology Center, Pathumthani, Thailand;Human Language Technology Laboratory, National Electronics and Computer Technology Center, Pathumthani, Thailand

  • Venue:
  • ICADL'10 Proceedings of the role of digital libraries in a time of global change, and 12th international conference on Asia-Pacific digital libraries
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

A large portion of news articles contains images of persons whose names appear in the news stories. To provide image search of persons, most search engines construct an index from textual descriptions (such as headline and caption) of images. The index search approach, although very simple and scalable, has one serious drawback. A query of a person name could match some news articles which do not contain images of the target person. Therefore, some irrelevant images could be returned as search results. Our main goal is to improve the performance of the index search approach based on the syntactic analysis of person name entities in the news articles. Given sentences containing person names, we construct a set of syntactic rules for identifying persons in news images. The set of syntactic rules is used to filter out images of non-target persons from the results returned by the index search. From the experimental results, our approach improved the performance over the basic index search by 10% based on the F1-measure.