Applying a lightweight iterative merging chinese segmentation in web image annotation

  • Authors:
  • Chuen-Min Huang;Yen-Jia Chang

  • Affiliations:
  • Department of Information Management, National Yunlin University of Science & Technology, Taiwan, R.O.C.;Department of Information Management, National Yunlin University of Science & Technology, Taiwan, R.O.C.

  • Venue:
  • MLDM'13 Proceedings of the 9th international conference on Machine Learning and Data Mining in Pattern Recognition
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Traditional CBIR method relies on visual features to identify objects in an image and uses predefined terms to annotate images, thus it fails to depict the implicit meanings. Recent textual content analysis methods applied to image annotation were blamed for their complexity of computation. In this research, we propose a corpus-free, relatively light computation of term segmentation method, namely "Iterative Merging Chinese Segmentation (IMCS) ," to identify representative terms from a single web page to obtain anecdotes as a semantic enrichment of the target image. It requires minimum computation needs that allows to share characters/words and facilitate their use at fine granularities without prohibitive cost. In the experiment, this method achieves a precision rate of 86.02%, and gains acceptance from expert rating and user rating of 75% and 68%, respectively. In performance testing, it only takes 0.006 second to process each image in a collection of 1,728 testing data set.