A deep-learning model-based and data-driven hybrid architecture for image annotation

  • Authors:
  • Zhiyu Wang;Dingyin Xia;Edward Y. Chang

  • Affiliations:
  • Google Inc., Beijing, China;Google Inc., Beijing, China;Google Inc., Beijing, China

  • Venue:
  • Proceedings of the international workshop on Very-large-scale multimedia corpus, mining and retrieval
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Does adding more training data always help improve the effectiveness of a machine-learning or pattern-recognition task? Recent evidences in machine translation and speech recognition seem to suggest that the data-driven approach outperforms the traditional model-based approach. Instead of carefully modeling rules and their exceptions, the data-driven approach relies on identifying similar patterns in massive datasets and then uses the similar patterns to predict the labels (or other outcomes) of unseen instances. In this work, we compare representative data-driven and model-based schemes on an image annotation task. We enumerate pros and cons of these two approaches, and propose a hybrid approach, which can harness the strengths of the two.